1.29k likes | 1.42k Vues
This presentation explores the intricacies of large real-world networks, focusing on identifying key patterns such as power laws, evolution laws, and clustering behaviors. With contributions from researchers like Christos Faloutsos and others, it addresses critical questions regarding the nature of graphs, virus propagation, and community detection. By analyzing network structures—from internet topology to food webs and friendship networks—this work sheds light on why these patterns occur and how they evolve over time, providing foundational insights for future research in network science.
E N D
Finding patterns in large, real networks Christos Faloutsos CMU
Thanks to • Deepayan Chakrabarti (CMU/Yahoo) • Michalis Faloutsos (UCR) • Spiros Papadimitriou (CMU/IBM) • George Siganos (UCR) C. Faloutsos
Introduction Protein Interactions [genomebiology.com] Internet Map [lumeta.com] Food Web [Martinez ’91] Graphs are everywhere! Friendship Network [Moody ’01] C. Faloutsos
Outline • Laws • static • time-evolution laws • why so many power laws? • tools/results • future steps C. Faloutsos
Motivating questions • Q1: What do real graphs look like? • Q2: How to generate realistic graphs? C. Faloutsos
Virus propagation • Q3: How do viruses/rumors propagate? • Q4: Who is the best person/computer to immunize against a virus? C. Faloutsos
Graph clustering & mining • Q5: which edges/nodes are ‘abnormal’? • Q6: split a graph in k ‘natural’ communities - but how to determine k? C. Faloutsos
Outline • Laws • static • time-evolution laws • why so many power laws? • tools/results • future steps C. Faloutsos
Topology Q1: How does the Internet look like? Any rules? (Looks random – right?) C. Faloutsos
Laws – degree distributions • Q: avg degree is ~2 - what is the most probable degree? count ?? degree 2 C. Faloutsos
WRONG ! count ?? count degree 2 2 Laws – degree distributions • Q: avg degree is ~2 - what is the most probable degree? degree C. Faloutsos
I.Power-law: outdegree O The plot is linear in log-log scale [FFF’99] freq = degree (-2.15) Frequency Exponent = slope O = -2.15 -2.15 Nov’97 Outdegree C. Faloutsos
-0.82 att.com log(degree) ibm.com log(rank) II.Power-law: rank R • The plot is a line in log-log scale April’98 R Rank: nodes in decreasing degree order C. Faloutsos
III.Power-law: eigen E Eigenvalue • Eigenvalues in decreasing order (first 20) • [Mihail+, 02]: R = 2 * E Exponent = slope E = -0.48 Dec’98 Rank of decreasing eigenvalue C. Faloutsos
IV. The Node Neighborhood • N(h) = # of pairs of nodes within h hops C. Faloutsos
IV. The Node Neighborhood • Q: average degree = 3 - how many neighbors should I expect within 1,2,… h hops? • Potential answer: 1 hop -> 3 neighbors 2 hops -> 3 * 3 … h hops -> 3h C. Faloutsos
IV. The Node Neighborhood • Q: average degree = 3 - how many neighbors should I expect within 1,2,… h hops? • Potential answer: 1 hop -> 3 neighbors 2 hops -> 3 * 3 … h hops -> 3h WRONG! WE HAVE DUPLICATES! C. Faloutsos
IV. The Node Neighborhood • Q: average degree = 3 - how many neighbors should I expect within 1,2,… h hops? • Potential answer: 1 hop -> 3 neighbors 2 hops -> 3 * 3 … h hops -> 3h WRONG x 2! ‘avg’ degree: meaningless! C. Faloutsos
IV. Power-law: hopplot H H = 2.83 # of Pairs H = 4.86 # of Pairs Pairs of nodes as a function of hops N(h)= hH Hops Router level ’95 Dec 98 Hops C. Faloutsos
... Observation • Q: Intuition behind ‘hop exponent’? • A: ‘intrinsic=fractal dimensionality’ of the network N(h) ~ h1 N(h) ~ h2 C. Faloutsos
Hop plots • More on fractal/intrinsic dimensionalities: very soon C. Faloutsos
Any other ‘laws’? Yes! C. Faloutsos
Any other ‘laws’? Yes! • Small diameter • six degrees of separation / ‘Kevin Bacon’ • small worlds [Watts and Strogatz] C. Faloutsos
Any other ‘laws’? • Bow-tie, for the web [Kumar+ ‘99] • IN, SCC, OUT, ‘tendrils’ • disconnected components C. Faloutsos
Any other ‘laws’? • power-laws in communities (bi-partite cores) [Kumar+, ‘99] Log(count) n:1 n:3 n:2 2:3 core (m:n core) Log(m) C. Faloutsos
More Power laws • Also hold for other web graphs [Barabasi+, ‘99], [Kumar+, ‘99] • citation graphs (see later) • and many more C. Faloutsos
Outline • Laws • static • time-evolution laws • why so many power laws? • tools/results • future steps C. Faloutsos
How do graphs evolve? C. Faloutsos
Time Evolution: rank R Domain level #days since Nov. ‘97 The rank exponent has not changed! [Siganos+, ‘03] C. Faloutsos
How do graphs evolve? • degree-exponent seems constant - anything else? C. Faloutsos
Evolution of diameter? • Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(N))) • i.e.., slowly increasing with network size • Q: What is happening, in reality? C. Faloutsos
Evolution of diameter? • Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(N))) • i.e.., slowly increasing with network size • Q: What is happening, in reality? • A: It shrinks(!!), towards a constant value C. Faloutsos
Shrinking diameter [Leskovec+05a] • Citations among physics papers • 11yrs; @ 2003: • 29,555 papers • 352,807 citations • For each month M, create a graph of all citations up to month M C. Faloutsos
Shrinking diameter • Authors & publications • 1992 • 318 nodes • 272 edges • 2002 • 60,000 nodes • 20,000 authors • 38,000 papers • 133,000 edges C. Faloutsos
Shrinking diameter • Patents & citations • 1975 • 334,000 nodes • 676,000 edges • 1999 • 2.9 million nodes • 16.5 million edges • Each year is a datapoint C. Faloutsos
Shrinking diameter • Autonomous systems • 1997 • 3,000 nodes • 10,000 edges • 2000 • 6,000 nodes • 26,000 edges • One graph per day C. Faloutsos
Temporal evolution of graphs • N(t) nodes; E(t) edges at time t • suppose that N(t+1) = 2 * N(t) • Q: what is your guess for E(t+1) =? 2 * E(t) C. Faloutsos
Temporal evolution of graphs • N(t) nodes; E(t) edges at time t • suppose that N(t+1) = 2 * N(t) • Q: what is your guess for E(t+1) =? 2 * E(t) • A: over-doubled! x C. Faloutsos
Temporal evolution of graphs • A: over-doubled - but obeying: E(t) ~ N(t)a for all t where 1<a<2 C. Faloutsos
Temporal evolution of graphs • A: over-doubled - but obeying: E(t) ~ N(t)a for all t • Identically: log(E(t)) / log(N(t)) = constant for all t C. Faloutsos
E(t) N(t) Densification Power Law ArXiv: Physics papers and their citations 1.69 C. Faloutsos
E(t) N(t) Densification Power Law ArXiv: Physics papers and their citations 1 1.69 ‘tree’ C. Faloutsos
E(t) N(t) Densification Power Law ArXiv: Physics papers and their citations ‘clique’ 2 1.69 C. Faloutsos
E(t) N(t) Densification Power Law U.S. Patents, citing each other 1.66 C. Faloutsos
E(t) N(t) Densification Power Law Autonomous Systems 1.18 C. Faloutsos
E(t) N(t) Densification Power Law ArXiv: authors & papers 1.15 C. Faloutsos
Summary of ‘laws’ • Power laws for degree distributions • …………... for eigenvalues, bi-partite cores • Small & shrinking diameter (‘6 degrees’) • ‘Bow-tie’ for web; ‘jelly-fish’ for internet • ``Densification Power Law’’, over time C. Faloutsos
Outline • Laws • static • time-evolution laws • why so many power laws? • other power laws • fractals & self-similarity • case study: Kronecker graphs • tools/results • future steps C. Faloutsos
Power laws • Many more power laws, in diverse settings: C. Faloutsos
A famous power law: Zipf’s law log(freq) “a” • Bible - rank vs frequency (log-log) “the” log(rank) C. Faloutsos