1 / 90

An introduction to large sparse graphs

An introduction to large sparse graphs. Fan Chung Graham UC San Diego. Yahoo IM graph Reid Andersen 2005. edge. vertex. A graph G = ( V, E ). Vertices cities. Edges flights. Graph models. _____________________________. Vertices cities people.

carol
Télécharger la présentation

An introduction to large sparse graphs

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. An introduction to large sparse graphs Fan Chung Graham UC San Diego

  2. Yahoo IM graph Reid Andersen2005

  3. edge vertex A graph G = (V,E)

  4. Vertices cities Edges flights Graph models _____________________________

  5. Vertices cities people Edges flights pairs of friends Graph models _____________________________

  6. Vertices cities people telephones Edges flights pairs of friends phone calls Graph models _____________________________

  7. Vertices cities people telephones web pages genes Edges flights pairs of friends phone calls linkings regulatory effects Graph models _____________________________

  8. Graph Theory has 250 years of history. Leonhard Euler, 1707-1783

  9. Geometric graphs

  10. Geometric graphs

  11. Geometric graphs

  12. Geometric graphs

  13. Geometric graphs Algebraic graphs

  14. Geometric graphs Algebraic graphs General graphs By Bill Cheswick

  15. Geometric graphs Algebraic graphs Real graphs (router graph)

  16. Geometric graphs Algebraic graphs Real graphs (protein interactions by Jawoong Jeong)

  17. Geometric graphs Algebraic graphs Real graphs (Web Cashe map by Bradley Huffaker)

  18. Geometric graphs Algebraic graphs Real graphs Collaboration graph

  19. SDSC, skitter (July 1998)

  20. Massive data Massive graphs

  21. Massive data Massive graphs The information we deal with is taking on a networked character.

  22. Massive data Massive graphs • WWW-graphs • a. Bill Cheswick

  23. Massive data Massive graphs • WWW-graphs • Call graphs • Acquaintance graphs a. Part of the collaboration graph

  24. Massive data Massive graphs • WWW-graphs • Call graphs • Acquaintance graphs • Graphs from any data a.base

  25. What does a massive graph look like?

  26. What does a massive graph look like? sparse clustered small diameter

  27. What does a massive graph look like? sparse clustered small diameter prohibitively large dynamically changing incomplete information

  28. What does a massive graph look like? sparse clustered small diameter prohibitively large dynamically changing incomplete information Hard to describe ! Harder to analyze !!

  29. Some prevailing characteristic of large realistic networks • Small world phenomenon Small diameter/average distance Clustering • Power law degree distribution

  30. A crucial observation Discovered by several groups independently. • Barabási, Albert and Jeung, 1999. • Broder, Kleinberg, Kumar, Raghavan, Rajagopalan aaand Tomkins, 1999. • M Faloutsos, P. Faloutsos and C. Faloutsos, 1999. • Abello, Buchsbaum, Reeds and Westbrook, 1999. • Aiello, Chung and Lu, 1999. Massive graphs satisfy the power law.

  31. Massive graphs satisfy the power law. Power decay degree distribution. The degree sequences satisfy thepower law: The number of vertices of degree j is proportional to j- where  is some constant 2.

  32. A random graph model for a power law graphs Two parameters:  and  P(,) : a random power law graph.  : log size  : log log growth rate

  33. A random graph model for a power law graphs Two parameters:  and  P(,) : a random power law graph. P(,) assigns uniform probability to all graphs satisfying: log y  -  log x where y = the number of vertices with degree x.

  34. Comparisons From real data From simulation

  35. The history of power law • Zipf’s law, 1949. (The nth most frequent word occurs at rate 1/n) • Yule’s law, 1942. • Lotka’s law, 1926. (distribution of authors in chemical abstracts) • Pareto, 1897 (Wealth distribution follows a power law.) (City population follows a power law.) 1907-1916 Natural language Bibliometrics Social sciences Nature

  36. Examples of power law • Inter • Internet graphs. • Call graphs. • Collaboration graphs. • Acquaintance graphs. • Language usage

  37. Faloutsos et al ‘99

  38. A subgraph of a BGP graph

  39. Another subgraph of a BGP graph

  40. SDSC, skitter (July 1998)

  41. Degree distribution of Call Graphs

  42. The collaboration graph is a power law graph, based on data from Math Review with 337451 authors with power 2.55

  43. Collaboration graph (Math Review) • 337,000 authors • 496,000 edges • Average 5.65 collaborations per person • Average 2.94 collaborators per person • Maximum degree 1401 • A giant component of size 208,000 • 84,000 isolated vertices

  44. Collaboration graph (Math Review) • 337,000 authors • 496,000 edges • Average 5.65 collaborations per person • Average 2.94 collaborators per person • Maximum degree 1401. • A giant component of size 208,000 • 84,000 isolated vertices Guess who?

  45. An induced subgraph of the collaboration graph with authors of Erdös number ≤ 2.

  46. A subgraph of the Hollywood graph.

  47. Ocurrences of words in TIME magazine articles 245412 terms.

  48. Occurrences of words in WSJ Collection, a 131.6 MB collection of 46449 newspaper articles aaaa (19 million terms). Top 50 terms are included here.

More Related