1 / 25

Milena Mihail mihail@cc.gatech.edu

Discussion Topic:. Models for Social Networks. Web Science Tea Feb 29, 08. Milena Mihail mihail@cc.gatech.edu. NSF : CDI. Elsewhere :. Yahoo: Raghavan WWW06 Brachman GT talk. Microsoft :

fredrica
Télécharger la présentation

Milena Mihail mihail@cc.gatech.edu

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Discussion Topic: Models for Social Networks Web Science Tea Feb 29, 08 Milena Mihail mihail@cc.gatech.edu

  2. NSF : CDI Elsewhere : Yahoo: Raghavan WWW06 Brachman GT talk Microsoft: New Cambridge Lab Jennifer Chayes Our non grassroots discussions : Super-Duper Data Center, ala Jeanette Wing Should revisit this point, in view of NSF-Google-IBM ? Chris Klaus GT talk PREAMBLE What is Web Science ? Our grassroots discussions : Includes some intersection of comp sci, economics, social sci.

  3. Parenthesis: MSN SemGrail 07 What is Web Science ? The study of the WWW, broadly defined. By virtue of the pervasiveness of the object of study. Systems-like science (like chemistry or biology). As opposed to “computer science” which is the study of “computation”, biology is the study of “life” from the cell to evolution to animals…. Should be studied in terms of its descriptive/predictive/explanatory/prescriptive analytic value.

  4. Parenthesis: MSN SemGrail 07 Why should there be Web Science ? Encourage collaboration across different areas. Something between the union and intersection of several areas. Need to establish common vocabulary, goals, problems. “Understanding the elephant versus the tail trunk”. Educate students for industry. Encourage academia to understand the study of the Web as a discipline.

  5. Parenthesis: MSN SemGrail 07 Themes cutting across subareas of Web science Long Tails / Economics / Culture Fractal Nature, multi-scale Dynamics, emergent systems, social networks Requires new analytics (eg what are right logics, probabilistic and approximation metrics) Humans and machines interact and interactions registered. New dimension in social sciences. Transformed way we think about information (analogy to introduction of printing press). Democracy of information, producers and consumers of information coincide.

  6. Models for Social Networks (in this spirit) PREAMBLE What is Web Science ? Outline: Wide Range of Models Canonical Example: Modeling Small World Phenomenon Model Parameters/Metrics and their Relevance Models : Structural Explanatory (Optimization or Incentive Driven) Hybrid Which question are you (am I) trying to answer? Our grassroots discussions : Includes some intersection of comp sci, economics, social sci.

  7. (nice pictures with some meaning) Range of Models Internet (general) Routing Internet AS Level Routing Level Sparse Power Law Graphs with very different assortativity few long links in a flat world

  8. Range of Models (nice pictures with some meaning) Patent / co-author network in Boston area notice bottleneck bad cut Flickr social network from Flickr search keyword “graph” notice no botlleneck bad cut

  9. ( Range of Flickr Pictures - meaning ? ) Technology Platforms Local Facebook Friendship Graph A Wep Page Organization 4 Color Theorem

  10. Range of Models Biological Networks with unclear meaning, but make front page of Nature/Science/PNAS

  11. Range of Models (nice pictures with no meaning)

  12. Range of Mathematical Models Rick Durrett, Cornell, Probabilist n Matthew Jackson, Staford, Economist

  13. Canonical Example: Modeling the Small World Phenomenon Clustering and Small Diameter Milgram’s Experiment 60’s : Even though relationships are highly clustered, most people are pairwise reachable via short paths, “Six Degrees of Separation” (for fun, see also Facebook group) Strogatz&Watt’s Model 80’s: In a clustered graph of size n, a few random links decrease the diameter to logn. Kleinberg 90’s: Navigability ! These short paths can be found efficiently with local search!

  14. Are there natural network models which are navigable and have, eg, power-law degree distributions ? Are there natural models where the threshold is not sharp ? Kleinberg’s navigability model Theorem: The only value for which the network is navigable isr =2. 14

  15. Model Parameters/Metrics (as a function of n) and their Relevance Important to have FLEXIBLE network models eg in Prediction / Simulation economics engineering Average degree and Degree distribution Clustering coefficient (small dense subgraphs) Diameter Expansion/Conductance (bottlenecks) Eigenvalues, eigenvectors (quantify bottlenecks and find groups efficiently) Evolving toward monopolies/oligopolies? Assortativity Can it be searched, crawled efficiently? Can pagerank be computer efficiently? Can it route with low congestion? Does it support efficient info retrieval? How does information/technology spread?

  16. Given Choose random perfect matching over Structural / Macroscopic Models Random graphs with desirable graph properties, thought to be aggregating all microscopic primitives Example 1: Power Law Random Graph

  17. Example 2: Growth & Preferential Attachment One vertex at a time New vertex attaches to existing vertices

  18. Some evolutionary random graph models may also capture more factors, e.g, geography, and hence varying conductance. Example 2, generalization towards flexibility:

  19. Explanatory / Microscopic Models / Optimization Driven Example: HOT, evolutionary, new node attaches by minimizing cost and maximizing quality of service Point: Optimization primitives can yield power law distributions.

  20. Explanatory / Microscopic Models / Incentive Driven Example: A Network Formation Game How fast can such a stable configuration be reached?

  21. Hybrid Models RANDOM DOT PRODUCT GRAPH MODEL

  22. Example 1:

  23. Example 2:

  24. SUMMARY It is important to identify critical metrics and parameters ie, how they impact network performance. It is important to develop models where critical parameters vary and flexible network models. It is important to identify network primitives related to optimization and incentives. It is important to develop mechanisms that affect such primitives.

  25. HOW ABOUT YOU ? WHICH QUESTIONS DO YOU WANT TO ANSWER ?

More Related