1 / 22

Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg

Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg. Presented By: Talin Kevorkian Summer 2010. Overview. Why Do We Care? Introduction Information Objective

Télécharger la présentation

Authoritative Sources in a Hyperlinked Environment Jon M. Kleinberg

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Authoritative Sources in a Hyperlinked EnvironmentJon M. Kleinberg Presented By: Talin Kevorkian Summer 2010

  2. Overview • Why Do We Care? • Introduction Information • Objective • Approaches and Observed Results • Related Work • Generalization • Conclusion • Evaluation of Pros and Cons Authoritative Sources in a Hyperlinked Environment

  3. Why Do We care? • Complexity of WWW as a Hypertext Corpus • Nature of the Hyperlinked Environment Structure • Efficiency (Longer Response Time) and Storage Problems Because of Huge Amount of Results Return to the User Authoritative Sources in a Hyperlinked Environment

  4. Introduction Information • Query Types • Specific • E.g. ”Does Windows 7 Support Oracle 10g?” • Scarcity Problem • Broad-Topic • E.g. “Sql Programming Language ” • Abundance Problem • Authority Notion • Similar-Page • E.g. “Similar Pages to Oracle.com” Authoritative Sources in a Hyperlinked Environment

  5. Introduction Information • Link-Based Model • Encoding latent human judgment • Conferred Authority • Creating Balance Between Popularity and Relevance • Relation Between Authority and Hubs Authoritative Sources in a Hyperlinked Environment

  6. Objective • Presenting the Link-Based Model for the Conferral Authority • Exploring Authoritative WWW Sources in the Global Range Authoritative Sources in a Hyperlinked Environment

  7. Approaches and Observed Results • Focused Subgraph Algorithm for WWW • Authorities and Hubs Computation • Approach for Similar-Page Queries • Sample Observed Results Authoritative Sources in a Hyperlinked Environment

  8. Focused Subgraph Algorithm for WWW • Inputs: • Query String σ • Text-based Search Engine • Outputs: • Set of Hyperlinked Pages as a Directed Graph G(V,E) • Root Set Rσ • Sub Set Sσ • Almost Small in size • Containing Most of Relevant Pages • Covering Most of the Strongest Authorities • Links Type in G[Sσ] • Transverse • Intrinsic Authoritative Sources in a Hyperlinked Environment

  9. Authorities and Hubs Computation • Solution to the approach of Ordering Pages by Their In-degree • Confusion Between Strong “Authorities” and “Universally Popular“ Pages • Containing Mutually Reinforcing Relationship Concept Authoritative Sources in a Hyperlinked Environment

  10. Authorities and Hubs Computation • Iterate Algorithm • Input: • Set of n linked pages Gσ • Outputs: • Updated Authority Weight (thru operation I) • Updated Hub Weight (thru Operation O) • Filter Algorithm • Input: • Set of n linked pages Gσ • Outputs: • Reporting Pages with Top c Authorities • Reporting Pages with Top c Hubs Authoritative Sources in a Hyperlinked Environment

  11. Approach for Similar-Page Queries • First Step: What Do Users of the WWW Decide to be Related to a Page When They Create any Pages and Hyperlinks • Second Step: Applying Link Structure to the Concept of “Similarity” • Third Step: Using concept of Authorities and Hubs Authoritative Sources in a Hyperlinked Environment

  12. Sample Observed Results(For Broad-Specific Queries) Authoritative Sources in a Hyperlinked Environment

  13. Sample Observed Results (For Similar-Pages Queries) Authoritative Sources in a Hyperlinked Environment

  14. Related Work Link Structure is Related to: • Definition of Standing, Impact and Influence Concepts • WWW Ranking Techniques • Data Clustering Authoritative Sources in a Hyperlinked Environment

  15. Standing, Impact and Influence Concepts • Social Network • Proposed Standing Measure • Katz Theory: Based on Path-Counting • Hubbell Theory : Based on Nodes Weight-Propagation • Scientific Citations • Proposed Impact/Influence Measure • Garfield’s Impact Theory • Pinski-Narin Influence Theory Authoritative Sources in a Hyperlinked Environment

  16. WWW Ranking Techniques • Ranking Measure Proposal: • Botafogo-Rivlin-Shniderman Theory • Carriere-Kanzman Theory • Brin-Page Theory and Contrast with This Paper Approach Authoritative Sources in a Hyperlinked Environment

  17. Data Clustering • Clustering needs : • Similarity Functions • Bibliographic Coupling • Co-Citation • Cluster Producer Functions • Small-Griffith Approach • Dimension-Reduction • Spectral Graph partitioning • Centroid Scaling Authoritative Sources in a Hyperlinked Environment

  18. Generalization • Specific Queries • Diffusion Concept • Set of Hubs and Authorities can be Separated from each other Because: • Query String has different Meaning like “Jaguar” • Query String is a Highly Polarized Subject Like “Abortion” • Query String can be Applied in Multiple Communities like “Randomized Algorithms” Authoritative Sources in a Hyperlinked Environment

  19. GeneraliztionSample Results Authoritative Sources in a Hyperlinked Environment

  20. Conclusion • Basic Elements of Paper Approach • Applying Notation of Authoritative Sources • Selecting High Quality of Results • Dealing with Scale Problem • Exploring Structure of Hubs and Authorities Authoritative Sources in a Hyperlinked Environment

  21. Evaluation of Pros and Cons • Pros: • Clearly Describe the Algorithms and Applied Approaches • Provide Tangible Examples and Results • Enough Connection to Related Works • Cons: • Ignoring the Textual Contents of pages • Complexity in the Nature of Quality Judgment • Concentrating mostly on Broad-Topic Queries Authoritative Sources in a Hyperlinked Environment

  22. Q & A Authoritative Sources in a Hyperlinked Environment

More Related