1 / 22

Innovative Methods of Internet Search Index Optimization and Google Empire

Innovative Methods of Internet Search Index Optimization and Google Empire. Presenter Dr. Arun K. Timalsina Dept. of Electronics & Computer Engineering Pulchowk Campus Institute of Engineering t.arun @ ieee.org. Internet Statistics. Google : # 1 Browsed Web Page.

jubal
Télécharger la présentation

Innovative Methods of Internet Search Index Optimization and Google Empire

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Innovative Methods of Internet Search Index Optimization and Google Empire Presenter Dr. Arun K. Timalsina Dept. of Electronics & Computer Engineering Pulchowk Campus Institute of Engineering t.arun@ ieee.org

  2. Internet Statistics

  3. Google : # 1 Browsed Web Page

  4. Search Engine Market • Google in 1998 just 25 million pages • Today billions of web pages, only index of 100 million GB • Google public DNS gets 70 billion requests a day

  5. Business Model : Search Quality

  6. Web Mining : Specialized Data Mining

  7. Google Search : Why Special ? Other Search Engine

  8. Google Search : Why Special ?

  9. Social Network Theory • Famous Small-world experiment : Stanley Milgram, 1967 • Experimental Set-up : Few letters to be sent from random Wichita (Kansas) and Omaha (Nebraska) residents to Boston (Massachusetts) residents but without having complete address • Each person has to seek the help from their friends in locating those letter receivers • Result : On an average of 5.5 persons hop was there in delivering these letters  Six Degrees of Separation • Implicit result of this research : • ~30 years after Google, • ~40 years after Facebook, …

  10. Web Structure Mining • Content Mining based algorithms : LSI, and SVD • Social Network Theory influenced models : hyperlinks of the Web to rank pages according to their levels of prestige or authority. • HITS : Jon Kleinberg (Cornell University) at Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, 1998 • PageRank : Sergey Brinand Larry Page ( PhD Students from Stanford University) at Seventh International World Wide Web Conference (WWW7), 1998 • PageRankpowers the Google Search Engine HITS : Hyperlink Induced Topic Search : (Hub, authority)

  11. PageRank : Rank Prestige in Social Network

  12. PageRank

  13. Simple Example Calculation

  14. Matrix Notation

  15. The Only Panacea of Business Complexity • 80-20 rule is no more relevant • Rather Selling Less of More : Long Tail Distribution • Internet search complexity problem (hundreds of billions of web pages ) • Easier application on Personalized service and product based business model

  16. Conclusion • Google Empire is based on this PageRank (may be more sophistication after being a company…..) • This is one among various examples of Research Projects transforming into a big company • American Internet use in 1998 was almost similar to current Nepal internet penetration rate Nepal Status • There are many Nepali versions of L. P. and S. B. ‘s among us • ( LaxmiPaneru, LekhnathPathak, ……. , • SharadBhujel, SankalpaBelbaase,… ! ) Thanks !

  17. Extra Slides

  18. Solving Equations

  19. Power Iteration Solving Example

  20. Example

  21. Power Iteration based Solving

  22. Power Iteration Method : Optimal Solving

More Related