Introduction to Web Computing: Understanding the World Wide Web
Explore the concepts of the World Wide Web, web technologies, protocols, and mining techniques. Learn about web size, search efficiency, link analysis, and spam detection.
Introduction to Web Computing By J. H. Wang Sep. 26, 2011
What’s the Web • World Wide Web: WWW (or the Web) • Proposed by Tim Berners-Lee in 1989 • Based on ideas from hypertext systems • "... to link and access information of various kinds as a web of nodes in which the user can browse at will"
World Wide Web • Client: browser (e.g. Internet Explorer, Firefox) • To view web pages that may contain text, images, videos, and other multimedia, and to navigate between them via hyperlinks • Server (e.g. Apache httpd) • To host web pages and multimedia files
Three Major Technologies • URL (Uniform Resource Locator) or URI (Uniform Resource Identifier): a system of globally unique identifiers for resources on the Web and elsewhere • HTML (HyperText Markup Language): the publishing language • HTTP (Hypertext Transfer Protocol): the protocol
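A minimal sketch of how the URL and HTTP pieces fit together, assuming Python's standard urllib.parse module. The helper names describe_url and build_get_request are hypothetical and not part of the slides. The code only splits a URL into its components and builds the text of an HTTP/1.1 GET request; it does not open a network connection.

```python
from urllib.parse import urlsplit


def describe_url(url):
    """Split a URL into the parts a browser needs to locate a resource."""
    parts = urlsplit(url)
    return {
        "scheme": parts.scheme,      # which protocol to speak, e.g. "http"
        "host": parts.hostname,      # which server to contact
        "port": parts.port,          # None when the URL uses the default port
        "path": parts.path or "/",   # which resource on that server
        "query": parts.query,        # optional parameters after "?"
    }


def build_get_request(url):
    """Build the text of a minimal HTTP/1.1 GET request for the URL (illustrative only)."""
    parts = urlsplit(url)
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    return (
        f"GET {target} HTTP/1.1\r\n"
        f"Host: {parts.netloc}\r\n"
        "Connection: close\r\n"
        "\r\n"
    )


if __name__ == "__main__":
    # URL taken from the "Some Interesting Problems" slide.
    print(describe_url("http://www.worldwidewebsize.com/"))
    print(build_get_request("http://www.worldwidewebsize.com/"))
```

The server's reply to such a request would typically carry an HTML document, which the browser then renders. That is the third of the three technologies.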
What’s Related to Web Computing • Web protocols and formats • Versions, new features, security issues • Web search and other Web services • Information retrieval • Web mining • Web content mining • Web structure mining • Web usage mining
Some Interesting Problems • What’s the size of the Web? • At least 11.49 billion pages, according to http://www.worldwidewebsize.com/ • How to search the Web efficiently? • Crawling (Ch.20) • How to identify the most linked pages? • Link analysis (Ch.21) • How to identify the spam pages? • (Ch.19)
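As a rough illustration of the "most linked pages" question, the sketch below counts in-links on a tiny hand-made link graph. This is plain in-degree counting, the simplest possible form of link analysis, and is not claimed to be the method covered in Ch.21. The page names, the toy_web graph, and the function count_inlinks are all hypothetical.

```python
from collections import Counter


def count_inlinks(links):
    """Count how many distinct other pages link to each page.

    `links` maps a page to the list of pages it links to.
    Self-links and repeated links from the same page are ignored.
    """
    counts = Counter({page: 0 for page in links})
    for source, targets in links.items():
        for target in set(targets):
            if target != source:
                counts[target] += 1
    return counts


# Hypothetical four-page web; each key links to the pages in its list.
toy_web = {
    "a.html": ["b.html", "c.html"],
    "b.html": ["c.html"],
    "c.html": ["a.html"],
    "d.html": ["c.html", "c.html"],
}

if __name__ == "__main__":
    for page, inlinks in count_inlinks(toy_web).most_common():
        print(page, inlinks)
```

In this toy graph c.html is the most linked page, with in-links from three distinct pages. In-link counts are easy to inflate with pages created only to link to a target, which is one reason the spam question on the same slide is closely related.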