190 likes | 298 Vues
The history of internet searching began in 1969 with ARPANET, funded by the DoD for scientific research. Initially connecting universities, it paved the way for innovations like email by Ray Tomlinson in 1971. Early methods included Telnet, FTP, and Gopher, leading to the first search tools such as ARCHIE and the WWW Wanderer. These developments laid the groundwork for search engine technology, introducing concepts like bots, databases, user interfaces, and relevance ranking, crucial for how we navigate the internet today.
E N D
The Internet • Built in 1969. • Funded by the DoD for scientific research, built by BBN Technologies. • Originally called ARPANET • Advanced Research Projects Agency • First nodes (connections) were at universities (UCLA, UCSB, Stanford, Univ. of Utah)
Ray Tomlinson Invented Email in 1971. Wasn’t supposed to be working on it, he thought it would be a “neat idea”.
The Internet By 1971 there were 23 sites on the ‘net. Computers that made up the ARPANET were called IMP’s (Internet Message Processor)
Protocols • Methods of using the Internet: • Telnet – Access and Control Computers • FTP – File Transfer Protocol • HTTP – HyperText Transfer Protocol • Gopher – File Access & Downloading • Email
History of Internet Searching • Problems with FTP • No organization of FTP Servers • User had to know an FTP Server existed • User had to visit FTP Server to see files • FTP – File Transfer Protocol • Protocol established in 1985. • FTP Servers provide files to FTP Clients
History of Internet Searching • ARCHIE • 1990 (No WWW) • Alan Emtage @ McGill Univ. in Montreal • Searchable directory of FTP files • Searched FTP Servers and indexed their files • User searched the Index • Required Telnet and FTP
History of Internet Searching • Gopher • 1991 (WWW Began) • Paul Lindner & Mark P. McCahill of Univ. of Minnesota • Named after the Univ. of Minn. Mascot • Connected Gopher servers through the Gopher hierarchy (gopherspace)
History of Internet Searching • Wanderer(Matthew Gray’s World Wide Web Wanderer) • First WWW Engine • Designed to track the size of the WWW • Captured URL’s and entered into database (Wandex) • First Robots “bots”
Search Engine Technology • Three parts to a Search Engine • Bots (Robots) • Database • User Interface
Search Engine Technology • Bots (Robots) • Also called Spiders • Computer programs sent out by Query Servers • Search the Internet for servers • Identify servers & collect information • Uses links from websites to find other sites
Search Engine Technology • Database • Collects the information from Query Server and organizes it.
Search Engine Technology • User Interface • Allows users to search the database and returns the information from it.
Search Engine Technology • Relevance Ranking • Search engine measures the relevance of the information found to your request • First search engine to use Relevance Ranking was the Repository-Based Software Engine (RBSE) in 1993
Search Engine Technology • Relevance Ranking (Techniques) • How often do the search terms appear • How close are the search terms to each other • Where do the search terms appear • How often do the search terms appear compared to the length of the web page