History of Internet Searching

The Internet
• Built in 1969
• Funded by the DoD for scientific research, built by BBN Technologies
• Originally called ARPANET (Advanced Research Projects Agency Network)
• First nodes (connections) were at research institutions: UCLA, the Stanford Research Institute (SRI), UCSB, and the Univ. of Utah

Ray Tomlinson
• Invented network email in 1971
• It was not an assigned project; he built it because he thought it would be a “neat idea”

The Internet
• By 1971 there were 23 host computers on the ‘net
• The packet-switching computers that connected hosts to the ARPANET were called IMPs (Interface Message Processors)

Protocols
• Methods of using the Internet:
  • Telnet – remote access to and control of computers
  • FTP – File Transfer Protocol
  • HTTP – HyperText Transfer Protocol
  • Gopher – file access and downloading
  • Email

History of Internet Searching
• FTP – File Transfer Protocol
  • First specified in 1971; the current specification (RFC 959) was published in 1985
  • FTP servers provide files to FTP clients
• Problems with FTP
  • No organization of FTP servers
  • User had to know that an FTP server existed
  • User had to visit the FTP server to see its files

History of Internet Searching
• Archie
  • 1990 (before the WWW)
  • Alan Emtage at McGill Univ. in Montreal
  • Searchable directory of FTP files
  • Searched FTP servers and indexed their files
  • User searched the index
  • Required Telnet and FTP

History of Internet Searching
• Gopher
  • 1991 (the year the WWW began)
  • Paul Lindner & Mark P. McCahill of the Univ. of Minnesota
  • Named after the Univ. of Minnesota mascot
  • Connected Gopher servers through the Gopher hierarchy (gopherspace)

History of Internet Searching
• Wanderer (Matthew Gray’s World Wide Web Wanderer)
  • 1993
  • First WWW engine
  • Designed to track the size of the WWW
  • Captured URLs and entered them into a database (Wandex)
  • First robot (“bot”)

Search Engine Technology
• Three parts to a search engine:
  • Bots (robots)
  • Database
  • User interface

Search Engine Technology
• Bots (robots)
  • Also called spiders
  • Computer programs sent out by Query Servers
  • Search the Internet for servers
  • Identify servers and collect information about them
  • Use links from websites to find other sites

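The link-following behavior described above can be sketched as a breadth-first walk. This is a minimal illustration only: `TOY_WEB`, the `*.example` addresses, and the `crawl` function are hypothetical names invented for this sketch, and a real bot would fetch pages over the network instead of reading a dictionary.

```python
from collections import deque

# Hypothetical in-memory "web": page address -> addresses it links to.
TOY_WEB = {
    "a.example/": ["b.example/", "c.example/"],
    "b.example/": ["c.example/", "d.example/"],
    "c.example/": ["a.example/"],
    "d.example/": [],
    "e.example/": ["a.example/"],  # no page links here, so the bot never finds it
}


def crawl(seed, fetch_links):
    """Visit pages breadth-first, using each page's links to find new pages."""
    seen = {seed}
    queue = deque([seed])
    order = []
    while queue:
        url = queue.popleft()
        order.append(url)
        for link in fetch_links(url):
            if link not in seen:  # skip pages already discovered
                seen.add(link)
                queue.append(link)
    return order


visited = crawl("a.example/", lambda url: TOY_WEB.get(url, []))
print(visited)
```

Note that a page with no inbound links (`e.example/` here) is never reached, which is why a bot that relies only on links cannot see the whole web.
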
Search Engine Technology
• Database
  • Collects the information from the Query Server and organizes it

Search Engine Technology
• User interface
  • Allows users to search the database and returns the matching information from it

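One common way to organize collected pages so a user interface can search them is an inverted index (word -> pages containing it). The sketch below assumes that structure; the slides do not say how any particular engine stores its data, and `PAGES`, `build_index`, and `search` are hypothetical names.

```python
from collections import defaultdict

# Hypothetical pages a bot might have collected: address -> page text.
PAGES = {
    "a.example/": "history of the internet",
    "b.example/": "internet search engines",
    "c.example/": "gopher and archie search tools",
}


def build_index(pages):
    """Database step: map each word to the set of pages that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in text.lower().split():
            index[word].add(url)
    return index


def search(index, query):
    """User-interface step: return pages that contain every query word."""
    words = query.lower().split()
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results


index = build_index(PAGES)
hits = search(index, "internet search")
print(hits)
```

The user never touches the pages themselves at query time; the search runs against the index, much as Archie users searched an index of FTP listings.
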
Search Engine Technology
• Relevance ranking
  • The search engine measures how relevant the information it found is to your request
  • The first search engine to use relevance ranking was the Repository-Based Software Engineering (RBSE) spider in 1993

Search Engine Technology
• Relevance ranking (techniques)
  • How often do the search terms appear?
  • How close are the search terms to each other?
  • Where do the search terms appear?
  • How often do the search terms appear compared to the length of the web page?

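The four questions above can each be turned into a number and combined into a score. The sketch below is only an illustration: the function names, the sample pages, and especially the weights in `score` are assumptions made up for this example, not the formula of any real search engine.

```python
def relevance_signals(title, body, query):
    """Compute the four signals from the slide for one page."""
    terms = set(query.lower().split())
    words = body.lower().split()
    title_words = set(title.lower().split())
    # Positions in the body where any search term occurs.
    positions = [i for i, w in enumerate(words) if w in terms]
    # Smallest distance between occurrences of two different terms.
    gap = None
    for a in positions:
        for b in positions:
            if b > a and words[a] != words[b]:
                if gap is None or b - a < gap:
                    gap = b - a
    return {
        "frequency": len(positions),                # how often
        "closest_gap": gap,                         # how close together
        "title_hits": len(terms & title_words),     # where (title counts more)
        "density": len(positions) / len(words) if words else 0.0,  # vs. length
    }


def score(signals):
    """Combine the signals using hypothetical, hand-picked weights."""
    gap = signals["closest_gap"]
    proximity_bonus = 1.0 / gap if gap else 0.0
    return (signals["frequency"]
            + 10 * signals["density"]
            + proximity_bonus
            + 2 * signals["title_hits"])


query = "internet search"
page_a = relevance_signals(
    "Internet Search History",
    "internet search began with archie before the web",
    query,
)
page_b = relevance_signals(
    "Gopher",
    "gopher menus let users browse the internet long before anyone could search the web",
    query,
)
print(score(page_a), score(page_b))
```

Both sample pages contain each term once, but the first ranks higher because the terms sit next to each other, appear in the title, and make up a larger share of a shorter page.
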