Internet Exploration: Search Engines. Computer Information Technology – Section 3-2. The Internet. Objectives: The Student will: Understand search engines and how they work. Understand the pros and cons of various popular search engines.
Internet Exploration: Search Engines Computer Information Technology – Section 3-2
The Internet • Objectives: • The Student will: • Understand search engines and how they work • Understand the pros and cons of various popular search engines • Understand the definitions of terms associated with search engines • Perform a basic search and compare results from different search engines
How does a search work? • Google gives a quick tour of how a search works: http://www.google.com/intl/en/insidesearch/howsearchworks/thestory/index.html
Search Engines • Search Engine: A program that searches documents for specified keywords and returns a list of the documents where the keywords were found. Without search engines, finding anything on the Web would be extremely difficult. • Typically, a search engine works by sending out a spider to fetch as many documents as possible. • Spider: A program that automatically fetches Web pages. Spiders are used to feed pages to search engines. It's called a spider because it crawls over the Web. Another term for these programs is web crawler. Because most Web pages contain links to other pages, a spider can start almost anywhere. As soon as it sees a link to another page, it goes off and fetches it.
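The spider behavior described above can be sketched in a few lines of Python. This is only an illustration, not how any real search engine is built: the tiny in-memory "Web" (`FAKE_WEB`), its example.com URLs, and the names `LinkParser` and `crawl` are all made up for this example, and a real spider would fetch pages over the network instead of from a dictionary.

```python
from collections import deque
from html.parser import HTMLParser

# Hypothetical miniature "Web": URL -> HTML of that page (an assumption for
# this sketch; a real spider would download each page over HTTP).
FAKE_WEB = {
    "http://example.com/": '<a href="http://example.com/a">A</a> '
                           '<a href="http://example.com/b">B</a>',
    "http://example.com/a": '<a href="http://example.com/b">B</a>',
    "http://example.com/b": '<a href="http://example.com/">Home</a>',
}


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, web):
    """Fetch pages starting at start_url, following each new link once."""
    seen = {start_url}          # URLs already queued, so loops don't repeat
    queue = deque([start_url])  # URLs waiting to be fetched
    fetched = []
    while queue:
        url = queue.popleft()
        html = web.get(url)
        if html is None:        # the link points to a page that doesn't exist
            continue
        fetched.append(url)
        parser = LinkParser()
        parser.feed(html)
        for link in parser.links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return fetched


pages = crawl("http://example.com/", FAKE_WEB)
print(pages)
```

Starting from the home page, the sketch visits each linked page once, which mirrors the idea that a spider can start almost anywhere and keep following links.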
Search Engines • Spiders or crawlers visit a Web site, read the information on the site itself, read the site's meta tags, and follow the links the site connects to, indexing all linked Web sites as well. • Meta tag: A special HTML tag that provides information about a Web page. You can't see meta tags on the displayed page. They provide information such as who created the page, how often it is updated, what the page is about, and which keywords represent the page's content. • The crawler returns all that information to a central depository, where the data is indexed. • This index, not the live Web, is the data the search engine searches! That is why search engines sometimes return links that are no longer valid.
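To make meta tags more concrete, here is a small Python sketch that reads them from a page the way a crawler might. The sample page, its author name, and the class name `MetaTagParser` are invented for illustration; real pages vary widely in which meta tags they include.

```python
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Collects <meta name="..." content="..."> pairs from a page."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = dict(attrs)
        name = attr_map.get("name")
        content = attr_map.get("content")
        if name and content is not None:
            self.meta[name.lower()] = content


# Hypothetical sample page (made up for this sketch).
SAMPLE_PAGE = """<html><head>
<title>Sample School Page</title>
<meta name="author" content="Jane Doe">
<meta name="description" content="Home page for a high school.">
<meta name="keywords" content="school, education, students">
</head><body><p>Welcome!</p></body></html>"""

parser = MetaTagParser()
parser.feed(SAMPLE_PAGE)
meta = parser.meta

# Split the comma-separated keywords into a clean list.
keywords = [word.strip() for word in meta["keywords"].split(",")]
print(meta["author"], keywords)
```

None of these values appear in the visible body of the page, yet a program reading the HTML can pick them up easily.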
Search Engines • Crawlers rely entirely on links from other web pages, so if no other page ever links to a web page, search engine spiders cannot find it. • Crawlers return to web pages periodically to update the database.
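The periodic return visit can be pictured as comparing the stored database with what is on the Web right now. The Python sketch below assumes a simplified database that maps each URL to the page text saved at the last crawl; the URLs, the page text, and the function name `recrawl` are hypothetical.

```python
def recrawl(index, live_web):
    """Revisit every URL in the index.

    Returns (refreshed_index, stale_urls): pages that still exist get their
    stored text updated, and pages that have disappeared are reported.
    """
    stale = sorted(url for url in index if url not in live_web)
    refreshed = {url: live_web[url] for url in index if url in live_web}
    return refreshed, stale


# What the search engine stored during its last crawl (made-up data).
old_index = {
    "http://example.com/news": "Old headline",
    "http://example.com/gone": "This page was deleted later",
}

# What actually exists on the Web today (made-up data).
live_web = {
    "http://example.com/news": "New headline",
}

new_index, stale_urls = recrawl(old_index, live_web)
print(new_index)
print(stale_urls)
```

Until the crawler makes this return visit, the old database still lists the deleted page, so a search can return a link that no longer works.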
Search Engines – Why they give different results • Not all indices are going to be exactly the same. • It depends on what the spiders find (or what the humans submitted). • Not every search engine uses the same algorithm to search through the indices. • The algorithm is what the search engines use to determine the relevance of the information in the index to what the user is searching for. • Algorithm: A formula or set of steps for solving a particular problem.
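As a rough illustration of why two engines can rank the same pages differently, the Python sketch below applies two simple, made-up scoring rules to the same small index. Neither rule is the algorithm of any real search engine; the documents and the function names `build_index`, `rank_by_count`, and `rank_by_coverage` are assumptions for this example.

```python
from collections import Counter

# Hypothetical mini collection of pages: page id -> page text.
DOCS = {
    "page1": "search engines search the web",
    "page2": "a spider crawls the web so engines can search",
    "page3": "search search search",
}


def build_index(docs):
    """Index: for each page, count how often each word appears."""
    return {doc_id: Counter(text.lower().split()) for doc_id, text in docs.items()}


def _ranked(scores):
    """Sort page ids by score (highest first), breaking ties by page id."""
    hits = [doc_id for doc_id, score in scores.items() if score > 0]
    return sorted(hits, key=lambda doc_id: (-scores[doc_id], doc_id))


def rank_by_count(index, query):
    """Rule A: score = total number of times the query words appear."""
    terms = query.lower().split()
    return _ranked({d: sum(counts[t] for t in terms) for d, counts in index.items()})


def rank_by_coverage(index, query):
    """Rule B: score = how many different query words the page contains."""
    terms = set(query.lower().split())
    return _ranked({d: sum(1 for t in terms if counts[t] > 0) for d, counts in index.items()})


index = build_index(DOCS)
by_count = rank_by_count(index, "search web")
by_coverage = rank_by_coverage(index, "search web")
print(by_count)
print(by_coverage)
```

Both rules agree on the top page, but they disagree about second place: Rule A rewards the page that repeats "search" many times, while Rule B prefers the page that contains both query words.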
Search Engines – Why they give different results • Google has one of the largest databases, but studies indicate that less than half of the searchable web is indexed by Google. • Studies also show that more than 80% of the pages in a major search engine's database exist only in that database. • When doing research, try different search engines!
Search Engines – Search Results • Searching for “Hancock High School”: • Google: About 32,000,000 results • Yahoo: 4,250,000 results • Ask.com: Doesn’t tell you. • Bing.com: 4,120,000 results
Search Engines – Wrap-Up • Terms you should know: • Search Engine: • A program that searches documents for specified keywords • Spider or Crawler: • A program that automatically fetches Web pages. • Meta tags: • A special HTML tag that provides information about a Web page. • Algorithm: • A formula or set of steps for solving a particular problem.
Search Engines – Assignment • Before you leave today… • Pick a topic of interest to you. IT MUST BE APPROPRIATE FOR SCHOOL! • Pick 3 search engines (Google, Yahoo, Altavista.com, Ask.com, www.alltheweb.com, bing.com, www.askjeeves.com, lycos.com) • Do a search on your topic • On your paper, put: • Your name and the period • Your topic • Report how many web sites each search engine finds • Note if any of the top 10 sites are the same between the different search engines (circle the sites that are on all 3 lists).
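The last step of the assignment, finding the sites that appear on all three top-10 lists, can also be checked with a short Python sketch. The result lists below are invented placeholders (shortened to four entries each), not real search results, and the function name `common_results` is hypothetical; replace the lists with the sites you actually recorded.

```python
def common_results(*result_lists, top_n=10):
    """Return the sites found in the top_n of every list, in first-list order."""
    trimmed = [results[:top_n] for results in result_lists]
    common = set(trimmed[0])
    for results in trimmed[1:]:
        common &= set(results)  # keep only sites seen in every list so far
    return [site for site in trimmed[0] if site in common]


# Made-up top results from three search engines (placeholders only).
engine_a = ["example.com", "example.org", "example.net", "a.example"]
engine_b = ["example.org", "b.example", "example.com", "example.net"]
engine_c = ["example.net", "example.com", "c.example", "d.example"]

shared = common_results(engine_a, engine_b, engine_c)
print(shared)
```

The sites printed are the ones you would circle on your paper.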