260 likes | 404 Views
SEARCHING FOR TRUTH. chapter 5. Locating Information on the WWW. Characteristics of Directories. Topical lists of Internet Resources arranged in a hierarchical way
E N D
SEARCHING FOR TRUTH chapter5 Locating Information on the WWW
Characteristics of Directories • Topical lists of Internet Resources • arranged in a hierarchical way • differ from search engines: humans maintain directories and search engines rely on spiders or robots to crawl the Internet for links.
Characteristics of Directory • Ask the following questions: • Who selects the included Web resources? • Who categorizes the Web pages and sites? • How are the results displayed? • Are the sources rated? annotated? reviewed?
Strengths of Directories • fewer resources than search engine databases • provide rating or annotation of resources • increase probability of finding relevant results
Disadvantages • arbitrary hierarchical arrangements • infrequent updates • subjectivity of rating and annotating resources
Using the Yahoo Directory • Go to Yahoo • click on Science • follow appropriate links to Human Genome Projects.
Using Directory Search • Go to Yahoo home page • click on search and enter “Human genome project”
The First Step: Evaluating Your Information Needs • Types of information most likely to be found on the Internet • Some reasons why the Web won’t have everything you’re looking for • Choosing the best tool to start with • A checklist to help you choose the right tool
Finding Information Gems in Virtual Libraries • Virtual libraries are directories that contain collections of evaluated resources that have carefully selected by information specialists. • Subject guides, reference works, and specialized databases, are most likely found in virtual libraries. • Two popular virtual libraries are the Internet Public Library, http://ipl.org, and the Librarians’ Index to the Internet, http://lii.org.
Searching the World Wide Web: Using Search Engines • Search engines use computer programs called spiders or robots to gather information on the Internet. • The information is kept in a database that is searchable. • Two popular search engines are Google, http://www.google.com, and AltaVista, http://altavista.com.
Search Engine Similarities • Search engines list results by some type of relevancy ranking scheme. • Two common search features are Boolean searching and phrase searching.
Search Engine Differences • Size of the database • Search options • Update frequency • Ranking of search results • Special features (images, news, and so forth)
Using Several Search Engines Simultaneously: Meta-search Tools • Some allow you to search several search engines simultaneously. • Some supply lists of databases that can be searched directly from their pages. • May provide a good way to keep up with new search engines.
Task • Explore the Directory of NPR – describe its organization • Now look at a web library. Describe its organization • Find the Kelly book value of my car: a 1998 Honda Civic CX.
Content Issues: Pornography, Free Speech, Censorship, and Filtering • Free speech vs. censorship • Communications Decency Act of 1996 • Filtering and blocking devices
Top level, second level and third level classifications of a collection
The NPR home page Programming pull-down menu What are the top level classifications?
The NPR programming hierarchy tree How many levels in the hierarchy?
Restricting the Thai restaurants “hits” by eliminating any page containing the word review.
worda wordb Making Effective Queries • Queries use the three logical operators worda AND wordb -- both words must appear worda OR wordb -- either word may appear NOT worda -- the word is prohibited from appearing • Google has separate windows for each • When 1 window is available, write a formula (Lincoln OR Jefferson) AND NOT Memorial Which parts of the diagram do the operators cover?
The Burmese mountain dog page Does this page seem legitimate?