260 likes | 281 Views
Other Internet Sources. When you search with a search engine, you are not searching the entire Internet. Estimated over 80% of information available is invisible to search engines. The Hidden Web. Or, the Invisible Web or Deep Web or Deep Matter. It’s not a secret! W hat’s not retrieved.
E N D
When you search with a search engine, you are notsearching the entire Internet
Estimated over 80% of information available is invisible to search engines
The Hidden Web Or, the Invisible Web or Deep Web or Deep Matter
It’s not a secret! What’s not retrieved Thousands of specialized databases, dynamic pages, files… millions of records No matter what you call it…
“Hidden” databases produced by… Universities Libraries Associations Businesses Government agencies The Hidden Internet, cont.
Great article on the Invisible Web • Chris Sherman and Gary Price • “The Invisible Web: Uncovering Sources Search Engines Can’t See” • From Library Trends • Let’s find it!
Invisible Web Search Tools • http://www.lii.org • infomine.ucr.edu • http://www.completeplanet.com • http://www.intute.ac.uk • http://gdnet.org/index.php (Caveat: searching the last site is challenge!)
Specific topic areas Special interest Quirky Unlovely (design issues) Personal WWW Directories
Examples of Personal WWW Directories • Matt Drudge’s Dad (it’s okay, he’s a librarian): www.refdesk.com • Gary Price, Hidden Web Guru: http://www.freepint.com/gary/direct.htm
Other Sources of Information on the Internet • Metasearch Engines • Newsgroups and Listservs • Blogs • News search engines
1. Metasearch Engines • Sends your keywords to several search engines at the same time • Return a single list of results from multiple sources • “Source” engine identified (most of the time)
Standard Search Engines • Search for keywords, number of times within document • Keywords from a single (updated) database of websites • Each search engine searches unique selection of web pages • Results ranked and sorted • Robots or spiders find new websites
Metasearch Engines • Transmits searches simultaneously to several search engines • Results gathered from engines queried • Search terms sent to indexes maintained by traditional search engines • Pros and Cons?
Metasearch Engines Examples • Directory: http://www.santarosa.edu/library/Refs/engines.shtml • http://www.metacrawler.com/ • http://www.dogpile.com/ • http://ixquick.com/ • http://www.webcrawler.com/
Newsgroups Worldwide bulletin board systems Tens of thousands of forums (newsgroups) Groups focus on wide range of topics, accessible to anyone & everyone “Bulletin board” style: original and follow-up postings Not subscription based 2. Newsgroups and Mailing Lists
Mailing Lists Automated mailing list distribution systems “Subscribe" to discussion lists Distributed to the entire subscriber base via e-mail Discussion groups Wide-ranging Subscription-based Newsgroups and Mailing Lists
Pros Good for locating professional discussions Interactive, questions welcome Collaborative Alternative voices Cyber-networking Cons Unmoderated or lightly moderated forums “On the Internet, no one knows you’re a dog.” Can be difficult to search Newsgroups and Mailing Lists
Newsgroups and Mailing Lists • Accessible mailing lists by Catalist:http://www.lsoft.com/lists/listref.html • Google groups (the Usenet): http://groups.google.com • Usenet history: http://www.ibiblio.org/usenet-i/ • Yahoo groups: http://groups.yahoo.com • Topica lists: http://lists.topica.com/ • About.com forums: http://www.about.com
Other Ways to Find Collaborative Sources • Search Google or other search engines for forums. For example: PDF forums • Search Google or other search engines for discussion groups. For example: Gardening discussion groups
Newsgroup and Mailing List Search Examples • Google Groups (Usenet) • Catalist Listserv search • http://www.lsoft.com/lists/listref.html
Online journals, updated often News-oriented Contain commentary and links Personal or professional focus Pros: Alternative sources Constantly updated Wide range of topics Cons: Difficult to search Too much information Reliability of authors “Feeding Frenzy” 3. Weblogs or Blogs
Weblog Sources • Google Directory: http://directory.google.com/Top/Computers/Internet/On_the_Web/Weblogs/ • DMOZ Weblog listing: http://dmoz.org/Computers/Internet/ • IPL Blog Page: http://www.ipl.org/div/blogs • Search Google or other search engines for weblogs or blogs. For example: Gardening weblogs
Homework for Next Week • Read through UC Berkeley’s Internet Evaluation Site • Listen to Talk of the Nation Wikipedia show • (Click on the “Listen” icon at the top of the page; story is about 30 minutes long) • …and/or read the transcript of the show • Complete Internet Quiz
Homework for Next Week • Internet Source Assignment • May e-mail before 3/12 • Final Project Due!