1 / 25

Much of Science is Non-Googleable: An Emerging Solution

Much of Science is Non-Googleable: An Emerging Solution. National Science Foundation Washington, DC August 14, 2007 Science Foundation. Walter Warnick Director, Office of Scientific and Technical Information U. S. Department of Energy. True or False?.

Download Presentation

Much of Science is Non-Googleable: An Emerging Solution

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Much of Science is Non-Googleable: An Emerging Solution National Science Foundation Washington, DC August 14, 2007 Science Foundation Walter Warnick Director, Office of Scientific and Technical Information U. S. Department of Energy

  2. True or False? Most useful information is available via familiar search engines such as Google and Yahoo!

  3. Google: v., to search for information through Google Googleable: adj., information found by Googling Non-Googleable: adj., information that cannot be found by Googling

  4. Larry Page, speaking to scientists, AAAS 2007 "Virtually all economic growth (in the world) was due to technological progress. I think as a society we're not really paying attention to that.” He called on the scientists to make more of their research available digitally.“We have to unlock the wealth of scientific knowledge and get it to everyone.”

  5. Researchers Information fuels discovery Superior access to quality informationspeeds discovery Public Public Researchers OSTI Mission To advance science and sustain technological creativity by making R&D findings available to Department of Energy researchers and the American public

  6. Systems that crawl the Web do not typically reach below the surface Google “crawls” the surface Web, but scientific databases are largely found in the deep Web Surface Web Scientific databases stump Google

  7. Google moves ahead with plan to open up federal Web sites Google is making strides on an initiative to make information stored on public government Web sites more accessible to people looking for it, but challenges remain, officials with the search engine company said Wednesday. Three federal organizations recently agreed to structure their sites to make them accessible for nearly all Internet searches, the officials said. Information on the Plain Language Web site aimed at eliminating jargon in government communications, and on sites by the Energy Department's Office of Scientific and Technical Information and the Education Department's National Center for Education Statistics, has been opened up to the three most popular search engines: Google, Yahoo and MSN. Google works to solve the problem, but there’s a better way …

  8. We need systems that probe the deep Web Surface Web Federated search drills down to the deep Web where scientific databases reside. Unlike the Google solution, it places no burden on the database owners. Deep Web databases

  9. Basic Search Search term:“electric vehicles” AND batteries

  10. Results by rank

  11. Results by source

  12. Advanced Search Search term:“avian influenza” OR “bird flu”

  13. Results by source

  14. Bibliographic record

  15. Full text

  16. Federated Search: Advantages • Current, real-time results • No burden for database owner • Inexpensive to implement • No need-to-know for user • No searching door-to-door • Allows for fielded searching • Interoperability is automatically achieved

  17. Additional Points • Federated search has limitations • Neither crawling nor federated search is a panacea • Federated searching does things crawling cannot do, and vise versa. They are complementary technologies • Federated searching has advanced rapidly and should continue to advance

  18. In January 2007, Dr. Raymond Orbach, U.S. Department of Energy Under Secretary for Science, andLynne Brindley, Chief Executive of the British Library, signed a Statement of Intent to partner in the development of a searchable global science gateway

  19. WorldWideScience.org Sources • Australian Antarctic Data Centre • Article@INIST (France) • Canada Institute for Scientific and Technical Information • Defence Research and Development (Canada) • DEFF Global E-Prints (Denmark) • DEFF Research Database (Denmark) • J-EAST (Japan) • Journal@rchive (Japan) • J-STAGE (Japan) • J-STORE (Japan) • NARCIS (Netherlands) • ReaD (Japan) • Science.gov (United States) • SciELO (Brazil) • UK PubMed Central • Vascoda (Germany) • Transactions & Proceedings (New Zealand) African portals are in the works, at which time all the world’s inhabited continents will be represented

  20. France Germany The Netherlands Australia Denmark Canada United Kingdom United States Japan Current National Partners in WorldWideScience.org Brazil New Zealand

  21. In short WorldWideScience.org pulls together often isolated islands of information into one easily searchable gateway

  22. Next Steps for WorldWideScience.org • Establish governance structure • Increase number of national sources • Add non-English sources • Develop translations capabilities • Integrate limited-access sources The International Council for Scientific and Technical Information (ICSTI) has indicated preliminarily that it will serve as an umbrella for governance.

More Related