Workshop 20: Metasearch - the NISO Initiative
March 25, 2006; Washington
Ray Denenberg, Ralph LeVan
Metasearch …. Also known as
• parallel search
• federated search
• broadcast search
• cross-database search
NISO: Definition, Problem, Challenge
• NISO Definition
  • “search and retrieval spanning multiple databases, sources, platforms, protocols, and vendors at one time.”
• The Problem
  • Current systems require users to know how to select, access and search specific databases.
• The corresponding Challenge
  • To create an environment that helps users find what they are seeking while minimizing what they need to know.
NISO: “Why Bother?”
• Because most patrons do not care where information is or who packaged it.
• Because Google cannot do it all.
NISO Metasearch Initiative: Goals
• Metasearch service providers
  • offer more effective and responsive services
• Content providers
  • deliver enhanced content
  • protect their intellectual property
• Libraries
  • deliver services that distinguish them from Google and other free web services
Metasearch Topological Entities
[Diagram labels: Client, Metasearcher, Database]
Metasearch Topological Model
[Diagram labels: Client, Metasearcher, Database (×3)]
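The topological model (one client, one metasearcher, several databases) can be illustrated with a minimal Python sketch. It is not taken from the workshop: the `DATABASES` table, its contents, and the function names are hypothetical, and a real metasearcher would reach each database over a network protocol rather than an in-memory list.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-ins for remote databases (assumption, not from the slides).
DATABASES = {
    "catalog": ["Metasearch basics", "Library systems"],
    "journals": ["Federated metasearch in practice", "Cataloging rules"],
    "theses": ["A study of broadcast search"],
}


def search_database(name, query):
    """Search one database and return (database name, matching titles)."""
    hits = [title for title in DATABASES[name] if query.lower() in title.lower()]
    return name, hits


def metasearch(query):
    """Send the same query to every database in parallel and merge the hits."""
    with ThreadPoolExecutor(max_workers=len(DATABASES)) as pool:
        # pool.map keeps the results in the same order as the database names.
        results = list(pool.map(lambda name: search_database(name, query), DATABASES))
    merged = []
    for name, hits in results:
        merged.extend({"database": name, "title": title} for title in hits)
    return merged


if __name__ == "__main__":
    for hit in metasearch("metasearch"):
        print(hit["database"], "|", hit["title"])
```

The client sees one merged list tagged with the originating database, which is the "minimizing what they need to know" idea in miniature.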
Recall Goals:
• metasearch service providers to offer more effective and responsive services
• content providers to deliver enhanced content and protect their intellectual property
• libraries to deliver services that distinguish them from Google and other free web services
Entities
Topological     NISO Defined
Client          Library
Metasearcher    Metasearch Provider
Database        Content Provider
[Diagram: goals by entity]
• Client (Library): deliver “distinguished” services
• Metasearcher (Service provider): offer more effective and responsive services
• Database (Content provider): deliver enhanced content; protect intellectual property
Metasearch Protocol Model
[Diagram labels: Client, Metasearcher, Database (×3), Protocol (×2)]
Metasearch Protocol
[Diagram labels: Metasearcher, Database (×3), Protocol]
Metasearch Initiative Committee Charge
• identifying/developing standards/best practices to improve interoperability between metasearch engines and content providers
• identifying a simple search/retrieve protocol to help database providers interoperate more effectively with metasearching applications
Metasearch Initiative Committee Charge - Boiling it Down
• Interoperability
  • between metasearch engines and content providers
• Protocol
  • identifying one, for interoperability between metasearch engines and content providers
Task Groups
• TG 1: Access Management
• TG 2: Collection Description
• TG 3: Search/Retrieve
TG 1: Access Management
• Authentication
• Authorization
Potential Authentication Technologies
• Non-authenticated identification
• IP recognition
• Proxy servers
• Referring URL
• Embedded data in URL
• Vendor-provided JavaScript
• Cookies
• Shouting
• Proprietary APIs
• NCIP, SIP2
• LDAP
• Shibboleth
• Kerberos
• Athens (UK)
• PAPI
• Tequila
TG 1 Recommendations
• Now
  • IP authentication
  • Username / Password
• Potential for the future
  • Shibboleth
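As an illustration of the "IP authentication" recommendation, the sketch below checks a client address against registered network ranges using Python's standard `ipaddress` module. The institutions, the address ranges (reserved documentation ranges), and the function name are all hypothetical; the slides do not describe how any particular provider implements this.

```python
import ipaddress

# Hypothetical licence table: institution -> registered networks.
# The ranges are reserved documentation networks, not real customers.
LICENSED_NETWORKS = {
    "Example University": [ipaddress.ip_network("192.0.2.0/24")],
    "Example Public Library": [ipaddress.ip_network("198.51.100.0/25")],
}


def identify_institution(client_ip):
    """Return the institution whose registered range contains client_ip, else None."""
    address = ipaddress.ip_address(client_ip)
    for institution, networks in LICENSED_NETWORKS.items():
        if any(address in network for network in networks):
            return institution
    return None


if __name__ == "__main__":
    print(identify_institution("192.0.2.17"))
    print(identify_institution("198.51.100.200"))
```

In a metasearch setting the request may arrive from the metasearcher rather than from the end user's machine, so which address is checked is a deployment decision this sketch does not settle.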
TG 2: Collection Description
• Service Description: used by applications to determine how to access remote services.
• Collection Description: used by humans or applications to select collections from those made available by a metasearch service.
TG 3: Search/Retrieve
• Describe current practice …
• vocabulary …
• template …
• Inventory …
• Citation level data elements
• Investigate Result Set and Record level metadata
• Review SRW/SRU and recommend modifications
TG 3: Search/Retrieve
• Citation level data elements
• Investigate Result Set and Record level metadata
• Review SRW/SRU and recommend modifications
Citation Level Data Elements
• http://www.niso.org/standards/resources/MI-Citation_Elements_v1.pdf
Result Set metadata
• Branding
• Clustering
• Database name
• Diagnostic messages
• List of terms
• Postings, count by term
• Record count
• Resources used
• Result status
• Sort order
Record Level metadata
• Application URI
• Character Set
• Constraints
• Cost Info
• Dates (creation, last modified, reviewed)
• Language
• Position in result set
• Processing instructions
• Rank
• Score
• Size
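One way to picture how result set and record level metadata fit together is a pair of small data structures. The Python sketch below models only a handful of the elements listed on the two slides; the class names, field names, types, and the `best_scored` helper are assumptions made for illustration, not definitions from the task group.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class RecordMetadata:
    """A subset of the record level elements (hypothetical field names)."""
    position: int                      # "Position in result set"
    rank: Optional[int] = None         # "Rank"
    score: Optional[float] = None      # "Score"
    language: Optional[str] = None     # "Language"
    character_set: Optional[str] = None  # "Character Set"


@dataclass
class ResultSetMetadata:
    """A subset of the result set elements (hypothetical field names)."""
    database_name: str                 # "Database name"
    record_count: int                  # "Record count"
    result_status: str                 # "Result status"; values are an assumption
    sort_order: Optional[str] = None   # "Sort order"
    diagnostics: List[str] = field(default_factory=list)  # "Diagnostic messages"
    records: List[RecordMetadata] = field(default_factory=list)


def best_scored(result_set):
    """Return the record with the highest score, or None if no record has one."""
    scored = [r for r in result_set.records if r.score is not None]
    return max(scored, key=lambda r: r.score) if scored else None


if __name__ == "__main__":
    rs = ResultSetMetadata("Example DB", record_count=2, result_status="complete")
    rs.records.append(RecordMetadata(position=1, score=0.4))
    rs.records.append(RecordMetadata(position=2, score=0.9))
    print(best_scored(rs))
```

Keeping the per-database information on the result set and the per-hit information on the record mirrors the split between the two slides.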
Review SRW/SRU and recommend modifications
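For readers unfamiliar with SRU, a searchRetrieve request is an ordinary URL carrying a CQL query. The Python sketch below builds one using standard SRU parameter names (`operation`, `version`, `query`, `startRecord`, `maximumRecords`). The base URL is a placeholder, the function name is hypothetical, and version 1.1 is an assumption about which SRU version a target supports; none of this reflects the task group's recommended modifications.

```python
from urllib.parse import urlencode


def build_sru_url(base_url, cql_query, start_record=1, maximum_records=10):
    """Build an SRU searchRetrieve URL for a CQL query (version 1.1 assumed)."""
    params = {
        "operation": "searchRetrieve",
        "version": "1.1",
        "query": cql_query,
        "startRecord": start_record,
        "maximumRecords": maximum_records,
    }
    # urlencode percent-encodes the CQL query so it is safe inside a URL.
    return base_url + "?" + urlencode(params)


if __name__ == "__main__":
    # http://sru.example.org/db is a placeholder, not a real SRU server.
    print(build_sru_url("http://sru.example.org/db", 'dc.title = "metasearch"'))
```

Because the request is a plain HTTP GET, a metasearcher could issue the same URL pattern against every database that exposes an SRU interface.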
Metasearch XML Gateway “MXG”