ETD Search Services • Ming Luo, Edward A. Fox (fox@vt.edu) • Virginia Tech
Acknowledgements (selected) • Support: Adobe, AOL, DFG, NSF (DUE-0333531, 0136690, 0121679; IIS-0086227, 0080748), OCLC, UNESCO, VTLS • Colleagues: Vinod Chachra, Tom Dehn, Marcos Gonçalves, Thom Hickey, Aaron Krowne, Ming Luo, Gail McMillan, Hussein Suleman, Jeff Young
Where are the data coming from? • From this community! • Please join us and share your data! • OCLC NDLTD Union • Metadata for 112,652 ETDs • 44 institutions • Will be able to provide 3 formats • DC • ETDMS • MARC21
Where are the data coming from? • OCLC (Research) contacts are • Thom Hickey [hickey@oclc.org] • Jeff Young [jyoung@oclc.org] • Tom Dehn [dehn@oclc.org] • VTLS has some additional data sources • Providing data other than through OAI-PMH • Including in Korean and Greek • Contact is Vinod Chachra [chachrav@vtls.com]
OCLC SRU • What is SRU? • Search and Retrieve URL Service (SRU) is a web-service-based protocol for searching databases • Derived from Z39.50 • Uses the Common Query Language (CQL) • Current version: 1.1 (13 February 2004) • Three basic operations: explain, scan, and searchRetrieve
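As a rough illustration of the three SRU operations, the sketch below builds request URLs using standard SRU 1.1 parameter names (operation, version, query, scanClause, startRecord, maximumRecords). The endpoint `BASE_URL` and the helper `sru_url` are hypothetical names chosen for this example; they are not the actual OCLC service address or API.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Hypothetical endpoint, for illustration only; substitute a real SRU server.
BASE_URL = "http://sru.example.org/ndltd"


def sru_url(operation, **params):
    """Build an SRU 1.1 request URL for the given operation."""
    query = {"operation": operation, "version": "1.1"}
    query.update(params)
    # urlencode percent-encodes the CQL query (spaces, quotes, etc.).
    return BASE_URL + "?" + urlencode(query)


# explain: ask the server to describe itself (indexes, schemas).
explain_url = sru_url("explain")

# scan: browse index terms around a scan clause.
scan_url = sru_url("scan", scanClause='dc.title = "cat"')

# searchRetrieve: run a CQL query and page through the results.
search_url = sru_url(
    "searchRetrieve",
    query='dc.title = "cat"',
    startRecord=1,
    maximumRecords=10,
)

if __name__ == "__main__":
    for url in (explain_url, scan_url, search_url):
        print(url)
    # Decode the query string again to check what the server would receive.
    print(parse_qs(urlsplit(search_url).query))
```

Because SRU requests are plain HTTP GET URLs, any of these can be pasted into a browser or fetched with an HTTP client; the response is an XML document.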
CQL Examples (from http://www.loc.gov/z3950/agency/zing/cql/) • dc.title cql.stem • dc.title = "cat" • cat • dc.title = "cat" • author = "smith" • dc.title any "cat" • bath.author cql.exact "smith, j." • dc.title any/relevant/rel.CORI "cat fish" • dc.author exact/stem "smith, j." • dc.title = cat • "<element>" • dc.title = "cat" and bath.author = "smith" • "cat" or hat • dc.title = "cat" prox/distance=1/unit=word dc.title = "in" • "cat" prox/distance>2/ordered "hat" • dc.title=cat and/rel.sum dc.title=dog • > dc="http://www.dublincore.org/" dc.title = "cat"
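The simpler examples above follow an "index relation term" pattern, optionally joined by a boolean. A minimal sketch of composing such queries is given here; the helper names `quote_term`, `clause`, and `combine` are invented for this example. It relies on the CQL rule that a double quote inside a quoted term is escaped with a backslash, and it does not cover relation modifiers, proximity, or prefix maps.

```python
def quote_term(term):
    """Wrap a search term in double quotes, escaping embedded double quotes."""
    return '"' + term.replace('"', '\\"') + '"'


def clause(index, relation, term):
    """Build a single search clause such as: dc.title = "cat"."""
    return f"{index} {relation} {quote_term(term)}"


def combine(boolean, left, right):
    """Join two clauses with a CQL boolean (and, or, not).

    No parentheses are added, so nest with care: CQL booleans are
    evaluated left to right unless parentheses are used.
    """
    return f"{left} {boolean} {right}"


title = clause("dc.title", "=", "cat")
author = clause("bath.author", "=", "smith")
both = combine("and", title, author)

if __name__ == "__main__":
    print(title)
    print(both)
```

The resulting string can be passed as the `query` parameter of an SRU searchRetrieve request.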
VTLS Search • Based on the Virtua system from VTLS (www.vtls.com) • VTLS: Visionary Technology in Library Solutions • Developed in C++ • Uses an Oracle database
Virtua User Interface • Scan Search • Key Word Search • Expert Search
VTLS Union Catalog Content Languages • The VTLS NDLTD Union Catalog has data in 6 different languages. These are: • English • German • Greek • Korean • Portuguese • Spanish • Examples follow
Virginia Tech ETD Union • Componentized Digital Library Software • Uses OCLC’s OAI data provider • Mirrored in China by CALIS • About 200 queries and 400 pages per day over the past year, and usage is increasing
The World According to OAI (diagram) • Service Providers: Discovery, Current Awareness, Preservation • Metadata harvesting • Data Providers
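To make the harvesting step concrete, here is a small sketch of how a service provider might page through an OAI-PMH 2.0 data provider with the ListRecords verb. The endpoint `BASE_URL`, the sample response, and the helper names are assumptions made for illustration; the verb, the `metadataPrefix` and `resumptionToken` arguments, and the XML namespace come from the OAI-PMH 2.0 protocol. No network call is made; the sample XML stands in for a server response.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

# Hypothetical data provider address, for illustration only.
BASE_URL = "http://oai.example.org/provider"


def list_records_url(metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request URL.

    resumptionToken is an exclusive argument in OAI-PMH: when it is sent,
    metadataPrefix must be omitted.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return BASE_URL + "?" + urlencode(params)


def parse_page(xml_text):
    """Return (record identifiers, next resumption token or None)."""
    root = ET.fromstring(xml_text)
    ids = [e.text for e in root.findall(".//oai:header/oai:identifier", OAI_NS)]
    token = root.find(".//oai:resumptionToken", OAI_NS)
    next_token = token.text if token is not None and token.text else None
    return ids, next_token


# Made-up response fragment with two records and a continuation token.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record><header><identifier>oai:example.org:etd-1</identifier></header></record>
    <record><header><identifier>oai:example.org:etd-2</identifier></header></record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""

ids, next_token = parse_page(SAMPLE)

if __name__ == "__main__":
    print(list_records_url())
    print(ids, next_token)
    print(list_records_url(resumption_token=next_token))
```

A harvester would repeat the request with each returned token until the response carries no token, at which point the full list has been retrieved.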
VT ETD Union System Diagram • User Interface • Search • Browse • What’s New • OCLC data provider
Open Digital Library Protocol • Extended OAI-PMH (Protocol for Metadata Harvesting)
Open Digital Library Component • Extended Open Archive
Open Digital Library Components • Running now • XML-File (data provider from file system) • Search: simple or in-memory (Essex) or generalized • Union, browse, recent, filter • E-journal/review, Submit, Edit, Annotation • Recommender, Rating; Mirroring (see JCDL’02) • Working with NCSA: from DB, unstructured text • Others in process • Classification/categorization • Registry (and other connections with web services)
Open digital library (diagram) • Content types: Document, Image, Video, Program • Open Archives (OA) linked by PMH and XPMH (extended PMH) connections
Example Open Digital Library (diagram) • ETD DL for the Networked Digital Library of Theses and Dissertations (www.ndltd.org) • ETD collections: ETD-1, ETD-2, ETD-3, ETD-4 (Document, Image, Video, Program) • ODL components linked by PMH: ODLUnion, Filter, ODLSearch, ODLBrowse, ODLRecent • User interface (Search, Browse, Recent) for students and researchers
ETD Union Search Mirror Site in China (CALIS) (http://ndltd.calis.edu.cn – popular site!)
Quality of Search Services
Service                          Composability   Efficiency   Effectiveness
OCLC SRU                         Medium          High         High
VTLS                             Low             High         Medium
Virginia Tech ETD Union Search   High            Medium       Medium
Comparison of Software
Service                          Software Used            Software License               Price of Software                          Full-Text Search
OCLC SRU                         Homegrown                N/A                            N/A                                        No
VTLS                             VIRTUA (VTLS.com)        Commercial                     Depends on user number, collection size    No
Virginia Tech ETD Union Search   Open Digital Library     BSD-like Open Source License   Free                                       No
Virginia Tech ETD collection     Ultraseek (Verity.com)   Commercial                     Depends on user number, collection size    Yes
Next Steps with VT ETD Union • Web Services based component • Easier user interface configuration • Better precision of search results; full-text? • Research studies (e.g., Ryan Richardson dissertation) • Studies of collections and genre • Summaries using concept maps • Cross-language retrieval
References: • Z39.50 International - Next Generation: http://www.loc.gov/z3950/agency/zing • VT Service: http://rocky.dlib.vt.edu/~etdunion/cgi-bin/OCLCUnion/UI/index.pl • VTLS Service: http://www.vtls.com/ndltd • OCLC Service (SRU): http://alcme.oclc.org/ndltd/SearchbySru.html
Thank You! • Paper with more details is available at URL: http://tennessee.cc.vt.edu/~lming/software/ETDSearchServices0.7.doc • DLRL: www.dlib.vt.edu, http://dlbox.nudl.org/ • Fox: http://fox.cs.vt.edu