Open Archives Initiative
Transportation Research Board Annual Meeting, January 2008
Taylor Surface, Global Product Manager, Digital Collection Services
The OCLC cooperative
• 60,457 libraries in 112 countries
• 86 million records
• 1.14 billion holdings
[Map: library counts by region: 1,148; 5,697; 48,535; 4,204; 873]
Digital Collection Services
[Diagram. Stages: Planning, Processing, Management, Discovery / Access. Services shown: OCLC Preservation Service Centers, WorldCat Harvesting Program, Implementation services, CONTENTdm Software, Web Harvesting Service, Digital Archive Service, Worldcat.org]
OAI – Open Archives Initiative
• Started late 1990s
• Spearheaded by folks running arXiv
• Problems to solve
  • Many individual repositories of research
  • Not available to general search engines (AltaVista, Excite, etc.)
• Goal: support dissemination of e-prints for research
• Result: OAI Protocol for Metadata Harvesting … sharing e-print metadata (and recognition of broader applicability)
OAI-PMH Roles
[Diagram labels: Data Providers, Service Provider, Researchers, OAI-PMH]
OAI-PMH Protocol … the verbs
How does the Service Provider speak with the Data Provider?
• Identify – tell me about yourself and capabilities
• ListMetadataFormats – tell me the formats you speak
• ListSets – tell me the names of your research collections
• ListIdentifiers – tell me the identifiers of each document
• ListRecords – tell me about each document
• GetRecord – send me a description of one document
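Each verb above is sent as an ordinary HTTP request with a `verb` parameter, so a Service Provider's side of the conversation can be sketched in a few lines of Python. This is a minimal sketch, not a full harvester: the endpoint `repository.example.org`, the record identifier, and the helper name `build_request` are hypothetical, while `metadataPrefix`, `identifier`, and `oai_dc` are names defined by OAI-PMH 2.0.

```python
from urllib.parse import urlencode

# The six verbs defined by OAI-PMH 2.0.
VERBS = {
    "Identify",
    "ListMetadataFormats",
    "ListSets",
    "ListIdentifiers",
    "ListRecords",
    "GetRecord",
}


def build_request(base_url, verb, **arguments):
    """Return the GET URL a Service Provider would send to a Data Provider."""
    if verb not in VERBS:
        raise ValueError(f"not an OAI-PMH verb: {verb}")
    # The verb always goes first; the remaining arguments follow in call order.
    query = {"verb": verb, **arguments}
    return f"{base_url}?{urlencode(query)}"


# Hypothetical Data Provider endpoint, for illustration only.
BASE_URL = "https://repository.example.org/oai"

identify_url = build_request(BASE_URL, "Identify")
list_records_url = build_request(BASE_URL, "ListRecords", metadataPrefix="oai_dc")
get_record_url = build_request(
    BASE_URL,
    "GetRecord",
    identifier="oai:repository.example.org:1234",  # hypothetical identifier
    metadataPrefix="oai_dc",
)

print(identify_url)
print(list_records_url)
print(get_record_url)
```

The Data Provider answers each request with an XML document. Large ListIdentifiers and ListRecords results are returned in chunks linked by a `resumptionToken`, which this sketch does not handle.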
OAI-PMH the benefits / issues
• Easy to get started
• Simple protocol
• Options for open source & commercial software support
• Supports communities
• The devil can be in the metadata
  • Character set differences
  • Repeatability / optionality differences
  • Consistency (e.g., punctuation)
  • Ambiguity (e.g., “On a horse”)
  • Context (e.g., DC tags vs. MARC codes)
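Two of the metadata issues listed above, character set differences and inconsistent punctuation, can be partly smoothed over when harvested values are compared. The following is a minimal sketch under stated assumptions: the function name `normalize_field` and the set of trailing characters to strip are illustrative choices, not part of OAI-PMH or of any aggregator's actual practice.

```python
import unicodedata


def normalize_field(value):
    """Normalize one harvested metadata value for comparison purposes."""
    # Compose characters so that 'e' + a combining accent equals the
    # precomposed accented letter, whichever form the repository sent.
    text = unicodedata.normalize("NFC", value)
    # Collapse runs of whitespace into single spaces.
    text = " ".join(text.split())
    # Assumption: trailing cataloging punctuation carries no meaning here.
    return text.rstrip(" /:;,.")


print(normalize_field("Cafe\u0301  society /"))
print(normalize_field("On a horse."))
```

Stripping trailing periods would damage values that end in an abbreviation, so a production pipeline would need rules per field. Ambiguity and context problems (the “On a horse” and DC-versus-MARC cases) are not addressed by string cleanup of this kind.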
OAIster … www.oaister.org
• Operated by University of Michigan
• 14,626,548 metadata records
• Gathered from 929 repositories
• Collection development policy
  • Content types – open ended
  • Freely available & restricted content allowed
• Data issues addressed in operations
Mountain West Digital Library … www.mwdl.org
• Operated by University of Utah
• Using extended OAI protocol
• Gathered from 31 institutions
• Collection development policy
  • Content types broadly defined
  • Freely available content
• Metadata guidelines
OCLC WorldCat Harvesting Program … www.worldcat.org
• 86.0 million records
• 125,980 via OAI
• 9,785 member libraries
• 100+ OAI institutions
• Collections reflect institutions
• Metadata profiling
• WorldCat syndication
[Diagram labels: Metadata Analysis & Conversion; CONTENTdm Server; WorldCat Harvesting]
Questions?
Contact: Taylor Surface, taylor_surface@oclc.org