130 likes | 291 Views
CLARINO WP2 National Registry and Long-Term Archiving. Freddy Wetjen and Oddrun Pauline Ohren National Library of Norway Bergen, 12. September 2013. National Registry of metadata. Goal Joint metadata registry of resources in all Clarino centres
E N D
CLARINOWP2 National Registry and Long-Term Archiving Freddy Wetjen and Oddrun Pauline Ohren National Library of Norway Bergen, 12. September 2013
National Registryof metadata • Goal • Joint metadata registryofresources in all Clarinocentres • Harvest data from all CLARINO centres • Exchange data withothernational CLARIN centres • Status – currentsituation • On-going and plannedactivities
National Registryof metadataStatus (1) • Metadata registryversion 1 is running • Search/browse, editing and management, butno harvesting facilities • Infrastructure: • META-SHARE infrastructure 3.0 • http://metashare.nb.no/, proxied by themanaging node http://metashare.tilde.com/ • Metadata complying META-SHARE metadata format 3.0 • No harvesting facilities • Metadata content: • 71 resources • Usage: • 11.9.2013: 37 oftheresourcesdownloaded 1-17 times • Norwegian Wordnet (Bokmål) at thetop • Topmostdownloading locations: Norway, Germany, Greece, Sweden
National Registryof metadataStatus (2) • Decisionmade: Migrate to CMDI (CLARIN platform) • Uncertainfuture for META-SHARE • 2 ys guaranteedlife span • Need for more adaptability and expressivity in metadata model • Increasedinvolvementwiththe CLARIN community
National Registryof metadataPlannedactivities • Build a basic CMDI infrastructure • Repository, editor, search service, PID scheme, harvesting • Convert metadata from META-SHARE to CMDI • Use META-SHARE profileas specified in Component Registry • Extend/adapt metadata modelaccording to need • In collaborationwiththeother CLARINO centres
Metadata modeler Infrastructure provided by CLARIN centrally META-SHARE components, a.o CMDI Metadata framework Definitions ofconcepts used in metadata components ISOcat Concept Registry • Component • editor CLARIN Component Registry Relation Registry Other trusted concept Registries «My profile» <xxxx> <yyyy> <zz> <xxxx> Joint Metadata Repository Search Service • Metadata • editor Språk-banken User Bergen Centre LAP Other centre… EDD Metadata creator TextLab Adaptation ofBroeder, D. A Data Category Registry- and Component-based Metadata Framework. LREC 2010.
National Registryof metadata; Services Clarincommoninfrastructure «Our profiles» Repository OAI/PMH harvesting • Metadata • Editor (Arbil..?) Search Services Metadata creator CMDI Weblicht VLO FCS?
Long term archiving • Metadata • editor Data Repository • Data • Delivery client -Resoures Processing and adaptation for long term storage(Checksum,pid,metadata etc.) NB long term storage (preservation)
Time perspective • Metadata registryversion 2 : Primo 2014 • Basic CMDI infrastructure • existing metadata converted from META-SHARE • OAI/PMH endpoint, butno harvesting from othercentres • Metadata registryversion 3: Mid 2015 • Extended/adapted metadata model • Harvesting from other CLARINO centres • Long term archiving: Mid 2014 withboth data and metadata.
CLARINOWP2 National Registry and Long-Term Archiving Freddy Wetjen and Oddrun Pauline Ohren National Library of Norway Bergen, 12. September 2013