Cezary Mazurek (mazurek@man.poznan.pl), Marcin Werla (mwerla@man.poznan.pl)
Poznań Supercomputing and Networking Center (Poznań, Poland)
PIONIER Network Digital Libraries Federation: Experiences of a large scale metadata aggregator
ECDL 2009, Corfu, Greece
Polish Optical Internet PIONIER
Digital libraries in the PIONIER Network – organizational models
• Main organizational models
  • Regional digital libraries
    • Created and maintained by several institutions from a particular region
    • Gather mostly resources related to the region, its history and culture, but also academic educational materials and national cultural heritage
  • Institutional digital libraries
    • Created and maintained by single institutions (like universities)
    • Gather mostly resources related to the present activities (like institutional repositories) and the history of the institution
• In many cases the technical base and support for digital libraries is provided by local computing or networking centres (like PSNC)
Digital Libraries in Poland
• Overall number of digital objects: 285 thousand
• Number of active digital libraries:
  • 19 regional
  • 21 institutional
• Number of cooperating institutions: several hundred libraries, museums and archives
• Plus several other digital libraries in the phase of planning, configuration or initial content uploading
[Figure legend: regional digital libraries, institutional digital libraries]
Digital Libraries Federation
• Main aims
  • To facilitate the use of resources from Polish digital libraries
  • To increase the visibility of these resources on the Internet
  • To create new, advanced network services, based on these resources, for both end users and digital library creators
Digital Libraries Federation
• Basic assumptions
  • No need or requirement to move resources to the DLF
  • No fees for using the DLF or for being a part of it
  • Open standards are the basis for cooperation
  • Individual digital libraries can use different technological platforms
Digital Libraries Federation
• Basic functions
  • Search in the available publications
    • Simple
    • Advanced
  • Digitization plans
    • Searchable
    • Report
    • API for the prevention of duplicated digitization
  • Location of digital objects on the basis of their OAI identifiers
  • Database of Polish digital libraries
  • Statistics and reports
• Information in the DLF is updated daily (nightly)
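Locating an object from its OAI identifier can be sketched as follows. This is a minimal illustration, not the DLF implementation: it assumes identifiers follow the common `oai:<namespace>:<local-id>` form from the OAI identifier guidelines, and the `registry` mapping and the example.org names are hypothetical.

```python
def parse_oai_identifier(identifier):
    """Split 'oai:<namespace>:<local-id>' into (namespace, local_id)."""
    parts = identifier.split(":", 2)
    if len(parts) != 3 or parts[0] != "oai" or not parts[1] or not parts[2]:
        raise ValueError(f"not an OAI identifier: {identifier!r}")
    return parts[1], parts[2]


def locate_library(identifier, registry):
    """Return the registry entry for the identifier's namespace, or None.

    `registry` is a hypothetical mapping from namespace (repository domain)
    to the address of the digital library that holds the object.
    """
    namespace, _local_id = parse_oai_identifier(identifier)
    return registry.get(namespace)


# Hypothetical registry with a single library.
registry = {"dl.example.org": "http://dl.example.org/"}
library = locate_library("oai:dl.example.org:1234", registry)
```

The local part is kept intact even when it contains further colons, because the split stops after the second one.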
Digital Libraries Federation
• See it: http://fbc.pionier.net.pl/
Digital Libraries Federation search plugin
Digital Libraries Federation as a metadata aggregator for Europeana
[Diagram labels: Metadata aggregator, Digital libraries, Institutions]
Digital Libraries Federation as a metadata aggregator for Europeana
• We gather information about content providers and their information systems
  • Database of Polish Digital Libraries in the DLF
Digital Libraries Federation as a metadata aggregator for Europeana
• We gather the metadata of objects that should be visible in Europeana
  • Done with OAI-PMH
    • In most cases we require an OAI-PMH interface
    • In really special cases we can do it in a different way (e.g. Polish Internet Library)
  • Currently we harvest only Dublin Core Simple
    • Work on a new national metadata schema started in September 2009
    • Approximate time of development: 3 months
    • Approximate time of deployment: ???
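A single harvesting step over OAI-PMH can be sketched as below. This is an illustrative sketch using only the Python standard library, not the DLF harvester: the function names and the record layout (a dict of element name to a list of `(value, language)` pairs) are assumptions made for this example, while the `ListRecords` verb, the `oai_dc` metadata prefix and the `resumptionToken` argument come from the OAI-PMH protocol.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

OAI_NS = "http://www.openarchives.org/OAI/2.0/"
DC_NS = "http://purl.org/dc/elements/1.1/"
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def list_records_url(base_url, resumption_token=None):
    """Build a ListRecords request URL for simple Dublin Core (oai_dc)."""
    if resumption_token:
        # resumptionToken is an exclusive argument: no metadataPrefix with it.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Return (records, resumption_token) from one ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iter(f"{{{OAI_NS}}}record"):
        identifier = record.findtext(
            f"{{{OAI_NS}}}header/{{{OAI_NS}}}identifier"
        )
        fields = {}
        for element in record.iter():
            # Keep only Dublin Core elements that carry a text value.
            if element.tag.startswith(f"{{{DC_NS}}}") and element.text:
                name = element.tag.split("}", 1)[1]
                value = (element.text.strip(), element.get(XML_LANG))
                fields.setdefault(name, []).append(value)
        records.append({"identifier": identifier, "fields": fields})
    token = root.findtext(f".//{{{OAI_NS}}}resumptionToken")
    token = token.strip() if token else ""
    return records, (token or None)
```

A full harvest would fetch `list_records_url(base_url)`, parse the response, and repeat with the returned token until it is `None`.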
Digital Libraries Federation as a metadata aggregator for Europeana
• We will try to clean up, normalize and enrich the metadata
  • At the DLF level, dictionaries are built automatically from the aggregated metadata
    • Separately for each metadata element
    • Separately for each metadata language
  • Differences between the metadata from various digital libraries have a negative impact on the search possibilities of end users
    • That is why metadata normalization is so important
  • A basic analysis shows which elements are crucial and which should be easy to clean up
    • The analysis was done in April 2009 on the metadata of 214 254 aggregated objects
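The idea of dictionaries built separately for each metadata element and each metadata language can be illustrated with a small sketch. The record layout (element name mapped to a list of `(value, language)` pairs) and the function name are assumptions for this example, not the DLF data model.

```python
from collections import Counter, defaultdict


def build_dictionaries(records):
    """Count values separately for each (element, language) pair."""
    dictionaries = defaultdict(Counter)
    for record in records:
        for element, values in record.items():
            for value, lang in values:
                # Trim whitespace so trivially different spellings merge.
                dictionaries[(element, lang)][value.strip()] += 1
    return dictionaries


# Two illustrative records; the values are made up for the example.
records = [
    {"subject": [("historia", "pl"), ("history", "en")]},
    {"subject": [("historia ", "pl")], "language": [("pol", None)]},
]
dictionaries = build_dictionaries(records)
most_frequent_pl_subjects = dictionaries[("subject", "pl")].most_common(10)
```

Sorting each dictionary by frequency, as with `most_common`, is one way to see which values dominate an element and which rare variants are candidates for clean-up.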
Digital Libraries Federation as a metadata aggregator for Europeana
• Format
  • In 99% of descriptions: MIME type (e.g. text/html, image/x.djvu)
• Language
  • In most cases: ISO 639-2 (pol, ger, lat, fre etc.)
  • Sometimes one value "pol, ger" instead of two values "pol" and "ger"
• Rights
  • Name of the institution which holds the original object
• Type
  • …
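The combined language values mentioned above ("pol, ger" in one field) are an example of a problem that is easy to clean up mechanically. A minimal sketch, assuming commas and semicolons are the only separators in use; the function names are hypothetical, and the shape check only tests for three lowercase letters, not membership in the ISO 639-2 code list.

```python
import re


def split_language_values(values):
    """Split combined values such as 'pol, ger' into separate codes."""
    codes = []
    for value in values:
        for part in re.split(r"[,;]", value):
            code = part.strip().lower()
            # Skip empty fragments and keep each code only once.
            if code and code not in codes:
                codes.append(code)
    return codes


def looks_like_iso639_2(code):
    """Shape check only: ISO 639-2 codes are three letters long."""
    return re.fullmatch(r"[a-z]{3}", code) is not None


codes = split_language_values(["pol, ger"])
suspicious = [c for c in split_language_values(["pol", "polish"])
              if not looks_like_iso639_2(c)]
```

Values that fail the shape check are better reported back to the content provider than guessed at by the aggregator.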
Subject – Most frequent values (Polish version of objects' descriptions)
[Table not preserved; annotation: values confused with coverage (temporal, spatial)]
Publisher – Most frequent values (Polish version of objects' descriptions)
[Table not preserved; annotation: Geographical location…]
Summary
• We have over 40 digital libraries in Poland, filled with content and metadata coming from hundreds of institutions from different domains
• We harvest the metadata and provide a single point of access to it
  • The PIONIER Network Digital Libraries Federation (http://fbc.pionier.net.pl/)
  • The software used for this service will be released as open source by the end of this year
• Cooperation with Europeana (but not only this) requires cleaning up and normalizing the metadata
  • This is currently our biggest challenge
  • But we do not want to solve it only by technical means at the level of our aggregator
  • Close cooperation with content providers, and some organizational changes prepared by them, should result in a more efficient and sustainable metadata improvement process than a purely technical solution
Thank you for your attention. Any questions?
Cezary Mazurek (mazurek@man.poznan.pl), Marcin Werla (mwerla@man.poznan.pl)
Poznań Supercomputing and Networking Center (Poznań, Poland)