1 / 21

PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator

Cezary Mazurek (mazurek@man.poznan.pl) Marcin Werla (mwerla@man.poznan.pl) Poznań Supercomputing and Networking Center (Poznań, Poland). PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator. Polish Optical Internet PIONIER.

Antony
Download Presentation

PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Cezary Mazurek (mazurek@man.poznan.pl) Marcin Werla (mwerla@man.poznan.pl) Poznań Supercomputing and Networking Center (Poznań, Poland) PIONIER Network Digital Libraries FederationExperiences of a large scale metadata aggregator ECDL 2009, Corfu, Greece

  2. Polish Optical InternetPIONIER ECDL 2009, Corfu, Greece

  3. Digital libraries in the PIONIER Network – organizational models • Main organizational models • Regional digital libraries • Created and maintained by several institutions from particular region • Gather mostly resources related to the region, its history and culture but also academic educational materials and national cultural heritage • Institutional digital libraries • Created and maintained by single institutions (like universities) • Gather mostly resources related to present activities (like institutional repositories) and history of the institution • In many cases the technical base and support for digital libraries is provided by local computing or networking centres (like PSNC) ECDL 2009, Corfu, Greece

  4. Digital Libraries in Poland • Overall number of digital objects • 285 thousands • Number of active digital libraries: • 19 regional • 21 institutional • Number of cooperating • institutions: • Several hundreds of libraries, museums and archives + several other digital libraries in the phase of planning, configuration or initial content uploading Regional digital libraries Institutional digital libraries ECDL 2009, Corfu, Greece

  5. Digital Libraries Federation • Main aims • To facilitate the use of resources from Polish digital libraries • To increase the visibility of these resources in the Internet • To create new, advanced network services both for end-users and digital libraries creators on the base of these resources ECDL 2009, Corfu, Greece

  6. Digital Libraries Federation • Basic assumptions • No need nor requirement to move resources to the DLF • No fees for the use of the DLF and for being a part of it • Open standards are the basis for cooperation • Particular digital libraries can use different technological platforms ECDL 2009, Corfu, Greece

  7. Digital Libraries Federation • Basic functions • Search in the available publications • Simple • Advanced • Digitization plans • Searchable • Report • API for the prevention of duplicted digitization • Location of digital objects on the basis of their OAI Identifiers • Database of Polish digital libraries • Statistics and reports • Information in the DLF is updated on the daily (nightly) basis ECDL 2009, Corfu, Greece

  8. Digital Libraries Federation • See it: http://fbc.pionier.net.pl/ ECDL 2009, Corfu, Greece

  9. Digital LibrariesFederationsearchplugin ECDL 2009, Corfu, Greece

  10. Digital Libraries Federation as a metadata aggregator for Europeana Metadata aggregator Digital libraries Institutions ECDL 2009, Corfu, Greece

  11. Digital Libraries Federation as a metadata aggregator for Europeana • We gather the information about content providers and their information systems • Database of Polish Digital Libraries in the DLF ECDL 2009, Corfu, Greece

  12. Digital Libraries Federation as a metadata aggregator for Europeana • We gather the metadata of objects that should be visible in Europeana • Done with the OAI-PMH • In most cases we require the OAI-PMH interface • In really special cases we can do it in different way (eg. Polish Internet Library) • Now we harvest only Dublin Core Simple • Works on new national metadata schema started in September 2009 • Approximate time of development: 3 months • Approximate time of deployment: ??? ECDL 2009, Corfu, Greece

  13. Digital Libraries Federation as a metadata aggregator for Europeana • We will try to clean-up the metadata, normalize it and enrich • On the DLF level there are automatically built dictionaries on the basis of aggregated metadata • Separately for each metadata element • Separately for each metadata language • Differences between the metadata from various digital libraries have negative impact for the searching possibilities of the end-users • That is why the metadata normalization is so important • The basic analysis shows which elements are crucial and which should be easy to clean-up • The analysis was done in April 2009 on the metadata of 214 254 aggregated objects ECDL 2009, Corfu, Greece

  14. Digital Libraries Federation as a metadata aggregator for Europeana ECDL 2009, Corfu, Greece

  15. Digital Libraries Federation as a metadata aggregator for Europeana • Format • In 99% of descriptions: MIME type(eg. text/html, image/x.djvu) • Language • In most cases: ISO 639-2 (pol, ger, lat, fre etc.) • Sometimes one value „pol, ger” instead of „pol”, „ger” • Rights • Name of the institution which holds the original object • Type • … ECDL 2009, Corfu, Greece

  16. Digital Libraries Federation as a metadata aggregator for Europeana ECDL 2009, Corfu, Greece

  17. Digital Libraries Federation as a metadata aggregator for Europeana ECDL 2009, Corfu, Greece

  18. Subject - Most frequent values (Polish version of objects’ description) Confused with coverage: temporal spatial ECDL 2009, Corfu, Greece

  19. Publisher – Most frequent values (Polish version of objects’ description) Geographical location… ECDL 2009, Corfu, Greece

  20. Summary • We have over 40 digital libraries in Poland which are filled with content and metadata coming from hundreds of institutions from different domains • We harvest the metadata and provide a single point of access to it • The PIONIER Network Digital Libraries Federation (http://fbc.pionier.net.pl/) • The software used for this service will be released as an open-source by the end of this year • Cooperation with Europeana (but not only this) requires cleaning-up and normalization of metadata • This is currently our biggest challenge • But we do not want to solve it only by technical means on the level of our aggregator • Close cooperation with content providers and some organizational changes prepared by them should effect in more efficient and sustainable metadata improvement process than a purely technical solution ECDL 2009, Corfu, Greece

  21. Cezary Mazurek (mazurek@man.poznan.pl) Marcin Werla (mwerla@man.poznan.pl) Poznań Supercomputing and Networking Center (Poznań, Poland) PIONIER Network Digital Libraries FederationExperiences of a large scale metadata aggregator Thank you for your attention. Any questions? ECDL 2009, Corfu, Greece

More Related