Translingual Retrieval: Moving between vocabularies
MACS 2010
Jahns / Karg, Deutsche Nationalbibliothek
Concepts in Context - Cologne Conference on Interoperability and Semantics in Knowledge Organization, 2010-07-19
MACS Project history
• 1997: CENL-MACS project started
• 1997-1999: feasibility study (sports & theatre)
• 2000-2001: prototype of a web application
• 2002-2004: LMI (link management interface) established
• 2005: RAMEAU-LCSH links loaded by BnF
• 2006: CrissCross started
MACS Project progress
• 2007: SNB started editing links on a regular basis
• 2008: SNB and DNB prepared the division of work
• 2008: update and installation of an update routine for the three indexing languages
• 2008-2009: recruiting freelancers, training, translators' work phase
• 2009: TEL prototype of a multilingual search interface
• 58,907 EN-FR-GER links
MACS Project scope
• 70,000 RAMEAU-LCSH tuples from BnF
• 50,000 SWD concepts to connect
• SWD topical headings of ALL disciplines
• 12 work packages for neighbouring disciplines
MACS Working area – LMI
https://macs.hoppie.nl/lmi/
• Searching and listing subject headings
Working area – LMI: Update mechanism and identifier
Working area – LMI: Searching links
Working area – LMI: Editing links / Creating new links
Working area – LMI: Filtering DDC domains
Methodology
• No language is used as a pivot
• Headings are not translated
• Mapping is done on the basis of concepts
• Linking is checked against retrieval results
• Set of practical rules and guidelines (BS 8723)
Language differences
• LCSH, RAMEAU: pre-coordinated strings / plural
• SWD: more flexible; post-coordination / singular
• LCSH, RAMEAU: subdivision practice
Examples (SWD / LCSH / RAMEAU):
• 160,000 / 280,000 / 96,000
• Frauenarbeit / Women -- employment / Femmes -- travail
• Mörder AND Intensivtäter / Serial murderers / Tueurs en série
Types of links
• Equivalence: 1:1, 1:n (including AND)
• Null: no mapping
Examples (RAMEAU / SWD / LCSH):
• Théologie / Theologie / Theology
• Athlétisme AND Entraîneurs / Leichtathletiktrainer / Track and field coaches
• Traducteurs / Übersetzer / Translators
• Traducteurs / Dolmetscher / Translators
• 0 / Erdalkalichloride / Alkaline earth chlorides
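The link types above can be pictured as a small data structure. The following is only a minimal sketch: the `Link` class, its field names, and the `null_links` helper are hypothetical and do not reflect how the LMI actually stores links. A heading is modelled as a tuple of terms, so that several terms stand for an AND combination, and `None` stands for a null mapping.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# One heading = one or more terms; more than one term means an AND combination.
Heading = Tuple[str, ...]


@dataclass(frozen=True)
class Link:
    """Hypothetical trilingual link; None marks a null mapping for that vocabulary."""
    rameau: Optional[Heading]
    swd: Optional[Heading]
    lcsh: Optional[Heading]

    def display(self) -> str:
        """Render the link in slide notation: RAMEAU / SWD / LCSH, with 0 for null."""
        def fmt(heading: Optional[Heading]) -> str:
            return "0" if heading is None else " AND ".join(heading)
        return " / ".join(fmt(h) for h in (self.rameau, self.swd, self.lcsh))


# Example links taken from the slide.
LINKS: List[Link] = [
    Link(("Théologie",), ("Theologie",), ("Theology",)),
    Link(("Athlétisme", "Entraîneurs"), ("Leichtathletiktrainer",),
         ("Track and field coaches",)),
    Link(None, ("Erdalkalichloride",), ("Alkaline earth chlorides",)),
]


def null_links(links: List[Link]) -> List[Link]:
    """Return the links in which at least one vocabulary has no mapping."""
    return [l for l in links if None in (l.rameau, l.swd, l.lcsh)]


for link in LINKS:
    print(link.display())
```

Keeping the AND combination inside a single heading, rather than as two separate links, mirrors the 1:n case on the slide, where one SWD concept corresponds to a combination of RAMEAU headings.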
Equivalence
• exact
• inexact = overlapping
• partial equivalent
Examples (RAMEAU / SWD / LCSH):
• Étudiants japonais / Japanischer Student / Japanese students
• Impérialisme / Neokolonialismus / Imperialism
• Cas (linguistique) / Dativ / 0
Mapping problems
• Different coverage
• Different semantics, semantic relations
• Language skills
Examples (LCSH / SWD / RAMEAU):
• Cider / Apfelwein / Cidre
• Civilization, Ancient / Antike? Altertum! / Civilisation antique
Access to mappings
• Mapped terms in LMI -> XML download
• Linked Open Data
• Multilingual equivalent relations in authority records
Linked Data: Publishing MACS on the web
• Linked Data: datasets on the web linked together semantically, forming a publicly accessible network of structured information
Why Linked Data?
• Has become the de facto standard for:
  • publishing data on the Internet
  • exchanging data on the Internet
• Reaches more collaboration partners and customers
  • Research institutions
  • Cultural heritage organizations
  • Internet companies
  • Online communities
• Library data can play a major role in the semantic web
  • provides reliability and stability
  • can become a "backbone of trust"
MACS as Linked Data
• Linked Data published by the German National Library (as of May 30th, 2010)
  • 37,547 links from SWD to LCSH
  • 28,249 links from SWD to RAMEAU
• RAMEAU and LCSH also publish MACS relations as part of their Linked Data
  • Our data has been used to update the RAMEAU-SWD links
  • LCSH-SWD links will follow
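As an illustration of what such a published link could look like, the sketch below writes two mapping statements as N-Triples. It assumes the links are expressed with the SKOS mapping properties `skos:exactMatch` and `skos:closeMatch`; the slides do not say which vocabulary the German National Library actually used. The `example.org` URIs and the `match_triple` helper are hypothetical placeholders, not real identifiers.

```python
# SKOS core namespace (W3C); exactMatch and closeMatch are SKOS mapping properties.
SKOS = "http://www.w3.org/2004/02/skos/core#"


def match_triple(subject_uri: str, object_uri: str, exact: bool = True) -> str:
    """Build one N-Triples statement linking two concept URIs."""
    predicate = SKOS + ("exactMatch" if exact else "closeMatch")
    return f"<{subject_uri}> <{predicate}> <{object_uri}> ."


# Hypothetical placeholder URIs; the real concept identifiers are not given in the slides.
swd = "http://example.org/swd/Theologie"
lcsh = "http://example.org/lcsh/Theology"
rameau = "http://example.org/rameau/Theologie"

triples = [
    match_triple(swd, lcsh),    # SWD -> LCSH
    match_triple(swd, rameau),  # SWD -> RAMEAU
]
print("\n".join(triples))
```

An inexact (overlapping) link could be written with `exact=False`, which switches the predicate to `skos:closeMatch` under the same assumption.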
Applications of MACS Data
• Improved multilingual search functionalities
• Europeana
• Deutsche Digitale Bibliothek
• Website of the German National Library
Cataloguers benefit Automatic Indexing: 5050 610 5100 !040537242! |s|Schwangerschaft ; 5101 !040153320! |s|Ernährung
Users benefit
• Search in one's native language
• Enlargement of recall
• Additional recall in one's own catalogue
• Enlargement of the query to foreign catalogues and portals
• Precision of recall
Users benefit
• Search term: Ärztlicher Behandlungsfehler (medical malpractice)
• Documents in: DNB, SNL, LoC, BL, BnF and so on
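The recall gain described above comes from expanding one search term into its linked headings before querying other catalogues. The sketch below shows the idea with an in-memory table; `MAPPINGS` and `expand_query` are hypothetical names, and the two entries are example links from the earlier slides rather than an extract of the real MACS data.

```python
from typing import Dict, List

# Hypothetical in-memory link table; the real links are maintained in the LMI.
MAPPINGS: List[Dict[str, str]] = [
    {"swd": "Frauenarbeit", "lcsh": "Women -- employment", "rameau": "Femmes -- travail"},
    {"swd": "Übersetzer", "lcsh": "Translators", "rameau": "Traducteurs"},
]


def expand_query(term: str, source: str = "swd") -> Dict[str, str]:
    """Return one search heading per vocabulary for the first link matching the term.

    If no link is found, the query stays restricted to the source vocabulary.
    """
    for link in MAPPINGS:
        if link[source].casefold() == term.casefold():
            return dict(link)
    return {source: term}


# A German-speaking user searches once; each catalogue is queried with its own heading.
queries = expand_query("Frauenarbeit")
for vocabulary, heading in queries.items():
    print(f"{vocabulary}: {heading}")
```

Returning only the first matching link keeps the sketch short; a fuller version would collect every link for a heading, since one heading can take part in several links (as Traducteurs does on the types-of-links slide).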
Multilingual meta-searching
• One-stop searching for heterogeneously indexed documents
• Authority data (mapping) displayed to enhance searching and understanding
• Getting results from different libraries
• Europe comes together: TEL, Europeana
Library users of multicultural societies
• Searching for subjects in the preferred language of the user
• Crossing language barriers in academic and public libraries
• 12% foreign students in Germany
Thank you! Merci Danke h.karg@d-nb.de y.jahns@d-nb.de