260 likes | 420 Views
Using Estonian Subject Thesaurus in digital environment. Sirje Nilbe Tiiu Tarkpea. Outline. Estonian Subject Thesaurus (EMS) Thesaurus and the online catalogues of libraries Databases of articles The Estonian National Bibliography Database Digital Archive DIGAR
E N D
Using Estonian Subject Thesaurus in digital environment Sirje Nilbe Tiiu Tarkpea
Outline • Estonian Subject Thesaurus (EMS) • Thesaurus and the online catalogues of libraries • Databases of articles • The Estonian National Bibliography Database • Digital Archive DIGAR • University of Tartu Digital Archive on DSpace • Other potential users • Conclusions
Estonian SubjectThesaurus (EMS)http://ems.elnet.ee • A controlledvocabularyofuniversalcoverageforinformationretrieval and indexing • WastakenintouseunderthisnameinMay 2009 • Predecessors: theThesaurusoftheUniversityof Tartu Library (1994) and theEstonianUniversalThesaurus (1999) • Mergingprojectcarried on 2007 – 2009 => EstonianSubjectThesaurus
Estonian SubjectThesaurus Co-managed by ELNET Consortium, the National Library of Estonia and The University of Tartu Library EMS is developed by a manager, a programmer and 9 authorised editors, all with part-time contribution Free for Estonian libraries and other institutions Contains over 54 000 terms (36 500 preferred, 17 500 non-preferred) The user interface is in Estonian and in English EMS mostly serves as a tool for post-coordinated indexing
EMS databaseenables • to search by each term in the online catalogue ESTER, in the database of Estonian articles ISE, in the database of Estonian National Bibliography and in Google • to subscribe current awareness service for new, changed and deleted subject terms • for using the data in the other systems the subject term records can be exported in MARC21 format • a word retrieval web service for machine-to-machine interaction includes machine-readable MARC21, eye-readable MARC21 or MARC XML formats
EMS intheonlinecataloguesoflibrariesE-catalogue ESTERhttp://tallinn.ester.ee; http://tartu.ester.ee • Online catalogue ESTER is the shared catalogue of the ELNET Consortium member libraries • 13 libraries, 3 million bibliographic records • Software: integrated library system Millennium from US Company Innovative Interfaces Inc. • Facilities for development of authority files in the library system are poor • Since May 2009 the subject authority records are no longer compiled manually • The authority data are updated twice a month on the basis of extentions and corrections in EMS database • A MARC21 file is exported from the thesaurus system and loaded into the library system • In the subject index of the catalogue, cross references from authority records are arranged between data indexed from bibliographic records.
EMS intheonlinecataloguesoflibrariesOthercatalogues • Compiledmostlybycopycataloguingfrom ESTER • Donotsupportauthoritycontrol • Special, public and schoollibrariesoftenrichtheirrecordswithadditionalsubjectterms • Therecordssometimescontainsynonymousdescriptors and nonpreferredtermstoincreasethe number ofaccesspoints
DatabaseofarticlesIndexScriptorumEstoniae (ISE)http://ise.elnet.ee • A result of cooperation between 12 libraries in the ELNET Consortium (2009) • Maintained in the additional module Reference Database of the library system Millennium, thus the functionality and appearance of it resemble the e-catalogue ESTER • Contains articles from newspapers, magazines and journals, serial publications and anthologies • Full-text accessed in free digital archives and Web publications
DatabaseofarticlesIndexScriptorumEstoniae (ISE) • Present standard for subject indexing is the Estonian Subject Thesaurus • The authority data are updated twice a month • Old separate databases had their own local vocabularies, we have to harmonise the records with EMS that takes several years • EMS doesn't always meet the needs of subject indexing of periodicals (local history in local newspapers for example)
Estonian NationalBibliographyDatabase (ERB)http://erb.nlib.ee/ • ERB is compiled and managed by the National Library of Estonia • Software: developed locally on the basis of MySQL • Items are catalogued, classified and indexed in Millennium, exported and loaded into ERB • No changes can be made in ERB database • Subject search can be carried out by subject terms, UDC numbers and keywords (title and subject fields) • ERB does not contain authority data • A user can click on EMS logo to find terms or start with EMS and find ERB records this way
DigitalArchiveDIGARhttp://digar.nlib.ee • DIGAR is the digital archive of the National Library of Estonia • The archive preserves: • Estonian online publications issued on the Internet • digital copies of Estonian electronic publications issued on physical carriers • digital copies of publications on analog carriers • print files of Estonian publications • Objects, stored in DIGAR, have been described in ESTER or ISE • Records in these databases and full texts in DIGAR are linked • The metadata of DIGAR is mostly imported from ESTER, incl. subject descriptors • The metadata is supplemented by the English-language equivalents of the descriptors from EMS
Universityof Tartu DigitalArchive on DSpacehttp://dspace.utlib.ee • The aim of the repository is to collect, preserve and make available the digital information of the university faculties, departments and institutes • Publicly available via Internet search engines and can be interfaced by OAI data exchange protocols • The repository preserves: • print files of doctoral theses published by the University of Tartu • original files created in the university, e. g. conference papers, articles, reports, textbooks, personal archives etc. • Digital copies of analog carriers • Online publications issued on the Internet • Digital copies of electronic publications issued on physical carriers
Universityof Tartu DigitalArchive on DSpace • Records in DSpace are searchable via online catalogue ESTER, which has a link to full text in repository • Additionaly, English equivalents of subject terms are added manually • If so desired, one can add subject terms or keywords in any language (preferably in the language of original paper) • It is possible to create subject term patterns added by default for a collection
Otherpotentialusers • Other knowledge institutions (museums and archives) are starting to develop their information system • They need controlled vocabularies to describe their specific records and artefacts • EMS seems to be too extensive for them, but lacking many specific terms at the same time • EMS as a language and terminology resource • EMS and EuroTermBank
Conclusions • EMS tries to consider the needs of all interest groups • Includes large amount of terms for specific and exhaustive indexing • Seems to be too extensive for archives and museums • EMS is available as a web database and authority records in MARC21 format • Authority control is supported by the online catalogue ESTER and article database ISE • The other systems do not support authority records, thus EMS loses lot of its advantage as a thesaurus • In the digital collections metadata often include English-language equivalents • Further steps: conformity to the semantic web standards like SKOS and Open Linked Data
Thankyou! sirje.nilbe@nlib.ee tiiu.tarkpea@ut.ee