330 likes | 542 Views
VIAF Council Meeting, Singapore, 2013-08-16. Janifer Gatenby. OCLC. Improving works and expressions in VIAF. V IAF and WorldCat/WorldCat Identities. . Please note:. Excerpts from the following full presentation were presented by Janifer Gatenby to the VIAF Council meeting 2013-08-16.
E N D
VIAF Council Meeting, Singapore, 2013-08-16 JaniferGatenby OCLC Improving works and expressions in VIAF VIAF and WorldCat/WorldCat Identities.
Please note: Excerpts from the following full presentation were presented by JaniferGatenby to the VIAF Council meeting 2013-08-16
IFLA, Singapore, 2013-08-19 Karen Smith Yoshimura Eric Childress Janifer Gatenby Jean Godby Richard Greene Jenny Toves Diane Vizine Goetz Robert Bremer JD Shipengrover Gail Thornburg Jay Weitz Multilingual WorldCatpresented by Janifer Gatenby
WorldCat Today • Resources in nearly all languages • Contributed by more than 20,000 libraries worldwide • More than half the database is for works not in English
WorldCat Today • Bibliographic Records • Hybrid records • Parallel records • Clustered at Work level (FRBR)
Existing Architecture Subj Classif Subj Classif Authors Subj Classif Authors Authors Holding Holding Holdings Bibliographic record Work cluster Manifestation cluster Content cluster
Complementary Initiatives Work Level Record GLIMIR Manifestation & Content Clusters Multi-lingual Bibliographic Structure
Work Level Record http://www.oclc.org/research/activities/workrecs.html
Work Level Record: Objective Create a landing page summarizing content for a work
GLIMIR • The Content Cluster • Enables better work record displays by reducing the number of lines that display for large works • Enables a choice of format and presents the formats that could be acceptable substitutes • Consolidates holdings for identical content • The Manifestation Cluster is important • Consolidates holdings at manifestation level • In the short term allows the record catalogued in the language of the interface to be chosen for display • Reduces apparent duplication • Allows a more accurate count of the number of manifestations in WorldCat (as opposed to the number of records)
Multilingual Bibliographic Structure Project Creates true multi-lingual displays • At work and manifestation levels • Using all available data instead of “most appropriate record” • Generates data Corrects many of the 28 million records coded “und” Better control and linking of translations Input to refinement of work clusters Smarter data storage
“Most appropriate” questioned • Worldcat.org selects the most appropriate record to show to a user as representative of the work in the short result list and beyond • The end result will not be very satisfactory from a multi-lingual viewpoint… here’s why
Most appropriate display Build the display from all available data
Multilingual Bibliographic Structure Project • Work level data, mined from all associated bibliographic records will be displayed supplemented with expression / manifestation level data as the user drills through the short to fuller versions of the metadata. End user interface will show works and manifestations not bibliographic records; the cataloguing client will also show bibliographic records
Proposed new architecture eng ger jpn o fre Notes Contents ++ Authors Authors fre jpn Authors eng Subj Classif Subj sif ger fre Holding jpn Holding Work Holding eng ger jpn fre eng ger fre Translations (Language of work) Holding Manif eng Holding Manif eng Manif eng Manif eng ger Manif eng Manif eng Manif fre fre jpn eng
Important principles • Language tagging of elements, particularly • Summaries (M21 520) • Subject headings • Display in script preferred by the user if data is available • Improve translated interfaces • Show consolidated holdings as appropriate
Great works are translated • The cream of the world’s cultural and knowledge heritage is shared by being translated • WorldCat contains many rich cataloguing records for these translations GOAL: Data mine the really good records to improve clustering, presentation, authority records and linked data
Translations • Inconsistencies causing work clusters to be incomplete & less than optimal search results • Titles without subtitles • Different forms of uniform title or missing uniform title • Inverted title • Different coding of original and translated information Generated uniform title authority records will overcome most of these differences without needing to edit individual records
Generate uniform title authority records • Improve FRBR work groups • Made by data mining • Contribute to VIAF • Diffuse via VIAF as linked data • Possibility to create web page / web service
Translation records in VIAF • Will enrich VIAF significantly • New elements - translated title and translator
Diffusion of Translation records • Records are freely available to the world from VIAF in • MARC-21 • XML • RDF (linked data) • Just links in JSON • And other formats as introduced
We don’t know now, but soon will • # of manifestations as opposed to # of records • # of works that have translations • Top translated authors and works • And more