VIAF Global Council - Lyon, France 15 August 2014. Janifer Gatenby. VIAF and ISNI Synchronisation. EMEA Program Manager Metadata. bridging-domains. cross-domain. Text Rights. Trade Sources. Music Rights. Archives and Museums. Encyclopaedias. Libraries. Researchers & Professional
VIAF Global Council - Lyon, France 15 August 2014 Janifer Gatenby VIAF and ISNI Synchronisation EMEA Program Manager Metadata
Bridging domains (cross-domain): Text Rights, Trade Sources, Music Rights, Archives and Museums, Encyclopaedias, Libraries, Researchers & Professional, Granting organisations, Professional Societies, Article databases, Theses databases
ISNI Status at July 2014
• 8.01 million assigned ISNIs (was 1 million 2 years ago)
• 15.4 million links; ISNI as linked data
• ORCID Registration process is accessing ISNI
• New members: Harvard University, La Trobe University and COPYRUS (Russia)
• Linked Content Coalition names ISNI as #1 strategy
VIAF and ISNI are Complementary
VIAF Scope
• Persons
• Organisations
• Works / uniform titles
• Expressions
• Meetings
• Geographic
• All public data
ISNI Scope
• Persons
• + musicians, researchers
• Organisations
• (excluding sparse)
• (excluding undifferentiated)
• Includes private data
VIAF and ISNI are Complementary
VIAF Role
• Ingest authority records from the world’s major national and research libraries
• Make clusters
• Expose and diffuse
ISNI Role
• Create permanent IDs
  • By batch
  • On demand
• Diffuse those IDs
  • Libraries, trade, rights management, professional societies, educational institutions
VIAF and ISNI are Complementary
VIAF System
• Harvester
• Clustering mechanism (re-clustered monthly)
• 5 web interface languages
• Download in multiple formats
• Linked data & SRU
• 1 million personal visitors p.a.
ISNI System
• Batch load
• Online request API
• Web site (English only)
• Allows end user input
• Member input and correction
• 16+ indexes
• SRU; linked data
• Quality Team monitoring & correcting
• Diffusion, including corrections
VIAF ingest into ISNI
• VIAF provides full file each month
• ISNI compares previous & current files & creates separate files for processing
  • Deletes (VIAF cluster ID in old but not new)
    • If assigned or has other sources, source becomes ISNI
  • Contents changed
  • Sources added or deleted
  • New (VIAF cluster ID in new but not old)
    • Re-matches VIAF deletes
• VIAF cluster movement reports for BL and BnF
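The monthly comparison described above can be sketched as a set difference on VIAF cluster IDs. This is only an illustration under stated assumptions: the `Cluster` record, its fields and the function name `diff_monthly_files` are hypothetical and do not reflect the actual ISNI load programs or the VIAF file format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cluster:
    """Hypothetical, simplified view of one VIAF cluster in a monthly file."""
    cluster_id: str
    sources: frozenset  # identifiers of the source records in the cluster
    data: str = ""      # stand-in for names, titles and other content


def diff_monthly_files(previous, current):
    """Split two monthly files (dicts keyed by cluster ID) into work lists.

    Returns sorted lists of cluster IDs for:
      deleted          - ID in the old file but not in the new one
      new              - ID in the new file but not in the old one
      sources_changed  - same ID, sources added or deleted
      contents_changed - same ID and sources, other content differs
    """
    prev_ids, curr_ids = set(previous), set(current)
    common = prev_ids & curr_ids
    return {
        "deleted": sorted(prev_ids - curr_ids),
        "new": sorted(curr_ids - prev_ids),
        "sources_changed": sorted(
            c for c in common if previous[c].sources != current[c].sources
        ),
        "contents_changed": sorted(
            c for c in common
            if previous[c].sources == current[c].sources
            and previous[c].data != current[c].data
        ),
    }


# Made-up example data: cluster 101 disappears, 104 is new.
previous_file = {
    "101": Cluster("101", frozenset({"SRC1|a"})),
    "102": Cluster("102", frozenset({"SRC1|b"}), "old name form"),
    "103": Cluster("103", frozenset({"SRC2|c"})),
}
current_file = {
    "102": Cluster("102", frozenset({"SRC1|b"}), "new name form"),
    "103": Cluster("103", frozenset({"SRC2|c", "SRC3|d"})),
    "104": Cluster("104", frozenset({"SRC1|a"})),
}
print(diff_monthly_files(previous_file, current_file))
```

The "new" list is where a re-match of deleted clusters would start: in the made-up data, the source record from deleted cluster 101 reappears in new cluster 104.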
Maintaining Clusters
Mixed identities
[Diagram: two cases, "Cluster Error" and "Source Error", with records labelled Source 1 and Source 2]
End User Note Dear Sir / Madam, The ISNI 0000000117488848 refers to "Marco Antonio Casanova", Professor at the Catholic University of Rio de Janeiro. I am not the author of "Fragmentos póstumos. - Nietzsche uma introdução filosófica" or "Segunda consideração intempestiva da utilidade e desvantagem da história para a vida". The author of these works is "Marco Antonio dos Santos Casa Nova". You may confirm this information by consulting our CVs at the Brazilian Research Council: Marco Antonio Casanova (me): http://lattes.cnpq.br/0400232298849115 Marco Antonio dos Santos Casa Nova (the other author): http://lattes.cnpq.br/3409704326617178
Correction – Source Error
• Reply to End User
Thank you for using the ISNI database and suggesting improvements to your record. There is now another ISNI record for Marco Antonio dos Santos Casa Nova (ISNI 0000 0004 3077 6045). I have corrected your record, removed the erroneous titles and added a link to your online CV (Lattes database). If you have any further queries, please let me know.
• Email to Source
I am part of the ISNI Quality Team (experts from the British Library and Bibliothèque nationale de France in charge of the quality of the ISNI database). We perform manual checking and corrections in the ISNI database such as splits, merges/deduplications and data corrections. The ISNI Quality Team received a request from an end user about ISNI records 0000 0001 1748 8848 and 0000 0004 3077 6045. VIAF 19998588 and the related authority record XXX 109895029 mix 2 identities (see the snapshot below):
1/ Marco Antonio Casanova (ISNI 0000 0001 1748 8848)
2/ Nova, Marco Antonio dos Santos Casa (ISNI 0000 0004 3077 6045), philosopher and author of "Segunda consideração intempestiva da utilidade e desvantagem da história para a vida"
I hope this information will be useful.
[Diagram labels: Source 1, Source, ISNI]
Correction – Cluster Error
• ISNI marks its two records as verified & sends to VIAF
• These records are given the same status as XA records in VIAF clustering.
• No two XA records may occur in the same cluster
[Diagram labels: Source, ISNI]
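The rule that no two XA records may occur in the same cluster can be pictured as a simple check over cluster membership. The sketch below is an assumption-laden illustration, not VIAF's clustering code: the tuple layout and the name `find_xa_conflicts` are hypothetical, and "XA" here simply means a record carrying XA status, including a verified ISNI record treated as XA.

```python
from collections import defaultdict


def find_xa_conflicts(records):
    """Return {cluster_id: [record_ids]} for clusters holding 2+ XA records.

    `records` is an iterable of (cluster_id, record_id, is_xa) tuples,
    a hypothetical flattening of cluster membership.
    """
    xa_by_cluster = defaultdict(list)
    for cluster_id, record_id, is_xa in records:
        if is_xa:
            xa_by_cluster[cluster_id].append(record_id)
    # A cluster with more than one XA record breaks the rule and must be split.
    return {cid: ids for cid, ids in xa_by_cluster.items() if len(ids) > 1}


# The Casanova case from the slides: two verified ISNI records
# arriving for the same VIAF cluster.
membership = [
    ("19998588", "ISNI 0000000117488848", True),
    ("19998588", "ISNI 0000000430776045", True),
    ("19998588", "XXX 109895029", False),
]
print(find_xa_conflicts(membership))
```

A cluster reported by such a check is one that the clustering would have to split so that each verified record ends up in its own cluster.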
End User Note • It seems 2 ISNIs has been assigned to the French singer Laïka Fatien (born 1968 in Paris): ISNI 0000 0000 8065 8419 and ISNI 0000 0000 7238 637X. I think the last one can be deleted.
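End users quote ISNIs both with and without spaces, and the last character may be "X". The slides do not describe the identifier format; the sketch below relies on the fact that the final ISNI character is an ISO 7064 MOD 11-2 check character, and the function names are hypothetical helpers rather than part of any ISNI API.

```python
import re


def normalize_isni(text):
    """Strip spaces and upper-case, e.g. '0000 0000 7238 637x' -> '000000007238637X'."""
    return text.replace(" ", "").upper()


def isni_check_character(first15):
    """Compute the ISO 7064 MOD 11-2 check character for 15 digits."""
    total = 0
    for digit in first15:
        total = (total + int(digit)) * 2
    value = (12 - total % 11) % 11
    return "X" if value == 10 else str(value)


def is_valid_isni(text):
    """True if the text is 15 digits plus a matching check character."""
    compact = normalize_isni(text)
    if not re.fullmatch(r"[0-9]{15}[0-9X]", compact):
        return False
    return isni_check_character(compact[:15]) == compact[15]


# The two ISNIs quoted in the note above.
for isni in ("0000 0000 8065 8419", "0000 0000 7238 637X"):
    print(isni, is_valid_isni(isni))
```

A check of this kind only catches typing errors in the identifier; it says nothing about whether two valid ISNIs are duplicates of one identity.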
Correction – Merged duplicate
• Reply to End User
  • Thank you for using the ISNI database and providing us with information about the duplicate records for Laïka Fatien.
  • There is now just one record on the ISNI database for this identity – ISNI: 0000 0000 8065 8419.
  • If you have any further queries, please let me know.
• Notification to VIAF via ISNI record
  • ISNI record contains verification note (i.e. treat as XA)
  • ISNI record contains 2 VIAF cluster identifiers
[Diagram labels: VIAF A, VIAF B, ISNI]
ISNI Quality Team
• Samples data regularly
  • c. 2% of VIAF clusters have mixed identities
  • Duplicate clusters are higher, nearer 5%
• Makes corrections at cluster level
  • Merges, splits, error notifications
  • Access to cataloguing client / macros
• Makes system recommendations
• Gives approval for single source assignment
• Responds to End User input
• Sends emails to sources for error correction (12 VIAF sources currently participating)
ISNI System Notification (Push process)
[Screenshot annotations: "Someone else has matched & details"; "You probably need to take action"]
ISNI Assignment Agency
• Matching, merging and splitting infrastructure
• Correction of errors
• Sampling and anomaly checks
  • e.g. date anomalies, unlikely mixture of sources
• Pseudonym splitting
• Re-importing and re-matching
• Diagnostic indexes and reports
• Enrichment
  • e.g. Wikipedia, Dewey
• Notification system
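As an illustration of what a date anomaly check might look like, the sketch below flags two simple cases. The slides do not say which rules the Assignment Agency applies, so both rules, the function name and the parameters are assumptions made for the example.

```python
def date_anomalies(birth_year, death_year, publication_years):
    """Return a list of human-readable anomaly descriptions (empty if none).

    Hypothetical rules: a death year earlier than the birth year, and
    works published before the birth year. Either may hint at a mixed identity.
    """
    problems = []
    if birth_year is not None and death_year is not None and death_year < birth_year:
        problems.append("death before birth")
    if birth_year is not None:
        early = sorted(y for y in publication_years if y < birth_year)
        if early:
            problems.append(f"published before birth: {early}")
    return problems


# Made-up record: born 1968, but one attached title dates from 1950.
print(date_anomalies(1968, None, [1950, 2008]))
```

Records flagged this way would still need human review, since a report of this kind only points at candidates for a split.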
VIAF ISNI Interoperability Task Force
• Met in Paris 22-23 April 2014
• Representatives from
  • Bibliothèque nationale de France
  • Biblioteca Nacional de España
  • British Library
  • Deutsche Nationalbibliothek
  • Sudoc
  • OCLC (VIAF system)
  • OCLC Leiden (ISNI Assignment Agency)
Recommendations to VIAF at OCLC
• Use profession and other disambiguating data
• Investigate making an anomaly report
• Investigate changing the clustering rules to flag and prevent a record with a mixed identity from entering the clusters where 2 or more sources have established separate identity
• Investigate changing the clustering rules to prevent duplicate clusters
• Provide deprecated VIAF IDs in the distributed data
• Treat records from ISNI that are flagged as manual as XA records
• Include ISNI in RDF
• Remove test from ISNI icon
• Only show one name form for ISNI in the wheel
• Investigate why SUDOC titles are not appearing
Recommendations to ISNI at OCLC
• Flag manual merges and splits (joint specification to be made)
• Indicate to VIAF that a VIAF source needs to be split from a VIAF cluster (joint specification to be made)
• Keep up to date with VIAF
• Produce anomaly reports
• Produce notifications to VIAF sources
• [Provide only one ISNI record per VIAF cluster ID; make split off records ISNI source]
• [Provide records with ISNI source to VIAF]
Recommendations to VIAF Council
• Mark undifferentiated authorities or consider not supplying them to VIAF
• Include nationality, particularly for own national identities
• Use VIAF in authority control and select VIAF cluster ID
• Also use ISNI
• If a mixed identity is found in VIAF or ISNI, use either the public interface or [preferably] the member interface of ISNI to request resolution by the ISNI Quality Team. All manual corrections made in ISNI will come to VIAF as records with XA status to ensure merges or splits.
Become Involved
Jointly let’s maintain clusters
The ISNI Quality Team
• Board members are British Library and Bibliothèque nationale de France (representing CENL)
• Seeking Associate Members
  • KB, Netherlands in process
  • Control own identities
  • Access to client maintenance software
  • Access to restricted data
  • Provide back-up for end user responses
ISNI Members
• View whole database (but not restricted fields)
• Access to compare screen; can merge
• Reports on request
  • ISNIs – simple report or enhanced
  • Cluster movement report
  • Diagnostic reports
• Statistics and links
ISNI Database: Member view
[Screenshots: Public view, Member view]
Member view – list of additional data displayed (if not private)
• Related identities
  • Related persons
  • Related organisations
• Nationality
• Gender
• Keyword or key phrase
• Dewey classification
• Publisher
• Dates active
• Associated countries
• Provisional records
  • Including links to possible matches, if applicable
Private data
• Dates
• Personal Affiliations
• Titles of works
These can be masked from the public and from member view. However, most sources allow titles to be seen by other members to facilitate merging.
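The masking described above can be pictured as a per-field list of views from which a field is hidden. This is a minimal sketch under assumptions: the field names, the `masks` structure and the function `visible_record` are hypothetical and say nothing about how the ISNI database actually stores privacy settings.

```python
VIEWS = ("public", "member")


def visible_record(record, masks, view):
    """Return a copy of `record` without the fields masked for `view`.

    `masks` maps a field name to the set of views it is hidden from.
    """
    if view not in VIEWS:
        raise ValueError(f"unknown view: {view!r}")
    return {
        field: value
        for field, value in record.items()
        if view not in masks.get(field, ())
    }


# Made-up record: dates and affiliations are fully private, while titles
# are hidden from the public but shown to members to facilitate merging.
record = {
    "name": "Example Person",
    "dates": "1968-",
    "affiliations": ["Example University"],
    "titles": ["An example title"],
}
masks = {
    "dates": {"public", "member"},
    "affiliations": {"public", "member"},
    "titles": {"public"},
}
print(visible_record(record, masks, "public"))
print(visible_record(record, masks, "member"))
```

Keeping the mask per field and per view mirrors the point made on the slide: a source can hide dates everywhere yet still expose titles to other members.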
Do not merge
Anything that looks suspicious: report it in a general note and the QT will review
[Screenshot note examples: "This is not the same person"; "This title belongs to"]
ISNI Statistics
[Screenshot panels: Basic statistics, Cross matches, VIAF matches]
La Trobe University: 1,864 VIAF Links
Linked Data: isni.org/isni/
Janifer Gatenby EMEA Program Manager Metadata Janifer.gatenby@oclc.org