170 likes | 356 Views
OECD ’ s approach to Manage, Publish, and Cite data. Major objectives for data management. Ability to be cited – thereby generating incoming links via CrossRef Integration with other publication types to create an integrated information service
E N D
Major objectives for data management • Ability to be cited – thereby generating incoming links via CrossRef • Integration with other publication types to create an integrated information service • Pushing metadata to information/knowledge management channels (e.g. MARC) • Pushing metadata to discovery channels (e.g. RePEc, econlit et al)
OECD Publishing’s approach • A two-steps approach: • from standards implementation • to online publishing and citing • Consolidating the foundations: • Continuing process of linking data with publications • Next challenges to increase discovery and use …
A two-steps approach From Standards implementation • In 2008-2009: Aggregation of datasets & data tables in a central bibliographic database including OECD books and papers • development of standards for bibliographic management and citing of datasets and data tables Green, T (2009), “We Need Publishing Standards for Datasets and Data Tables”, OECD Publishing White Paper, OECD Publishing. doi: 10.1787/603233448430, http://dx.doi.org/10.1787/603233448430
Two concepts are required for datasets’ management dataset(being part of collection/stand-alone serial collection(of datasets/ of collection of datasets) Data Concepts DS DS DS DOI ISSN DOI Stand-alone Dataset – subject to subscription DS Collection of datasets DS Collection of collection of datasets DOI DS DS DB DB DOI DOI datasets DPP DPP Collection of datasets DS DS DB DB DOI ISSN DOI DOI DS DS DOI DS DS DOI
Agreed definitions • Collection • of more than one datasets • of collection of datasets • Has an ISSN • Has a DOI • Is subject to subscription • Collection of datasets • belongs to a Top Collection • Has a DOI • Does not have ISSN • Dataset: • a content type (group of related data such as a OECD.stat cube) published: • as part of a collection • stand-alone (in this case it can be subject to subscription and has an ISSN) • Has a DOI
Agreed DOI syntax • Collection (of collection of datasets) • DOI suffix =<CollectionAcronym>-data-<LanguageISO2Code> • e.g. agr-data-fr • Dataset (including stand-alone dataset managed as serial) • DOI suffix = data-<DatasetOrderNumber on 5 digits>-<LanguageISO2Code> • e.g. data-00023-en
What do we cite ? Only Dataset are cited
A two-steps approach … To online publishing and citing • In 2010 launch of OECD iLibrary portal aggregating books, papers and statistics: thanks to metadata standards datasets and data tables are published online together with e books, journals and working papers • Each dataset language version is assigned a unique and persistent DOI referenced in CrossRefdatabase, allowing Cross-referencing between datasets and published articles, books, chapters, etc.
A continuing Process of Linking data with publications Consolidating the foundations • Cross-referencing but also…. • Internal linking within OECD publications catalogue
Overview of links management in KAPPA between books, papers and statistical content External IMF Data Mapper External Resource Related periodical StatisticalPeriodical or Annual N1 External link: Related Website Main Eco. Indicators Statistical Collection Related database 1N Serial 1N Related periodical Related database N1 Has Source/Method StatisticalPeriodical or Annual Serial Statistical Collection Related periodical Is Source/Method of 1N N1 Related database Has Source/Method Datasource Is Source/Method of Datasource Business Tendency Surveys: A Handbook Quarterly unit labour cost Key table Publication Publica-tion Datasource Is Source/Method of Has Source/Method Datasource • Legend: • One waylink • Bidirectionallink: must beentered in a given direction in KAPPA (the full arrowrepresents the linkthatwillbeentered, and the dottedarrowrepresents the reciprocallinkwhichwillautomaticallybecreated) Datasource Datasource Publicat° compo- nent Dataset Book Table/Graph Chapter/article Datasource Datasource
Display of cross-references on OECD iLibrary dataset homepage
Provision of MARC records for datasets Next challenges to increase discovery & use • The MARC records are provided in MARCXML for • dataset (within collection, or stand-alone) • statistical collection • key table • key table collection • MARC records are generated in English only, and describe the online version of a publication/serial.
Next challenges to increase discovery & use • Expand the definition of dynamic datasets: • updating « datasets » which are continously updated in a dynamic way • Regular datasets’ editions: datasets which are not updating resources but are published in separate editions rather than as an integrating resource which is continuously updated and adapt citing/bibliographic standards • Manage online archived datasets • Disseminate datasets records on RePEcthe world’s largest collection of papers in economics