180 likes | 506 Views
ISO/TC 211 Workshop on Standards in Action Stockholm, Sweden, 8 June 2005. ISO 19115 as the metadata standard for Statistics South Africa. Joseph Lukhwareni, Sibongile Madonsela, Antony Cooper 1 , Marius Cronje, Dineo Mokhuwa, Lucas Podile, Nishan Pillay, Thanyani Maremba
E N D
ISO/TC 211 Workshop on Standards in Action Stockholm, Sweden, 8 June 2005 ISO 19115 as the metadata standard for Statistics South Africa Joseph Lukhwareni, Sibongile Madonsela, Antony Cooper1, Marius Cronje, Dineo Mokhuwa, Lucas Podile, Nishan Pillay, Thanyani Maremba and Mandla MasemulaData Management and Information Delivery Project (DMID), Statistics South Africa 1 CSIR, South Africa. Presenting author.
Overview • Background • Standard Investigation • Findings • Implementation of ISO 19115 • Development of capturing tool • Principles and Benefits
Background • Statistics South Africa (Stats SA) • National Department • Official statistics agency for South Africa • Vision is to be the preferred supplier of quality statistics • Data Management and Information Delivery project (DMID) • Building a data warehouse for Stats SA
DMID Data Warehouse Policies and Standards Data Repository Central Metadata Repository CaRS Metadata Repository
Current metadata situation • Originating components structure and store metadata according to different standards and procedures. This results in: • Limited analysis and comparability of data • Inconsistent access to and use of data • Lack of consistent standard • Weakness in version control • Lack of or inadequate metadata • Rules on archiving are inconsistent or non-existent
Standards investigated • Metadata registries (ISO/IEC 11179) • Geographic information (ISO 19115) • Dublin Core Metadata Initiative (DCMI) • Data Documentation Initiative (DDI)
ISO/IEC 11179 • Information technology – Metadata registries (MDR) • Describes what a metadata registry should contain • For concepts and definition formulation • Does not describe metadata per se • For the developers of metadata standards • Not for those who record and use metadata • Currently used by other stats agencies • Australian Bureau of Statistics & Statistics Canada
ISO 19115 • Geographic information – Metadata • Provide rules for extensions and profiles • Guidance on extending metadata, implementing and managing metadata • Hierarchical levels of metadata • Free text elements may include multiple instances in different languages • Comprehensive dataset metadata profile • Code lists used extensively to remove bias • Used by geographic and non-geographic organisations
Dublin Core • ISO 15836:2003 • Information and documentation – The Dublin Core metadata element set • Focuses on data discovery • Initially developed for document-like objects (librarian) • Many element refinements (qualifiers) • Largely free text • 15 core metadata elements • Title, Creator, Subject, Description, Publisher, Contributor, Date, Type, Format, Identifier, Source, Language, Relation, Coverage, Rights
DDI • Data Documentation Initiative (DDI) • Standard for technical documentation describing social and behavioural data • Over 300 tags • Largely free text • Content, presentation, transport and preservation ofdocumentation for datasets • DDI specification is written in XML • Document Type Definition (DTD) and XML Schema (XSD) v2.0 published 2003-07-15
Implementation of ISO 19115 • Decided to profile SANS 1878 • South African spatial metadata standard • Itself a profile of ISO 19115 • Piloted Profile in Stats SA • Geography (Census 2001 Enumeration Area) • Economic Statistics (Survey of Employment and Earning) • Social Statistics (Labour Force Survey)
Implementation of ISO 19115 • Pilot indicated the need to extendthe Profile for statistical elements • Used examples from other international Stats Agencies to add the extended elements • Elements were further tested at an internal workshop
Development of capturing tool • Investigate the available open source and off-the-shelf solutions • e.g. M3Cat, NESSTAR, Metamaker, Metalite, ArcCatalog • Developed evaluation criterion • Recommended in-house development of tool • Interface modelled after Metalite • ISO 19115-compliant metadata tool will integrate with other systems in Stats SA • e.g. CaRS,ArcCatalog,NESSTAR,etc
Joseph Lukhwareni Sibongile Madonsela Marius Cronje Dineo Mokhuwa Lucas Podile Nishan Pillay Thanyani Maremba Mandla MasemulaDMID, Stats SA Thank you! Contact details: Antony Cooper Email: antonyc@statssa.gov.za Phone: +27 12 310 8548