260 likes | 370 Views
Update on EUROSTAT activities. A second hand experience. LUCAS Census 2010 SDMX. First phase sample for stratification: orthophoto interpretation. LAND COVER classes 1 ARABLE LAND 2 PERMANENT CROPS 3 GRASSLAND 4 WOODED AREAS AND SHRUBLAND 5 BARE LAND, RARE VEGET. 6 ARTIFICIAL LAND
E N D
Update on EUROSTAT activities A second hand experience Ekkehard Petri GISCO Eurostat
LUCAS • Census 2010 • SDMX EFGS Meeting 2010 Den Haag
First phase sample for stratification: orthophoto interpretation LAND COVERclasses 1 ARABLE LAND 2 PERMANENT CROPS 3 GRASSLAND 4 WOODED AREAS AND SHRUBLAND 5 BARE LAND, RARE VEGET. 6 ARTIFICIAL LAND 7 WATER 2km grid 1 100 000 points Second phase sample: in-situ data collection • Parameters • Land cover • Land use • pictures • etc. Sample of around 260,000 pts Ground survey LUCAS data collection process EFGS Meeting 2010 Den Haag
Sampling strategy: Second phase sampling design • Definition of sample size by strata • Optimal size by NUTS2 and strata based on fixed precisions for a set of LC classes targeted by country • Points selection • LUCAS 2006 sample points included as much as possible (land cover/use changes can be detected) • Maximisation of the distance between points • Exclusion of remote points and points above 1000m EFGS Meeting 2010 Den Haag
Land Cover nomenclature LUCAS 2009 EFGS Meeting 2010 Den Haag
Land Use nomenclature LUCAS 2009 EFGS Meeting 2010 Den Haag
Data availability • Types of data • Tabular microdata on the first and second phase sample (land cover/use on the specific point, LC/LU change in the specific point etc.) • Pictures (four cardinal directions) • Aggregated estimates (NUTS1/NUTS2 depending on LC classes) • Years • 2006 • 2008/2009 (from march 2010 on) • Terms of use • An agreement has to be signed between DG-ESTAT and users about: • Confidentiality: only aggregated data can be disseminated EFGS Meeting 2010 Den Haag
Data availability per country/year EFGS Meeting 2010 Den Haag
Census EFGS Meeting 2010 Den Haag
EU census goals • Comparability of census data on the EU level • Same reference year (first time: 2011) • Same ‘topics’ (variables) • Use of harmonized definitions and technical specifications • Use of identical breakdowns of the topics • Unified dissemination programme (hypercubes) =>Common Baseline across countries • Transparent quality of census results • Quality reports • Detailed tables on quality of the data • Metadata EFGS Meeting 2010 Den Haag
EU census limits What does the regulation not provide? • No access to microdata • No possibility to define geographical areas flexibly • No harmonised confidentiality control • No normative minimum quality requirements (quality thresholds) • No consolidation of census results form different Member States BUT • Member States are free to do more! EFGS Meeting 2010 Den Haag
What data for what geographical area? NUTS2: • Year of arrival in the country • Educational attainment • Location of place of work • Current activity status • Occupation • Industry • Status in employment • Tenure status of households • Housing arrangements • Type of ownership (of dwellings) • Water supply system, Toilet facilities, Bathing facilities, Type of heating EFGS Meeting 2010 Den Haag
What data for what geographical area ? LAU 2 Housing topics Occupancy status of conventional dwellings Number of occupants Useful floor space and/or Number of rooms Density standard Dwellings by type of building Dwellings by period of construction Type of living quarters Location of living quarters Population topics Sex Age Legal marital status Country/place of birth Country of citizenship Place of usual residence one year prior to the census (Size of the) Locality Household status Type of private household Size of private household Family status Type of family nucleus Size of family nucleus Total population Place of usual residence Relationships between household members EFGS Meeting 2010 Den Haag
What we can NOT do for GISCO ? • The municipalities as smallest geographical area for the census data to be transmitted to Eurostat (LAU 2 level) are fixed. No flexibility to define areas freely. • After long and detailed consultation with the Census experts from the Member States, the foreseen obligatory statistical programme represents a balance between the desirable and the feasible. • Eurostat does not have access to census microdata. • Confidentiality control is done by the NSI. EFGS Meeting 2010 Den Haag
What we can do for GISCO ? • Usage of common definitions, technical specifications and breakdowns makes census data better comparable at the European level. • Intensive description and quality reporting of the NSI on the data sources and methodology they use to do the population and housing census. This might help to develop small area reporting systems. • Key topics will be required for the LAU 2 level. It is likely that some of the data might also be available for even smaller areas in some Member States. • Eurostat organizes a task force on Census Data Disclosure Control which aims at proposing best methodology and practice to protect census data with minimum damage to disseminated results. • The Census Hub might be used to exchange and disseminate small area data from censuses. EFGS Meeting 2010 Den Haag
Census Hub project: architecture WS WS WS database database database EFGS Meeting 2010 Den Haag
The Census Hub project • The Census Hub project aims to build a new IT infrastructure to achieve the data exchange between the National Statistical Institutes (NSI), Eurostat and the users of census data using SDMX standards. • Data sharing architecture • Based on the agreed hyper-cubes with harmonised data • Confidentiality problems handled at national level • A data user browses the hub to define a dataset of interest via structural metadata (dimensions, attributes, measures, code lists, etc). Data are retrieved directly from the interested Member States’ systems EFGS Meeting 2010 Den Haag
Present and ongoing activities • Pilot project in Germany, Ireland, Italy and Portugal • Guideline explaining how to implement an SDMX MSs architecture in the Census Hub context available EFGS Meeting 2010 Den Haag
SDMX EFGS Meeting 2010 Den Haag
What is SDMX • “Statistical Data and Metadata Exchange” • SDMX preferred standard for exchange and sharing of data and metadata in the global statistical community • Sponsors include • European Central Bank (ECB) • Eurostat • Organisation for Economic Co-operation and Development (OECD) • United Nations Statistical Division (UNSD) EFGS Meeting 2010 Den Haag
Benefits from SDMX standards • Covers potentially all statistical domains • Open to all stakeholders • Are neutral in terms of underlying commercial technologies • Demography and the Census hub already implemented EFGS Meeting 2010 Den Haag
SDMX components Information model for data and metadata Syntax for automatic exchange of data and metadata Guidelines to Harmonise Contents IT Architectures for data exchange IT tools to support implementation and to disseminate SDMX data SDMX is not just a data transmission format… Similarities with INSPIRE are substantial EFGS Meeting 2010 Den Haag
SDMX Components: Information Model • Statistical data • Metadata • Structural • Conceptual • Quality • Methodology • Data exchange process EFGS Meeting 2010 Den Haag
SDMX Information Model Provides a way of modelling statistical data, metadata and data exchange processes. Dimensions (ex: country, variable/topic, year) Dataset Structure Definition DSD Code lists Structural Metadata Attributes (ex: unit of measure) Describe Metadata about an individual value, a time series or a group of time series Data EFGS Meeting 2010 Den Haag
SDMX Components: IT Tools • SDMX Registry • Tools to create data definitions and metadata • Tools to convert and validate data and metadata • Tools to visualise data and metadata • Training available from Eurostathttp://epp.eurostat.ec.europa.eu/portal/page?_pageid=2733,61942355,2733_61942368&_dad=portal&_schema=PORTAL EFGS Meeting 2010 Den Haag
SDMX Registry Repository Graphical User Interface (GUI) for user interaction over the Web Structural metadata Provision of information DSW – “standalone” Java GUI CodeLists Dataflows ConceptSchemes Provision agreements DSDs Accessible via a Web Service accepting SDMX-ML messages EFGS Meeting 2010 Den Haag