170 likes | 301 Views
European statistical microdata bases: What form of access for social science researchers?. Michel GLAUDE Director “Social and information society statistics” EUROSTAT. Eurostat Free dissemination policy.
E N D
European statistical microdata bases:What form of access for social science researchers? Michel GLAUDE Director “Social and information society statistics” EUROSTAT QMSS CONFERENCE PRAGUE
Eurostat Free dissemination policy • Since 1 October 2004, all European non confidential statistical data has been available free of charge on the Eurostat website… • …But Social science researchers need to have access to microdata QMSS CONFERENCE PRAGUE
Eurostat data collection policy:To develop transmission of harmonised microdata to Eurostat • To better answer users’ needs • To produce and disseminate more analytical studies • To alleviate and simplify the burden for NSIs • To increase the central access for researchers to these “Gold mines” QMSS CONFERENCE PRAGUE
Microdata bases for access • At present: • The Labour Force Survey (LFS) • The Community Statistics on Income and Living Conditions (EU-SILC) and the former ECHP • The Community Innovation Statistics (CIS) • The Continuing Vocational Training Survey (CVTS) • In a near future: • The Structure of Earnings Survey (SES) • The Adult Education survey (AES) QMSS CONFERENCE PRAGUE
The Labour Force Survey (LFS) • Main quarterly European data source for unemployment, employment, working conditions, education, occupation… • Yearly ad-hoc modules • All European Economic Area countries (except Malta) plus Bulgaria and Romania in June 2007 (2006 data) • 1500 000 individuals par Quarter for the whole EU • More than 100 variables plus 11 for each ad-hoc module • In June Y release of data for the reference years from 1983 to Y-1 for 8000 € and 1000 € for each following year • In March/April Y release of the ad-hoc module of year Y-2 • Regulations 2257/2003, 430/2005, 246/2003 QMSS CONFERENCE PRAGUE
The Community Statistics on Income and Living Conditions (EU-SILC)… • Main annual European data source for income, poverty, social exclusion and living conditions (housing, health, education, child care…) started in 2004 • Yearly ad-hoc modules: Transmission of poverty (2005), Housing conditions (2006), Social participation (2007)… • All 25 MS plus Norway and Iceland for SILC 2005 • 430 000 individuals and 200 000 households for the SILC 2005 • 72 variables at household level and 85 at individual level • Cross sectional data for year Y-2 (income reference year Y-3) released in March Y for 500 € and 250 € for each following year • Longitudinal data (4 years trajectories) released in September Y • Regulations 1177/2003, 1980/2003, 1983/2003… QMSS CONFERENCE PRAGUE
…and the former European Community Household Panel (ECHP 1994-2001) • Annual panel survey covering income, working life, housing situation, social relations, health and biographical information • Community Survey covering 14 Member States from 1994 to 2001 • 130 000 adults (16+) and 60 000 households interviewed every year • 8 000 € for the whole anonymised microdata set (8 waves) • Gentlemen’s agreement basis QMSS CONFERENCE PRAGUE
The Community Innovation Statistics (CIS) • Main data source for measuring innovation in Europe covering basic information of the enterprise, product and process innovation, innovation activity and expenditure, effects of innovation… • 27 MS plus Iceland, Norway and Turkey for CIS4 (launched in 2005, reference period 2004 with observation period 2002 to 2004) • 250 000 enterprises (10 employees +) from industry and services • 120 variables collected • Specific microdata anonymisation method • In Autumn 2007 release of CIS4 anonymised microdata for 20 countries • In mid 2006 release of CIS3 (reference period 2000 or 2001) anonymised microdata for 15 EEA countries for 200 € • Access through the SAFE Center in Eurostat to CIS3 Confidential microdata for 25 € per day • Regulation 1450/2004 QMSS CONFERENCE PRAGUE
The Continuing Vocational Training Survey (CVTS) • Main data source for describing the strategies of enterprises with respect to training of their staff (Participation rates, volumes, costs…) • 27 MS plus Norway for CVTS3 (launched in 2006, reference period 2005) • 200 000 enterprises (10 employees +) from industry and services • 160 variables collected • In spring 2008 release of CVTS3 anonymised microdata • Regulations 1552/2005 198/2006 QMSS CONFERENCE PRAGUE
The Structure of Earnings Survey (SES) • Main data source for detailed information on the level of remuneration, individual characteristics of employees and of their employer (every 4 years) • 27 MS plus Norway and Iceland for SES 2002 • 7 800 000 employees from enterprises (10 employees +) belonging to NACE C to K (C to O from 2006 onwards) • SES 2002: CD-Rom dissemination in September 2007 • SES 2002: Safe Center access in June 2007 (including SES1995) • SES 2006: Safe Center access and CD-Rom dissemination expected January 2009 • Regulations 1552/2005 198/2006 QMSS CONFERENCE PRAGUE
TheAdult Education survey (AES) • Main data source for the participation of adult to formal, non formal education and training as well as to informal training. • 27 MS plus Norway, Turkey and Switzerland for the first data collection (2005-2007) • 150 000 individuals plus countries extensions: Poland 25 000, Italy 50 000 • Maximum 170 variables collected. Non core modules are not taken by all Member States • Timeliness of access to researchers not yet specified. Microdata will arrive at Eurostat over the period 2006-2008 • Gentlemen’s agreement basis QMSS CONFERENCE PRAGUE
Procedure for access • Provision of Regulation 831/2002 • Anonymised microdata sets on CD-Rom • SAFE Center access at Eurostat premises • Practical arrangements: The “fast track” procedure • See Eurostat Website • Costs have been reduced QMSS CONFERENCE PRAGUE
Future Developments: New data bases to be collected by Eurostat • The Information and Communication Technology Survey (ICT household) • The European Health Interview Survey (EHIS) QMSS CONFERENCE PRAGUE
Future Developments: The European System of Social statistical Survey Modules • A flexible tool to quickly answer to new or existing policy demands • Aiming at rationalising the household surveys data collection system • A mixed structure composed of : - annual regular modules (Core variables, ICT, Health indicators) - irregular modules (victimisation, consumer protection, fertility, youth…) • 200 000 individuals for the EU-27 according to country sizes • Results within 2 years after the demand is expressed • Microdata at European level • First implementation in 2010? QMSS CONFERENCE PRAGUE
Future Developments: Methodological improvements • A common set of “Core variables” • The new European Socio-economic Classification (ESeC) QMSS CONFERENCE PRAGUE
Future Developments: Improving access for researchers • Revision of the “Statistical law” Regulation 322/97 Creation of “Public use Files” Harmonising methods for anonymisation • Developing “Remote access” • Extending SAFE centers in NSIs? • Giving better access for Data Archive centers? QMSS CONFERENCE PRAGUE
How to develop collaboration… • Between European social science research and Eurostat and NSIs ? QMSS CONFERENCE PRAGUE