Availability, Infrastructure, Use and Reuse of Administrative Data in Statistics Production: a Scandinavian Example
Folke Carlsson and Anders Holmberg
R&D Department, Statistics Sweden
E-mail: firstname.lastname@scb.se
Outline
• Why administrative data sources?
• Historical background
• Prerequisites
• Cooperation
• Legislation
• Methodology and quality
• Future development ideas
The strengths of administrative registers
• No extra response burden
• Dramatically reduced costs
• Good coverage of units
• Data for every year
• In some cases good quality
• Opportunities for regional statistics and longitudinal studies
History
• 1749: first Swedish census
• Unified PIN in 1947, BIN in 1975
• Centralisation of statistics in the 1960s
• Mainframe in the late 1960s
• Development over a long period; see "Register-based statistics in the Nordic countries – Review of best practices with focus on population and social statistics", ECE/CES/STAT/2007/2
PINs and BINs – the base for the Register System
• PIN introduced 1947
• BIN introduced 1975
• BIN = registration number + check digit, e.g. 2021000837
• PINs and BINs are widely used and accepted
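The slide says only that a BIN is a registration number plus a check. As a minimal sketch, assuming the check digit follows the common Luhn (mod 10) scheme, which the slide does not state, the example number can be validated as follows. The function name `luhn_valid` is illustrative and not part of any Statistics Sweden system.

```python
def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn (mod 10) check.

    Assumption: the BIN check digit uses the Luhn scheme.
    """
    digits = [int(ch) for ch in number if ch.isdigit()]
    if not digits:
        return False
    total = 0
    # Walk from the rightmost digit (the check digit) to the left,
    # doubling every second digit and folding two-digit results.
    for position, digit in enumerate(reversed(digits)):
        if position % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9
        total += digit
    return total % 10 == 0


if __name__ == "__main__":
    print(luhn_valid("2021000837"))  # the example BIN from the slide
```

A check like this catches most single-digit typing errors before a key is used for linkage, which is one reason stable identifiers with check digits make good register keys.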
Cooperation with data suppliers
Swedish Tax Administration
• Data are delivered continuously on a daily, monthly, quarterly and yearly basis.
• Important: political and administrative decisions can affect statistics directly.
• Example: the wealth tax was abolished in 2007, so wealth statistics need to be collected in a different way.
Statistics Legislation
• 1974: first Personal Data Act
• Protection of personal integrity
• Notification of personal data processing to the Data Inspection Board
• Right to register extracts free of charge, on request
• Regulations on damages, penalties and fines in the case of infringements of the general provisions of the Act
• Data Inspection Board
• During the 1980s (Census 1985)
• Big Brother debate
• "I don't want my name in a computer"
• Today: why ask when "you" already have that information?
Statistics Legislation
• The Official Statistics Act
• Rules applying to the official statistics regarding:
• Purpose
• Availability
• Objectivity
• Protection of information
• Obligation to provide information
The Secrecy Act
• The personal and economic circumstances of respondents are protected.
Reasons for confidentiality:
• Protection of personal integrity
• Public confidence in Statistics Sweden
• Data requirements for the statistics
Information that is public elsewhere can be protected by the Secrecy Act once it reaches Statistics Sweden.
Examination of confidentiality
• Data may be released after examination in accordance with Chapter 9, Section 4 of the Secrecy Act in the following cases:
• research purposes
• statistical purposes
• anonymised (unidentifiable) data
Standardised population
[Figure: three pairs of administrative source and statistical register, with a standardised population, e.g. the population on 31 December.]
Dependence between surveys and registers
Example: labour market statistics are based on both surveys and registers.
• The Labour Force Survey (LFS) uses income verifications as auxiliary information.
• Register-based labour market statistics use the LFS to calibrate the rules that decide employment/unemployment.
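The calibration idea can be sketched as follows. This is a toy illustration, not Statistics Sweden's method: it assumes the register rule is a single income threshold and that LFS respondents can be matched to their register income. The threshold is chosen so that the register classification agrees with the LFS answers as often as possible. All names and figures are made up.

```python
def calibrate_threshold(sample, candidates):
    """Pick the income threshold that best reproduces LFS employment status.

    sample: list of (register_income, lfs_employed) pairs for matched persons.
    candidates: thresholds to try (hypothetical values).
    """
    def agreement(threshold):
        # Count persons for whom the register rule matches the LFS answer.
        return sum((income >= threshold) == employed for income, employed in sample)

    return max(candidates, key=agreement)


# Hypothetical matched sample: (annual register income, employed according to LFS)
sample = [
    (0, False), (5_000, False), (20_000, False),
    (30_000, True), (45_000, True), (60_000, True), (150_000, True),
]
candidates = [10_000, 25_000, 40_000, 50_000]

best = calibrate_threshold(sample, candidates)
print(best)
```

Once calibrated on the survey sample, the rule can be applied to every person in the register, which is what lets register-based statistics reach regional and longitudinal detail that the survey alone cannot.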
The system of statistical registers consists of:
• Basic registers: important object sets
• Other statistical registers: important variables
• Linkages between objects in different basic registers, and linkages between basic registers and statistical registers
• Standardised variables (fundamental variables)
• Register-statistical methods, quality assurance
• Metadata
• IT tools for register processing
• Rules for protecting integrity
Statistics Sweden’s system of registers
[Figure: chart of the register system. Registers shown:]
Causes of Death Register; Supplementary Benefits Register; Vehicles – privately owned; Register of Criminal Convictions; Education Register; Employment Register; Population and Housing Census; Personal Income and Assets; Persons Nominated and Elected; Longitudinal Income Register (Linda); Longitudinal Register (Louise); Multi-Generation Register; Fertility Register; Swedish for Immigrants; Adult Education; Upper Secondary School; 9th Form Register; Teacher Register; Higher Education Register; Persons Enrolled in Education; Education-Labour Market Register; Income Statements; Private Sector Employees, wages; County Council Employees, wages; Parish Employees, wages; Local Authority Employees, wages; Civil Service, wages; Occupational Register; Register of Jobs and Activities; Population Register; Business Income; Demography of Enterprises; Payrolls, number of employees; Standardised Accounts; Monthly Tax Returns; Manufacturing; VAT Register; Foreign Trade Register; Vehicles – company owned; Agricultural Enterprises Register; Register of Schools; Geographical Database; Real Estate Price Register; Renovation Register; New Construction Register; One or Two Dwelling Buildings; Multi-Dwelling Buildings; Industrial Real Estate; Agricultural Real Estate; Valued Estate Units; Real Estate Register; Business Register.
All other registers are linked to the basic registers
• Basic registers: Population, Activity, Real Estate and Business Register
• The same PIN, BIN (LU id), RE id and D id are used in all registers
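Because the same keys are used everywhere, linking a statistical register to a basic register reduces to a key lookup. The sketch below shows this with two tiny made-up registers. The field names, values and the helper `link_by_key` are illustrative assumptions and not an actual Statistics Sweden tool.

```python
def link_by_key(base_rows, other_rows, key):
    """Left-join other_rows onto base_rows using a shared key such as the PIN.

    Every object in the base register is kept; variables from the other
    register are added where a matching key exists.
    """
    index = {row[key]: row for row in other_rows}
    linked = []
    for row in base_rows:
        match = index.get(row[key], {})
        extra = {name: value for name, value in match.items() if name != key}
        linked.append({**row, **extra})
    return linked


# Hypothetical extracts; "pin" stands in for the personal identity number.
population_register = [
    {"pin": "P001", "municipality": "A"},
    {"pin": "P002", "municipality": "B"},
    {"pin": "P003", "municipality": "A"},
]
income_register = [
    {"pin": "P001", "income": 250_000},
    {"pin": "P002", "income": 310_000},
]

linked = link_by_key(population_register, income_register, "pin")
for row in linked:
    print(row)
```

Keeping every row of the basic register makes undercoverage in the linked source visible: here P003 has no income record, so the missing variable can be handled explicitly rather than silently dropping the person.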
Towards a system: metadata
• MetaPlus today identifies 62 700 variables, but only 13 500 of them are unique.
• Many variables are reused: on average each unique variable appears about 4.6 times.
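The difference between all variables and unique variables can be illustrated with a small count. This is only a sketch: the register and variable names below are invented, and MetaPlus itself is not modelled here.

```python
from collections import Counter

# Hypothetical mapping from register to the variables it contains.
register_variables = {
    "register_a": ["pin", "age", "sex", "municipality"],
    "register_b": ["pin", "age", "sex", "industry"],
    "register_c": ["pin", "income"],
}

# Count how many registers each variable appears in.
usage = Counter(
    variable
    for variables in register_variables.values()
    for variable in variables
)

total_variables = sum(usage.values())   # every occurrence counted
unique_variables = len(usage)           # each variable counted once
reused = sorted(name for name, count in usage.items() if count > 1)

print(total_variables, unique_variables, reused)
```

Variables that appear in many registers are natural candidates for standardisation, since one shared definition then serves every register that uses them.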
MetaPlus: contents
• Standard variables
• Variables
• Object classes
• Classifications and value domains
• Registers created and used
• Population
3 Spheres of Interest
[Figure: three production systems (BR-Prod, PR-Prod and RE-Prod); labels: FDB, RSV, SKD, SRU, KU, FTR; PR, BR and FR year versions + variables.]
From stovepipes to IDW
[Figure: Structured Input Data Warehouse; labels: SIL, Text, SQL, Metadata, Base register, SSD, SUL.]
Social statistics
• Today Statistics Sweden conducts 100 000 interviews across several large surveys.
• The same or similar questions appear in several surveys.
• Is it possible to coordinate these surveys and the registers so that we get more and better information?
• What do standardised populations offer?
Economic statistics
Coordinate National Accounts – Business Register
[Figure labels: Business register, Survey, Administrative sources, National Accounts.]
Quality documentation with focus on variables instead of registers
[Figure labels: SCBDOK, BAS, Analysis product, MDS, Wizard, Indicators, BAM, REGISTER.]
Experiences
• Use administrative data: it definitely has a lot of advantages.
• Patience: it takes time before it is time to harvest, and there are risks. Learn from others' mistakes; benchmark.
• Don't use administrative data as they are: combine sources, create new objects and new variables.
• Build ONE system, not one for social statistics and one for economic statistics.
• The system should be flexible: a register is not for one use only. For a new field to investigate, select an appropriate object set and appropriate variables from the system and create new variables.
• Good keys are the key to a good system.
• Metadata: good metadata is an insurance against huge errors such as changes in definitions. You must know what you compare! It is also the key to efficiency in a statistical production system.
• Never forget protection of integrity.
Thank You! Folke Carlsson and Anders Holmberg firstname.surname@scb.se