The Social Statistics Database: Invaluable source of micro-data for socio-economic statistics
Johan van Rooijen
Contents of this presentation
• What does the Social Statistics Database (SSD) contain?
• Linkage keys
• Standardization
• Data and metadata
• Process
• Privacy protection
• Output examples
SSD: contents
The SSD is a central database containing microdata on:
• Persons
• Relations between persons
• Households
• Jobs
• Self-employment
• Social security benefits
• State and employer pensions
• Income
• Education
• Hospitalizations
• Causes of death
• Criminal offense reports
• Houses
• Vehicles
and more
Many administrative sources, such as:
• Population Register
• Register Income Tax Declarations
• Administration of Employee Insurance Schemes
• Police Register
• Valuation of real estate registration system
Survey:
• Labour Force Survey
Future developments: more administrative sources, more surveys, big data?
SSD: linkage keys
All microdata files stored in the SSD contain a linkage key. Interlinking microdata is what the SSD is about!
Example: linking microdata on graduates with data on persons and data on employment in order to describe the transition from education to labour market participation.
SSD: contents and linkage keys
• PIN: person identification number
• HIN: household identification number
• AIN: address identification number
• EIN: enterprise identification number
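The graduates example can be sketched in a few lines of Python. This is only an illustration of linking files on a shared key, not the SSD's actual implementation: the records, the field names other than PIN, HIN and EIN, and the helper functions are all hypothetical.

```python
# Toy microdata files. All records and all field names except the linkage
# keys (PIN, HIN, EIN) are invented for illustration.
graduates = [
    {"PIN": "P001", "diploma": "MSc", "graduation_year": 2012},
    {"PIN": "P002", "diploma": "BSc", "graduation_year": 2012},
    {"PIN": "P003", "diploma": "MSc", "graduation_year": 2013},
]
persons = [
    {"PIN": "P001", "HIN": "H10", "birth_year": 1988},
    {"PIN": "P002", "HIN": "H11", "birth_year": 1990},
    {"PIN": "P003", "HIN": "H12", "birth_year": 1989},
]
jobs = [
    {"PIN": "P001", "EIN": "E500", "start_year": 2012},
    {"PIN": "P003", "EIN": "E501", "start_year": 2015},
    {"PIN": "P003", "EIN": "E502", "start_year": 2014},
]


def group_by(records, key):
    """Group records into lists keyed by the value of `key`."""
    grouped = {}
    for record in records:
        grouped.setdefault(record[key], []).append(record)
    return grouped


def link_graduates(graduates, persons, jobs):
    """Link each graduate to person data and to the earliest job via PIN."""
    person_by_pin = {p["PIN"]: p for p in persons}
    jobs_by_pin = group_by(jobs, "PIN")
    linked = []
    for grad in graduates:
        person = person_by_pin.get(grad["PIN"], {})
        first_job = min(
            jobs_by_pin.get(grad["PIN"], []),
            key=lambda job: job["start_year"],
            default=None,  # graduate has no job record
        )
        linked.append({
            "PIN": grad["PIN"],
            "diploma": grad["diploma"],
            "birth_year": person.get("birth_year"),
            "years_to_first_job": (
                None if first_job is None
                else first_job["start_year"] - grad["graduation_year"]
            ),
        })
    return linked


linked = link_graduates(graduates, persons, jobs)
for row in linked:
    print(row)
```

Because every file carries the same PIN, the three sources can be combined without any name or address matching; graduates without a job record simply keep an empty value.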
SSD: standardization
Standardization is an important aspect of the SSD:
• organization of the IT infrastructure
• file formats
• linkage keys
• names
• metadata
In conclusion:
• Microdata files are linked to other microdata files by means of a set of standardized linkage keys.
• Microdata files are linked to their metadata through their names (file name / variable names).
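Linking data to metadata by name could look roughly like the following. The catalogue layout, the file names and the variable names are assumptions made for this sketch; the slides do not describe how the SSD stores its metadata.

```python
# Hypothetical metadata catalogue keyed by (file name, variable name).
METADATA = {
    ("PERSONS", "PIN"): "Person identification number (linkage key)",
    ("PERSONS", "BIRTHYEAR"): "Year of birth",
    ("JOBS", "PIN"): "Person identification number (linkage key)",
    ("JOBS", "EIN"): "Enterprise identification number (linkage key)",
}


def describe(file_name, variable_names):
    """Return the metadata description for each variable, or None if missing."""
    return {name: METADATA.get((file_name, name)) for name in variable_names}


def undocumented(file_name, variable_names):
    """List the variables of a file that have no metadata entry."""
    return [name for name in variable_names if (file_name, name) not in METADATA]


print(describe("PERSONS", ["PIN", "BIRTHYEAR"]))
print(undocumented("JOBS", ["PIN", "EIN", "WAGE"]))
```

The lookup only works if file and variable names follow a fixed convention, which is why names appear in the list of standardized items.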
SSD: privacy protection
Legislation:
• Statistics Netherlands Act
• Netherlands Data Protection Act
These laws:
• authorize Statistics Netherlands to use personal data
• oblige Statistics Netherlands to take adequate measures aimed at privacy protection
Privacy protection measures:
• Linkage keys are anonymous; original personal identifiers are removed from the data
• Access rights to the SSD are restricted
• Limited e-mail facilities
• Staff members have to take an oath
• Check on disclosure risk of output
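One common way to obtain anonymous linkage keys is to replace the original identifier with a keyed hash and then drop the identifier. The slides do not say how Statistics Netherlands derives its keys, so the sketch below is an assumption: the HMAC-SHA256 approach, the field name `national_id`, the secret and both function names are hypothetical.

```python
import hashlib
import hmac


def pseudonymise(identifier, secret_key):
    """Derive a stable, non-reversible key from an identifier (keyed hash)."""
    digest = hmac.new(secret_key, identifier.encode("utf-8"), hashlib.sha256)
    return digest.hexdigest()[:16]  # shortened for readability in this sketch


def replace_identifier(records, id_field, secret_key, key_field="PIN"):
    """Return copies of the records with id_field removed and key_field added."""
    cleaned = []
    for record in records:
        row = {k: v for k, v in record.items() if k != id_field}
        row[key_field] = pseudonymise(record[id_field], secret_key)
        cleaned.append(row)
    return cleaned


secret = b"example-secret-kept-outside-the-database"  # hypothetical secret
source = [
    {"national_id": "123456782", "income": 31000},
    {"national_id": "987654321", "income": 45000},
]
benefits = [{"national_id": "123456782", "benefit": "unemployment"}]

income_file = replace_identifier(source, "national_id", secret)
benefit_file = replace_identifier(benefits, "national_id", secret)
print(income_file[0]["PIN"] == benefit_file[0]["PIN"])
```

The same identifier always maps to the same key, so files remain linkable after the original identifier has been removed.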
The effect of discharge and discharge reason on relationship dissolution
Research by external researchers (through the Centre for Policy-related Statistics)
• The Netherlands Cancer Institute: linking SSD data to cancer registration
• Municipalities: evaluation of their social assistance policy by enriching their own data with data from the SSD