Design and Use of the IPUMS-International Data Series. http://international.ipums.org. Matt Sobek, Minnesota Population Center, sobek@pop.umn.edu. Outline: Overview; Processing; Dissemination system; Strengths and limitations; Users.
END https://international.ipums.org Matt Sobek, Minnesota Population Center, sobek@pop.umn.edu
What is IPUMS-International?
• Census data – 1960 to present
• Samples – 1 to 10%, nationally representative
• Microdata – individual-level
• Integrated – consistent codes across time and place
• Downloadable – anonymized
• Extract system – select variables – pooled data
Map of IPUMS Partners (83 countries). Dark green = disseminating data; light green = partners, not yet disseminating.
Current Countries in IPUMS
Africa: Egypt, Ghana, Guinea, Kenya, Rwanda, South Africa, Uganda
Americas: Argentina, Bolivia, Brazil, Canada, Chile, Colombia, Costa Rica, Ecuador, Mexico, Panama, United States, Venezuela
Asia: Armenia, Cambodia, China, India, Iraq, Israel, Jordan, Kyrgyz Rep., Malaysia, Mongolia, Palestine, Philippines, Vietnam
Europe: Austria, Belarus, France, Greece, Hungary, Italy, Netherlands, Portugal, Romania, Slovenia, Spain, United Kingdom
44 countries, 130 samples, 279 million persons
Countries in IPUMS Archive: Bangladesh, Botswana, Cuba, Czech Republic, Dominican Rep., El Salvador, Ethiopia, Fiji, Germany, Guatemala, Haiti, Honduras, Indonesia, Liberia, Madagascar, Malawi, Mali, Mauritius, Nepal, Nicaragua, Pakistan, Paraguay, Peru, Puerto Rico, Senegal, Saint Lucia, Sierra Leone, Sudan, Switzerland, Tanzania, Thailand, Turkmenistan, Uruguay, Zambia
IPUMS Microdata. Variables shown: Relation to head, Marital status, Literacy, Occupation
Availability of Selected Person Variables (Number of samples)
Availability of Selected Household Variables (Number of samples)
536 integrated variables; 10,600 unharmonized variables
User Access
Application
• Scholarly and educational purposes
• Key condition: the data must not be redistributed
Once approved, access to all data
Free
Making the IPUMS: Pre-processing, Integration, Dissemination
Making the IPUMS
Pre-processing
• Language translation
• Reformatting
• Error correction
• Sampling
• Confidentiality
Integration
• Metadata
• Data harmonization
• Constructed variables
Census Questionnaire (Mexico 2000): Water access
XML-Tagged Census Questionnaire (Mexico 2000): Water access
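Tagging the questionnaire text by variable makes it possible to pull out the enumeration text for any one variable mechanically. The following is a minimal sketch of that idea only. The element and attribute names (`questionnaire`, `question`, `var`, `option`) and the response wording are hypothetical and are not the project's actual markup.

```python
import xml.etree.ElementTree as ET

# Hypothetical tagged fragment: tag names, attribute names, and response
# wording are invented for illustration, not the actual IPUMS markup.
TAGGED = """
<questionnaire country="Mexico" year="2000">
  <question var="WATER">
    <text>Water access</text>
    <option code="1">Piped water inside the dwelling</option>
    <option code="2">Piped water outside the dwelling</option>
  </question>
  <question var="MARST">
    <text>Marital status</text>
    <option code="1">Single</option>
  </question>
</questionnaire>
"""


def enumeration_text(xml_source, variable):
    """Return (question text, {code: label}) for one tagged variable, or None."""
    root = ET.fromstring(xml_source)
    for question in root.iter("question"):
        if question.get("var") == variable:
            options = {o.get("code"): o.text for o in question.findall("option")}
            return question.findtext("text"), options
    return None


text, options = enumeration_text(TAGGED, "WATER")
print(text, options)
```

Because each question carries the variable it feeds, the same lookup could serve a documentation page that shows the original wording next to the variable's codes.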
Data Integration – Marital Status. Samples compared: China 1982, Colombia 1973, Kenya 1989, Mexico 1970, U.S.A. 1990
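Harmonization of this kind can be pictured as one recode table per sample, each mapping that census's own codes onto a single shared scheme. The sketch below assumes that structure. Every code and label in it is hypothetical, chosen only to show the mechanism, and does not reproduce any census codebook or the actual IPUMS translation tables.

```python
# Shared target scheme (hypothetical codes and labels).
HARMONIZED = {
    1: "Single/never married",
    2: "Married/in union",
    3: "Separated/divorced",
    4: "Widowed",
    9: "Unknown",
}

# One recode table per sample: source code -> harmonized code.
# The source codes below are invented for illustration.
RECODES = {
    ("Mexico", 1970): {1: 2, 2: 2, 3: 4, 4: 3, 5: 1},
    ("Kenya", 1989): {1: 1, 2: 2, 3: 4, 4: 3, 5: 3},
}


def harmonize(sample, source_code):
    """Translate a sample-specific code; unmapped codes become 9 (Unknown)."""
    return RECODES[sample].get(source_code, 9)


for sample in RECODES:
    code = harmonize(sample, 3)
    print(sample, "source code 3 ->", code, HARMONIZED[code])
```

Keeping the tables as data rather than as branching logic means adding a sample is a matter of adding one more table, and pooled records from different samples then share the same codes.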
Family Interrelationship Variables (Simple household). Columns: Spouse's, Mother's, and Father's location within the household
IPUMS “Pointer” Variables (Complex household). Columns: Spouse's, Mother's, and Father's location within the household
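A pointer variable records, for each person, the person number within the household of that person's spouse, mother, or father, with 0 meaning none is present. As a rough sketch of the idea for the simple-household case only, the code below derives pointers from relationship-to-head and sex. The rule is deliberately simplified and is not the procedure IPUMS uses for complex households. The field names (`sploc`, `momloc`, `poploc`) and the relationship and sex codes are illustrative assumptions.

```python
HEAD, SPOUSE, CHILD = 1, 2, 3   # hypothetical relationship-to-head codes
MALE, FEMALE = 1, 2             # hypothetical sex codes


def find_first(household, relate_code):
    """Person number (1-based) of the first person with this code, else 0."""
    for pernum, person in enumerate(household, start=1):
        if person["relate"] == relate_code:
            return pernum
    return 0


def build_pointers(household):
    """Simplified rule: head and spouse point to each other; each child of
    the head points to the head and the head's spouse as parents, by sex."""
    head = find_first(household, HEAD)
    spouse = find_first(household, SPOUSE)
    rows = []
    for pernum, person in enumerate(household, start=1):
        row = {"pernum": pernum, "sploc": 0, "momloc": 0, "poploc": 0}
        if pernum == head:
            row["sploc"] = spouse
        elif pernum == spouse:
            row["sploc"] = head
        elif person["relate"] == CHILD:
            for parent in (head, spouse):
                if parent:
                    is_mother = household[parent - 1]["sex"] == FEMALE
                    row["momloc" if is_mother else "poploc"] = parent
        rows.append(row)
    return rows


# Illustrative household: a couple and three children.
simple = [
    {"relate": HEAD, "sex": MALE},
    {"relate": SPOUSE, "sex": FEMALE},
    {"relate": CHILD, "sex": FEMALE},
    {"relate": CHILD, "sex": MALE},
    {"relate": CHILD, "sex": FEMALE},
]
for row in build_pointers(simple):
    print(row)
```

In a complex household (for example one with grandchildren or unrelated subfamilies) relationship-to-head alone does not identify who is whose parent or spouse, so a rule this simple would not be enough.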
Family Interrelationship Pointers. 13 censuses include data on location of parent or spouse. Under age 18.
Variable Description (Marital status)
Comparability Discussion (Marital status)
Enumeration Text (Marital status)
Enumeration Text (Marital status, Cambodia)
Variable Codes (Marital status)
Extract Step 4 – Attach Characteristics
• Age of spouse
• Employment status of father
• Occupation of father
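Attaching a characteristic amounts to following a pointer from each person to another household member and copying one of that member's variables onto the person's own record. The sketch below illustrates that lookup under stated assumptions. The field names (`pernum`, `sploc`, `poploc`, `age`, `occ`), the helper `attach`, and the sample records are hypothetical and are not the extract system's actual implementation.

```python
def attach(household, pointer, source_var, new_var):
    """For each person, copy source_var from the household member named by
    the pointer into new_var; None when the pointer is 0 (no such member).
    Modifies the records in place and returns the household."""
    by_pernum = {p["pernum"]: p for p in household}
    for person in household:
        other = by_pernum.get(person[pointer])
        person[new_var] = other[source_var] if other else None
    return household


# Illustrative household: a couple and one child.
household = [
    {"pernum": 1, "age": 40, "occ": "farmer", "sploc": 2, "poploc": 0},
    {"pernum": 2, "age": 38, "occ": "teacher", "sploc": 1, "poploc": 0},
    {"pernum": 3, "age": 12, "occ": None, "sploc": 0, "poploc": 1},
]

attach(household, "sploc", "age", "age_sp")    # age of spouse
attach(household, "poploc", "occ", "occ_pop")  # occupation of father
for person in household:
    print(person["pernum"], person["age_sp"], person["occ_pop"])
```

The same call with a mother pointer, or with a different source variable such as employment status, would cover the other combinations listed above.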