Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota kgenadek@umn.edu The IPUMS projects are funded by the National Science Foundation and the National Institutes of Health 00:00
Overview • What is IPUMS? • Microdata and Summary Data • IPUMS-USA • IPUMS-CPS • Online Analysis System • Online Demonstration • Questions 00:44
What is IPUMS? Integrated - consistent codes, labels, and documentation Public Use - anonymized, downloadable Microdata - individual-level Series - pooled data over time and place 1:26
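To make "integrated" concrete, here is a minimal sketch of what consistent codes across samples could look like. The years, raw codes, and labels below are invented for illustration and are not real census codings; the point is only that each sample's own coding is translated into one shared scheme so records can be pooled.

```python
# Hypothetical year-specific codings of the same question (marital status).
# These raw codes are invented for illustration; they are not real census codes.
RAW_TO_HARMONIZED = {
    1900: {"M": 1, "S": 2, "Wd": 3},
    2000: {1: 1, 5: 2, 2: 3},
}

# One shared set of labels applies to every sample after harmonization.
HARMONIZED_LABELS = {1: "Married", 2: "Never married", 3: "Widowed"}


def harmonize(year, raw_code):
    """Translate a year-specific raw code into the shared code."""
    return RAW_TO_HARMONIZED[year][raw_code]


# Raw records from two different samples, each in its own coding.
records = [(1900, "M"), (1900, "Wd"), (2000, 5), (2000, 1)]

# After harmonization, both samples share one code scheme and can be pooled.
pooled = [(year, harmonize(year, raw)) for year, raw in records]

for year, code in pooled:
    print(year, code, HARMONIZED_LABELS[code])
```

With the shared codes in place, a single filter such as "code equals 1" selects married persons in every sample, which is what makes comparisons over time practical.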
But, What is IPUMS Data? Individual level: • Demographic Data • Census Data • Survey Data • Health Data • Historical Data • Migration Data • Time Use Data Summary level: • Demographic Data • Census Data • Historical Data • Mapping Data 2:09
MPC Data Projects http://www.ipums.org/ 2:41
Microdata: MICRODATA AND SUMMARY DATA 4:40
Microdata versus Summary Data Microdata: • Shows full range of responses for individuals • Enable custom tables and sophisticated analyses • Suppression: geography, truncation, and item-level suppression Summary Data: • Premade or published tables of aggregate characteristics • Enable examination of small geographic areas • Suppression: limited content, grouped intervals, and cell suppression 4:40
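The "custom tables" point can be illustrated with a small sketch. The person records and variable names below are invented toy values, not real IPUMS data; the idea is that with individual-level records you choose the cross-tabulation yourself instead of relying on a table someone else published.

```python
from collections import Counter

# Toy person-level records (invented values, not real IPUMS data).
people = [
    {"sex": "F", "educ": "College"},
    {"sex": "F", "educ": "High school"},
    {"sex": "M", "educ": "College"},
    {"sex": "F", "educ": "College"},
    {"sex": "M", "educ": "High school"},
]

# Build a custom sex-by-education table by counting individual records.
table = Counter((p["sex"], p["educ"]) for p in people)

for (sex, educ), count in sorted(table.items()):
    print(f"{sex:2} {educ:12} {count}")
```

Swapping `educ` for any other variable in the records gives a different table from the same data, which summary data cannot offer once the tables have been fixed in advance.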
Summary Data 5:44
IPUMS Data Structure Household record (shaded) followed by a person record for each member of the household For each type of record, columns correspond to specific variables (variables labeled in the figure: Age, Birthplace, Mother’s birthplace, Sex, Relationship, Race, Occupation) 5:54
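The household-then-persons layout described on this slide can be sketched as a small parser. The fixed-width layout here is hypothetical (record type in column 1, a 4-digit household serial, then a state code for households or age and sex for persons); real IPUMS files define their own column positions in the documentation that accompanies each extract.

```python
# Hypothetical fixed-width lines: "H" = household record, "P" = person record.
# The column positions are invented for this sketch, not a real IPUMS layout.
lines = [
    "H0001MN",
    "P000134F",
    "P000136M",
    "H0002WI",
    "P000271F",
]


def parse(lines):
    """Group each person record under the household record that precedes it."""
    households = []
    for line in lines:
        rectype = line[0]
        if rectype == "H":
            households.append(
                {"serial": line[1:5], "state": line[5:7], "persons": []}
            )
        elif rectype == "P":
            if not households:
                raise ValueError("person record appears before any household")
            households[-1]["persons"].append(
                {"age": int(line[5:7]), "sex": line[7]}
            )
        else:
            raise ValueError(f"unknown record type: {rectype!r}")
    return households


households = parse(lines)
for h in households:
    print(h["serial"], h["state"], len(h["persons"]), "person(s)")
```

Keeping persons nested under their household makes it straightforward to attach household characteristics, such as the state, to each member.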
Microdata: IPUMS-USA 6:50
IPUMS-USA • Database includes public use microdata samples: • U.S. decennial censuses (1850-2000) • Complete-count dataset for 1880 • Linked Samples 1850 – 1930 • Samples from Puerto Rico (1910-2008) • American Community Survey (2000-2009) • The first MPC data project • Most widely used database ~ 30,000 users 6:53
Census Samples 8:31
The American Community Survey • Replaced the long form of the Decennial Census • Demonstration stage: 2000 to 2004 • Full implementation 2005, group quarters added 2006 • Rolling sample design Microdata samples: • Full survey responses for 1% of US population • Yearly samples, multi-year samples 9:18
ACS Samples 10:03
Census and ACS Variable Topics • Basic demographic • Marriage • Family structure • Fertility • Ethnicity • Disability • Education • Work • Income • Migration • Housing Characteristics 10:13
Geography Limitations • No confidentiality restrictions for samples prior to 1940 – no geographic limitation • Samples from 1940-1970 • Limited and inconsistent geographic identifiers • Recent samples: • State • Some Metropolitan Areas • County Groups • Public Use Microdata Areas (PUMAs) 10:43
What are PUMAs? • Public Use Microdata Areas (PUMAs) • Each contains at least 100,000 persons • Boundaries do not always align with jurisdictional boundaries • Detailed contents and maps available • GIS shape files for PUMAs available 11:26
Microdata: IPUMS-CPS 11:53
Current Population Survey (CPS) • Administered starting 1940 • Monthly survey conducted by the Census Bureau for the Bureau of Labor Statistics • Household survey was designed to measure unemployment • Source of the official government statistics on employment and unemployment • In 2009 - 57,000 households interviewed monthly 11:55
Current Population Survey March Supplement • All March respondents • Additional respondents from February, March and November monthly samples • Data are collected for Armed Forces members residing with their families • March Annual Social and Economic Supplement is the most widely used by social scientists and policymakers 12:20
Current Population Survey March Supplement • Labor force participation and unemployment • Work experience and educational attainment • Sources of income including non-cash benefits • Program participation • Tax filing status • Health Insurance • Migration 12:51
IPUMS - CPS • All March Data (Back to 1962) • Basic Monthly Surveys • Samples from 2000-2008 (back to 1976 soon) • Data for every month • ~50,000 households surveyed each month • Fewer variables than March supplement • Demographic information • Family characteristics • Employment status • Education information 13:16
Obtaining Data: ONLINE ANALYSIS SYSTEM 14:26
Online Analysis System • High-speed tabulation software developed at UC-Berkeley • Allows for analysis of microdata without statistical package • All analysis performed online • Can analyze multiple years of data • Help guides on webpage 14:26
Features • Data analysis capabilities • Frequencies and cross tabulations (including charts) • Comparisons of means (with complex std errors) • Correlation matrix • Comparisons of correlations • Regression (ordinary least squares) • Logit and probit regression • List values of individual cases 15:02
Where is this online tabulator? • Follow the link ‘Analyze Data Online’ from the homepage of: • usa.ipums.org/usa/ • cps.ipums.org/cps/ • In USA, select all samples for the year of interest • Open IPUMS-USA or CPS in an additional tab for documentation 15:41
Obtaining Data: USE THIS DATA 16:00
Microdata for Analysis • Documentation is Important!!! • Use the IPUMS documentation • Be aware of top/bottom codes, NIU codes, and missing data codes • Know the universe – who got asked the question • Weights – make estimates representative • See additional weights presentation • Sample size is important • Check analysis without weights 16:01
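The points about NIU codes, universes, and weights can be sketched with a few toy records. The field names, the NIU value, and the weights below are assumptions made up for this example; the actual codes and weight variables for any given variable are listed in the IPUMS documentation.

```python
# Hypothetical "not in universe" code for the income variable (assumed value).
NIU = 9999999

# Toy person records: each weight is the number of people the record represents.
people = [
    {"weight": 100, "income": 20000},
    {"weight": 100, "income": 40000},
    {"weight": 100, "income": NIU},  # not asked the question; must be excluded
    {"weight": 200, "income": 60000},
]

# Know the universe: drop records that were never asked the question.
in_universe = [p for p in people if p["income"] != NIU]

# Unweighted mean treats every sampled record equally.
unweighted_mean = sum(p["income"] for p in in_universe) / len(in_universe)

# Weighted mean counts each record as many times as its weight indicates.
weighted_mean = (
    sum(p["weight"] * p["income"] for p in in_universe)
    / sum(p["weight"] for p in in_universe)
)

# Forgetting to exclude NIU codes silently inflates the estimate.
naive_mean = sum(p["income"] for p in people) / len(people)

print(unweighted_mean)  # 40000.0
print(weighted_mean)    # 45000.0
print(naive_mean > weighted_mean)  # True
```

Comparing the weighted and unweighted results, as the slide suggests, is a quick check on how much the weights change the answer.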
Microdata for Analysis • Allows more complex analysis than summary data • Geographic Restrictions • State Level Analysis • Metro Area level Analysis • Time series – change over time • Not downloading tons of tables 18:43
IPUMS is Awesome • Comprehensive online documentation • Integration makes analyzing change over time possible • Data analysis system allows you to access the data and analyze it online • All of the data are available for free online • User support is available by e-mail to help you as needed 19:31
Social Explorer - Shout Out • Produces online maps and data reports • Based on boundary files made available through NHGIS • Map changes in census data over time • http://www.socialexplorer.com/ 20:23
Obtaining Data: DISCUSSION OF “WEIGHTING” AND ONLINE DEMO OF IPUMS 20:54
Questions – email us IPUMS User Support ipums@umn.edu Contact: Katie Genadek kgenadek@umn.edu 32:00