100 likes | 188 Views
Approach for Obtaining State Population Dataset. Kim-Keen Wee. Wyoming Map. Outline. Step 1: Obtain Boundary File of the State Step 2: Preprocess the Boundary File Step 3: Get Report of Census 2000 on Selected State (by County) Step 4: Generate Random Population (by County)
E N D
Approach for Obtaining State Population Dataset Kim-Keen Wee
Outline • Step 1: Obtain Boundary File of the State • Step 2: Preprocess the Boundary File • Step 3: Get Report of Census 2000 on Selected State (by County) • Step 4: Generate Random Population (by County) • Step 5: Create Class Label Based on Census Data • Step 6: Combine all Counties
Step 1: Obtain Boundary File of the State • First, get the boundary file (Lon/Lat coordinate) of the state from the U.S. Census Bureau website http://www.census.gov/geo/www/cob/co2000.html • Pick the state of interest and download the zip folder from the ASCII format section (i.e. co56_d00_ascii_WY.zip) • Of the two files in the folder, we will use the one that contains the coordinate (i.e. co56_d00.dat)
Step 2: Preprocess the Boundary File • Separate the file into individual counties (to be used later by S-PLUS), each is a polygon Each polygon file contains points (x,y) with the last point equal to the first point, therefore forming a polygon *this county (left) has 692 points
Step 3: Get Report of Census 2000 on Selected State (by County) • The total number of population for a state can be found here: http://www.census.gov/popest/counties/CO-EST2003-01.html • Pick the state of interest
Step 4: Generate Random Population (by County) • Use S-PLUS to generate random location for each county Note: S-PLUS, based on the S language, is the package for exploratory data analysis and statistical data mining. • Input a polygon (a county) into S-PLUS and specify the number of population you want to create.
Step 5: Create Class Label Based on Census Data • Create datasets based on: • Household Income in 1999 • Poverty Status in 1999 • Race • Age
Household Income in 1999 Wyoming Park County
Step 6: Combine all Counties • After all counties has been created with class label, the last step is to combine all counties together, thus forming a complete State Dataset