450 likes | 797 Views
Presented By: Dr. Michael Kaylen University of Missouri. TIPS FOR SURVEY DATA ANALYSIS. Survey Data analysis involves transforming survey data into information. Data Information. Introduction. DATA. Number of Travelers in MO by State of Origin and Month. INFORMATION.
E N D
Presented By: Dr. Michael Kaylen University of Missouri TIPS FOR SURVEY DATA ANALYSIS
Survey Data analysis involves transforming survey data into information. Data Information Introduction
Number of Travelers in MO • by State of Origin and Month. INFORMATION
Data Information • Tips Focus On • Excel Pivot Tables • Weighted Data • Application to Household Panel Data INTRODUCTION
HOUSEHOLD PANEL DATA • Monthly Surveys of Households • 3 Levels of Data • Household (demographics) • Trip (# traveling, states visited, etc.) • State (# nights by lodging type, expenditures, etc.) • Simulated Data
HOUSEHOLD PANEL DATA • Household Level Data (HOUSE! - 54,824 Observations) • Household ID • Month • # Trips • Origin State • Household Income Range • Two Weights
HOUSEHOLD PANEL DATA • Household Level Data • Trip Level Data (TRIP! - 21,144 Observations) • Household Level Data • # Household Members on Trip • Primary Trip Purpose • Primary Transportation Mode • (0/1) Code for Each State • Three Weights
HOUSEHOLD PANEL DATA • Household Level Data • Trip Level Data • State Level Data (STATE! - 23,225 Observations) • Household and Trip Level Data • Detailed State • # Nights by Lodging Type • Expenditures by Category • (0/1) Code for Activities • Three Weights
EXCEL PIVOT TABLES • Analyze Data Using 3 operations: • Group Data into Categories • Ex. - Create a PivotTable
Put cursor anywhere in data table, worksheet house. • Click on Insert Tab
To Group: Drag Fields to • Row/Column Labels
Cross-tab using both Row • and Column Labels
EXCEL PIVOT TABLES • Analyze Data Using 3 Operations: • Group Data into Categories • Summarize Data Using Calculations • Count, Sum, Average, Maximum, Minimum, Standard Deviation • Ex.- Look at number of households in sample, by state of origin and month.
Change the type of calculation by clicking on the drop-down menu
EXCEL PIVOT TABLES Analyze Data Using 3 Operations: Group Data into Categories Summarize Data Using Calculations Filter Results Can be used to view a subset of results
WEIGHTED DATA • Weights are used to Project Sample Data to a Population • Ex. – A Household Weight of 10,000 means that particular household “represents” 10,000 households in the population
WEIGHTED DATA • The Design Weight of a sample element is the inverse of its inclusion probability • Ex. – If 20,000 households are chosen from a simple random sampling design from 100,000,000 households, the design weight is 100,000,000/20,000 = 5,000
WEIGHTED DATA • Calibration Weights - computed using Data on auxiliary variables (e.g., demographics) • “Balance” sample data. • Ex. – If studying travel to MO and sample under-represents neighboring states.
WEIGHTED DATA CALCULATIONS WITH WEIGHTS • To estimate population totals: Ex. – To estimate the total number of household trips, create a new variable: WT_HH * HH_Trips
WEIGHTED DATA CALCULATIONS WITH WEIGHTS • To estimate population totals: • To estimate population averages:
POTPOURRI • Monitor number of observations, by strata (e.g., month, state). • - Sample size is critical to accuracy. • Monitor sum of weights over all observations, by strata. • - Weight totals should reflect population numbers.
POTPOURRI • Be careful projecting to other than the sample design population. Ex. 1 – Sampled Households, but interested in Household Trips (e.g., What percent of all household trips included travel in MO?).
POTPOURRI - Already used data in HOUSE! to estimate 138,511,079 household trips taken during 3 months. - TRIP! contains detailed data on trips, each row (observation) corresponding to one trip. - Problem: household weights over all trips in TRIP! sum to only 124,116,209
POTPOURRI Why the discrepancy? Sampled households could only provide details for up to 3 trips, regardless of the number of trips actually taken. Solution: create a new weight WT_HHTrip =
PivotTable showing Sum of WT_HHTrip, grouped by TR_VisitMO About 2.9% of all HH trips included MO.
POTPOURRI • Be careful projecting to other than the sample design population. Ex. 1 – Sampled Households, but interested in Household Trips. Ex. 2 – Sampled Households, but interested in Travelers (e.g., What percent of all travelers visited MO?).
POTPOURRI - The original data set contains two numbers of potential interest for each detailed trip: the number of people in the travel party and the number of household members in the travel party. - Problem: which numbers to use?
POTPOURRI Solution: Since the sampling design was based on households, not travel parties, use the number of household members in the travel party. WT_PersTrip = WT_HHTrip * TR_HHMemTot
PivotTable showing Sum of WT_PersTrip, grouped by TR_VisitMO About 2.9% of all travelers visited MO.
Questions, Comments? Thank You!