1 / 38

Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Assessing Nonresponse Bias and Measurement Error in Estimates of Employment. John Dixon Clyde Tucker Polly Phipps Bureau of Labor Statistics any opinions expressed in this paper are those of the authors and do not constitute policy of the Bureau of Labor Statistics. Types of Nonresponse.

velvet
Download Presentation

Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Assessing Nonresponse Bias and Measurement Error in Estimates of Employment John Dixon Clyde Tucker Polly Phipps Bureau of Labor Statistics any opinions expressed in this paper are those of the authors and do not constitute policy of the Bureau of Labor Statistics

  2. Types of Nonresponse • Ignorable: Conditioning on known auxiliary variables some bias may be eliminated from estimates—accomplished by making weighting adjustments. • Nonignorable: Bias that cannot be eliminated by conditioning on auxiliary variables—remains in estimates and contributes to mean squared error • Weighting adjustments may be of only limited utility

  3. Previous Research • Household surveys • Haraldsen, et al. 1999;Keeter, et al. 2000;Curtin, et al. 2000 found little nonresponse bias • Brick and Bose (2001) found weighting was effective • Most results confined to response rates between 40% and 70%

  4. Previous Research • Establishment Surveys • Nonresponding units can have more effect than in household surveys • Tomaskovic-Devey, et al. (1994) found differential nonresponse by size, industry, and profitability of firm • Most research focused on characterizing nonrespondents (Hidiroglou, et al. 1993 and Phipps et al. 2007, adjusting for nonresponse (Sommers, et al. 2004) or understanding why establishments don’t respond (McCarthy, et al. 1999 and McCarthy and Beckler 2000) • Copeland (2003), in contrast, looked at the bias in early estimates that resulted from late responders in the Current Employment Statistics Program (CES) • Tucker, et al. (2005) began an examination of bias in CES using data from the QCEW (State UI files). This paper is a more thorough report on that work.

  5. The CES • Collects employment, hours and earnings monthly from a current sample of over 300,000 establishments • Tracks the gains and losses in jobs in various sectors of the economy • In this paper, nonresponse bias work on this survey focuses on estimating bias for establishment subpopulations with different patterns of nonresponse using data from the 2003 CES and QCEW (over 400,000 respondents)

  6. Nonresponse Error for Sample Mean In simplest terms OR Respondent Mean = Full Sample Mean + (Nonresponse Rate)*(Respondent Mean – Nonrespondent Mean)

  7. Theory • Levels of bias will differ by subpopulations • The difference between respondent and nonrespondent estimates will be greatest on either end of the nonresponse continuum, but potential bias greatest when response rates are low • Bias in business surveys may be greatest in the Services sector

  8. The Current Study of Nonresponse Bias in the CES • Because this work is theoretically driven, the paper just examines the difference in employment of respondents and nonrespondents but not the ultimate effect on estimates after accounting for response rate. • Furthermore, the CES benchmarks the estimates to the QCEW (the data used in this study) using estimation cells based on industry and region. • Thus, the eventual aggregate bias should be small.

  9. CES Response Rates by NAICS Categories

  10. Measurement error • Administrative data for employment counts is thought to be less error-prone than survey data. • Errors in the unemployment insurance counts have consequences for taxes or fines. • Surveys are often handled by less experienced staff. • Differences between the survey and the administrative data could be due to • definitional differences (e.g.:students usually aren’t counted in the unemployment insurance counts, but may be in the payroll counts) • reporting period

  11. Measurement error (continued) Both the administrative and the survey data vary from month to month, but if the survey has more measurement error, then the variability of the survey data relative to the administrative data could be used as a rough indicator of measurement error.

  12. Distribution of relative variance

  13. Measurement error measures and closing

  14. Estimate of Biases (nonresponse and measurement error) • Using the most recent employment reports in the QCEW (not CES) for both responders and nonresponders • Compare the employment reports for respondents to that for nonrespondents • Compare the variability of estimates within companies for the CES survey relative to the variability of the QCEW as an estimate of measurement bias. • Results presented are not weighted by probability of selection, but weighted results show similar patterns

  15. Nonresponse and Measurement Bias

  16. Quantile Regression • Bias analysis performed at the establishment level on subpopulations defined by size and industry • Testing for the difference in employment between CES responders and nonresponders. Y=a+Bx+e where x is an indicator of nonresponse (essentially a t-test). • Since size of firm is theorized to relate to nonresponse, the coefficients relating nonresponse to employment is likely to be different for different size firms. • Quantile regression examines the coefficients for different quantiles of the distribution of the sizes of firms. • Since industries can be expected to have different patterns, the quantile regressions are done by industry group.

  17. Interpretation of Quantile Regression of size on bias • At the micro level, the quantile regression shows the coefficients relating nonresponse (coded 0) to the size of firm. Each point on the curve is a regression relating nonresponse to size conditional on the rest of the distribution. The skewness affects the standard errors, so a stabilizing transformation is needed. This can be seen in the box & whisker plot to the right.

  18. Distribution of size and the quantile regression curve

  19. Employment and Measurement error

  20. Measures of Measurement error

  21. Measures of Nonresponse

  22. Interpretation of a Log Transformation • Using the log of size, the proportionate effect is greater for smaller firms. Since the coefficients are based on linear models, the transformation makes the distribution of responders and nonresponders more reasonable, as seen in the box & whisker plot to the right. • The quantile regression curve shows smaller firms have proportionately more bias than larger firms.

  23. Quantile regression using the log of size.

  24. Industry patterns • In almost all cases, there is some significant positive bias. • The most common pattern of the quantile curves is rapidly accelerating coefficients with increasing establishment size for almost half the industries. • Another pattern is flat coefficients until past the middle of the size distribution followed by accelerating coefficients. • A third pattern is a relatively gradual increase in the coefficients to an asymptote.

  25. Retail trade

  26. Accelerating patterns • The Agriculture, Forestry, Fishing and Hunting industries typify the rapidly accelerating pattern. The logged values show decreasing coefficients. The statistical significance shown by the confidence intervals varied.

  27. Accelerating pattern industries • Agriculture; mining; metal manufacturing; transportation and warehousing; information; finance and insurance; professional, scientific and technical services; administrative services and waste management; and accommodation and food services.

  28. Late accelerating pattern • The Health Care and Social Assistance industries typified the late accelerating pattern. While health care had positive coefficients, The real estate industry started negative and only became positive at the 80th quantile. • While most other patterns of the logged values showed decline, these showed an increase.

  29. Late Accelerating Industries • Health Care and social assistance; retail trade; real estate, rental and leasing; education services; and other services

  30. Asymptote pattern • The construction industry typifies the asymptote pattern of gradually increasing coefficients until the highest quantiles. • The logged values showed an early decline to an asymptote.

  31. Asymptote industries • The construction; food manufacturing; wood and mineral manufacturing; wholesale trade; and arts, entertainment and recreation showed the asymptote pattern. • They may have a critical mass, where once the size reaches the 90th quantile, there is a more uniform participation rate.

  32. Utilities • The utilities industry shows a relatively flat curve of coefficients. • The logged values were higher for small firms.

  33. Management of Companies and Enterprises • The coefficients is virtually flat until the highest quantiles and then turns decidedly . • The coefficients from the logged values linearly declined.

  34. Measurement error and nonresponse

  35. Flat patterns • Measurement error stayed mostly constant over the likelihood of nonresponse for Agriculture, Arts, Information, and Transportation

  36. Declining pattern • Only Real-estate had a declining pattern of measurement error over the likelihood of nonresponse.

  37. Summary • Quantile regression is a useful technique for studying nonresponse bias • The patterns of differences between industries varied • The most common pattern indicated efforts to adjust for nonresponse should be spent on larger firms to better estimate overall changes in employment • Yet, proportionate differences were higher for smaller firms • The patterns weren’t related to the nonresponse rate for the industry • Measurement error relates to smaller differences than nonresponse bias. Mixed responders had lower measurement error, contrary to expectations. • Most industries had an increase in measurement error as the probability of nonresponse increased.

  38. Assessing Nonresponse Bias and Measurement Error in Estimates of Employment John Dixon dixon.j@bls.gov Clyde Tucker tucker_p@bls.gov Polly Phipps phipps_p@bls.gov Bureau of Labor Statistics

More Related