1 / 26

Synthetic estimators in Ireland

Learn about synthetic estimators for small area studies in Ireland to improve precision & accuracy of prevalence estimates. Explore model-based estimators, issues, limits, confidentiality, validation, and future possibilities.

diegoj
Download Presentation

Synthetic estimators in Ireland

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Anthony Staines DCU Synthetic estimators in Ireland

  2. What are synthetic estimators? • Estimates of something you haven't got • Typically estimates for a small area of something • Making maximum use of what you have

  3. Example • Lung cancer risk • Smoking is a key explanation • Suppose you want to study the geography of lung cancer

  4. What you have • Smoking data from a national survey by age and sex • Small area level data on population and cancer incidence by age and sex

  5. What you can do at once • Estimate prevalence for small areas included in the study • Using the sample in the study

  6. What's wrong with this? • The areas you need may not be included • The estimates will be very imprecise

  7. You can do better • In some obvious ways • And some not so obvious

  8. What you assume • National age and sex specific rates apply in each small area

  9. And so • From these you calculate small area specific prevalence estimates • This is indirect standardisation • Can be done smarter • requiring aggregation properties to hold • Adding in area level covariates (urban/rural etc.)

  10. Can you do better? • Yes

  11. How?

  12. Model based estimators • These have a long history • Many diverse applications • Combine survey data and some kind of 'census data' • 'Census data' is that available for every area of interest

  13. Roughly • Use the survey data to estimate relationships • at the relevant level • between survey covariates • and the census data

  14. Then • Assume the same relationship applies in the other areas

  15. Issues • Modelling can be hard • Remember these are predictive models, not explanatory models • Data not easy to get at the right small area level

  16. Models • models using individual level covariates only • models using area level covariates only • models combining individual and area-level covariates

  17. Limits • Available data • Confidentiality • Complexity of methods, esp. multi-level methods • Validation

  18. Spatial data limits • Have to be able to link survey and census to the same set of small areas • Given the primitive systems in the UK and the nearly non-existent systems in the Republic this is a lot of work • Errors here will lead to biassed estimates

  19. Confidentiality • Need to respect confidentiality of survey respondents • May limit the data available for these purposes • May need to design survey and survey consent process carefully to get good estimates

  20. Modelling • Can become very complex • Clustered survey designs • Survey weights • Variable selection • Model diagnostics

  21. What and where to model • Data may exist at many different geographies • Multi-level models with individual, household, local and regional effects can be considered • GIS might be very useful here for data handling • Not advisable to aggregate covariates at different spatial levels • This is just making a bad embedded synthetic estimator

  22. Validation • Not easy to do, but essential • How do you validate your synthetic estimates? • Cross-validation? • Another survey? • ?

  23. Options • How about • Health Atlas Ireland? • This is a system built for HSE, (led by Howard Johnson) to plan health services • It already has • Maps • Census • HIPE • Mortality data

  24. Census output options • Recently they have developed a very flexible census output system • Uses census data at ED level • Locations of houses • Assumes that all the houses in a DED are exchangeable

  25. Census output options • Allocates census data to any given area • Directly weighted by using the number of households and the ED composition of the desired area

  26. Futures? • Modern design of surveys • Could readily be extended to do SA from almost any survey data where the necessary geographical data have bene collected • Greatly improves value for money of large scale surveys

More Related