1 / 19

Using administrative data to compile agricultural statistics

Using administrative data to compile agricultural statistics. Experiences from the Census of Agriculture 2010. Fiona O’Callaghan, CSO 29 th September 2011. Outline. Background Available data Analysis/aggregation Merging of data Lessons learnt Future developments Summary. Background.

Download Presentation

Using administrative data to compile agricultural statistics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Using administrative data to compile agricultural statistics Experiences from the Census of Agriculture 2010 Fiona O’Callaghan, CSO 29th September 2011

  2. Outline • Background • Available data • Analysis/aggregation • Merging of data • Lessons learnt • Future developments • Summary

  3. Background • CSO Statement of Strategy includes as a high level goals • Minimise response burden, and extend the statistical use of administrative records • Improve the scope, quality & timeliness of our statistics • Achieve greater efficiencies using best practices • SPAR Report - Statistical Potential of Administrative Records

  4. Background • Up until 2010, CSO conducted 2 annual farm surveys (June & Dec) with sample sizes ranging from 15,000 to 20,000 farms. Farm Structure Survey approx 50,000 farms. • June survey: average response burden approx. 30 mins. • December survey: average response burden approx. 18 mins. • COA 2010: average response burden 26 mins.

  5. Available Data • CSO identified six major data holdings within DAFF which contained relevant information • Animal Identification and Movement System (AIM) • Single Payment System (SPS) • Organic Database • REPS database • Animal Health Computer System (AHCS) • Corporate Client System (CCS)

  6. Available Data • Three of these databases were used for COA 2010 • CCS was used in developing the register • AIM was used for cattle • SPS was used for crops/cereals

  7. Analysis/aggregation • Corporate Client System • Contains name, address, DOB, herd number etc. • This was merged with the existing CSO Agriculture Register to form a new Register for COA 2010 • Issues involving the “unique” identifier, Herd Number, resulting in duplicates • Result – 153,904 Census forms issued

  8. Analysis/aggregation • AIM – involves the use of electronic means to capture data on animal movements through computer links established at livestock markets, meat plants, and export points • Data available since 2002 • At an aggregate level, the AIM figures for the bovine population have been consistently higher than the CSO estimates for the corresponding date (on average approx. 5% higher) • Preliminary comparative analysis performed by CSO in 2008 and 2009 • Eliminated 11 cattle questions from Census form

  9. Analysis/aggregation • AIM data consists of the following variables • Tag number • Herd number • DOB • Gender • Breed • Breed Type (Beef/Dairy) • Animal Class (Cow, bull, etc.) • Date of calving event

  10. Analysis/aggregation • Need to convert this information into totals for the following categories Breeding Cattle Dairy Cows Other Cows Dairy Heifers* Other Heifers Bulls Other Cattle Male: 2 years and over Female: 2 years and over Male: 1-2 years Female: 1-2 years Male: under 1 year Female: under 1 year * Heifers in calf intended for the dairy herd

  11. Analysis/aggregation • Information on heifers-in-calf not available in AIM database, but a proxy can be estimated. • Other categories can be derived directly using gender, DOB, animal class. • Eurostat requires further breakdown of categories into animals for slaughter – currently developing a methodology to model this.

  12. Analysis/aggregation • SPS • Information on every eligible parcel of land • XY coordinates, Herd Number, Area, Use • Preliminary analysis performed in 2009 • Eliminated 14 crops/cereals questions from Census form

  13. Analysis/aggregation • SPS • XY coordinates used to assign NUTS Region codes at farm level • Approx. 45% farms are spread over >1 DED - in these instances the DED containing the largest area owned was assigned

  14. Merging of data • Three separate data files • Census returns • AIM data • SPS data • “Unique” identifier – Herd Number – but many instances where one farmer could be associated with more than one Herd Number

  15. Merging of data • Labour intensive task of matching by name & address • Issues with Father & Son with same name & address • Different versions of names on different databases – Seamus/James, Sean/John etc. • Non-unique addresses • Farms that returned to CSO as Retired/Dead/Not a Farm etc. but active on admin. data

  16. Lessons Learnt • More collaboration between DAFF & CSO • Confidentiality – one-way transfer of information • Parallel pilot run in 2009

  17. Future Developments • Beyond the SPAR Initiative – cross sector efficiencies Piggy-backing on SFP online applications to collect remaining survey items on June Survey • Exploiting geo-coordinates • To produce interactive maps • To link with other databases • Create area frame designs

  18. Summary • Positive development for Farmers & CSO • Reduction in response burden – 25 questions dropped • Reduction in editing & processing of data • Result - a high quality register of agricultural holdings, and high quality data

  19. Questions?

More Related