1 / 16

A Matched-Model Geometric Mean Price Index for Supermarket Scanner Data

A Matched-Model Geometric Mean Price Index for Supermarket Scanner Data. Nicolette de Bruijn Peter Hein van Mulligen Jan de Haan Sixth EMG Workshop, 13 – 15 December 2006, UNSW, Sydney. Contents. Historical overview Advantages of using scanner data Daily practice Matched-model approach

brice
Download Presentation

A Matched-Model Geometric Mean Price Index for Supermarket Scanner Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Matched-Model Geometric Mean Price Index for Supermarket Scanner Data Nicolette de Bruijn Peter Hein van Mulligen Jan de Haan Sixth EMG Workshop, 13 – 15 December 2006, UNSW, Sydney

  2. Contents • Historical overview • Advantages of using scanner data • Daily practice • Matched-model approach • Simulations on scannerdata and results • Conclusions • Future plans

  3. Historical overview • 2002 • Introduction of use of scanner data in the CPI - 2 main supermarket organisations - field observations in these chains were cancelled • 2005 • Base year shift to 2004 and some improvements • 2007 • Base year shift to 2006 • 1 supermarket

  4. Advantages of using scanner data (1) • Improving quality: • Large numbers of items • Average real transaction prices • No more observation mistakes • Detailed and better weights • Efficiency: • Less survey work in the field • Less administrative burden for retailers

  5. Advantages of using scanner data (2) • More articles in scanner data than field survey articles: abundance of scanner data • Descriptions of articles in scanner data are narrower than descriptions for field survey articles

  6. Use of the data • EANs: unique products • EANs must be classified (COICOP/CBL) • All data are used for construction of weights • Weights per EAN on basis of turnover shares • New basket and new weights each year • Prices: unit values per EAN per supermarket • Laspeyres index per COICOP/CBL-group per supermarket • Scanner data indexes are aggregated with indexes based on field surveys for other supermarkets and shop types

  7. Why this study? • The current method is time-consuming - classification of EANs into CBL/COICOP-groups - choosing successors for disappearing EANs • Therefore it is impossible to use more scanner data from other supermarkets • CBS would like to scale up on scanner data of supermarkets • A more efficient method is needed matched model without explicit QA

  8. Differences between current method and proposed method • Geometric mean vs. Arithmetic mean • Explicit QA vs. class mean imputation • Fixed weight per EAN vs. no weights • Fixed basket vs. monthly basket

  9. Benchmark versus scenario 1 • Benchmark: CBL index on basis of geometric mean • Scenario 1: CBL index on basis of arithmetic mean (current situation) • Advantages of geometric mean: - substitution, less sensitive in heterogenic groups

  10. Scenario 2: disappearing EANs • About 100 disappearing EANs per month • Imputation for unimportant EANs • Important EANs are ‘replaced’ by hand • possibility to correct explicitly for quality changes • Finding successors and applying explicit quality corrections is time-consuming • Scenario 2: no explicit quality adjustments but average index change of CBL-group

  11. Scenario 3: weights • Scenario 3: no explicit weights are used, each EAN gets the same weight • Problems scenario 3: weights are required because of heterogeneity within CBL-groups • Equal weighting yields unacceptable differences for CBL- and COICOP-groups • But fixed weights are necessary, as monthly weights result in an upward drift

  12. Results

  13. Matched-model simulation • Monthly basket of matched EANs • Geometric mean and no explicit adjustments for quality changes • Using monthly weights or no weights at all is not appropriate, therefore: rough weighing • Sample on basis of turnover shares per CBL-group • Variation in coverage: 60%, 70%, …, 95% • NB: simulation on (official) ‘basket items’

  14. Results matched model

  15. Conclusions • When the process is fully automated we can scale up the amount of supermarket scanner data • Classifying EANs into COICOP/CBL-groups will remain time consuming; a solution has to be found before scaling up

  16. The future • 2007 • Pilot study: implementation of matched model approach for 2 supermarkets to detect practical and logistic difficulties • 2008 • Matched-model geometric mean index in production • Reduction of field surveys in supermarkets

More Related