
Detection Techniques Applied

An overview of deviation analysis, correlation analysis, and duplicate analysis for detecting possibly falsified responses in survey data, including how the top duplicate pairs cluster by supervisor.



Presentation Transcript


  1. Detection Techniques Applied Ali Mushtaq WSS – Dec 2

  2. Survey Data Overview • PAPI (paper-and-pencil interviewing) • 5,000+ completed responses • ~150 interviewers: 10 to 50 interviews each • 20+ supervisors: each overseeing 75 to over 500 interviews • Questions: ~200, multiple-response categorical

  3. Deviation Analysis • Deviation Score: compare response pattern distributions, stratified by group, with the deviation computed using a statistic such as chi-square • Correlation Analysis: the same comparison, applied to joint response patterns • Over all questions: how many are outlying, and what is the average deviation? • Assumption: falsified data is small in scale and is likely to deviate from the overall distribution
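The deviation score can be illustrated with a minimal sketch. It is not the presenter's implementation: the record layout (a dict with an `interviewer` key and an `answers` list), the function names `deviation_scores` and `summarize`, and the `threshold` value are all hypothetical. It assumes every record answers every question with a single categorical code, and it uses a plain chi-square statistic of each group's counts against the counts expected from the overall distribution.

```python
from collections import Counter, defaultdict


def deviation_scores(records, group_key="interviewer"):
    """Return {group: [chi-square statistic per question]}.

    records: list of dicts, each with `group_key` and an 'answers' list
    of categorical codes (hypothetical layout, no missing answers).
    """
    n_questions = len(records[0]["answers"])
    overall = [Counter() for _ in range(n_questions)]
    by_group = defaultdict(lambda: [Counter() for _ in range(n_questions)])

    # Tally answer counts per question, overall and per group.
    for rec in records:
        for q, ans in enumerate(rec["answers"]):
            overall[q][ans] += 1
            by_group[rec[group_key]][q][ans] += 1

    total = len(records)
    scores = {}
    for group, counters in by_group.items():
        per_question = []
        for q in range(n_questions):
            n_group = sum(counters[q].values())
            chi2 = 0.0
            for category, overall_count in overall[q].items():
                # Count expected if the group followed the overall distribution.
                expected = n_group * overall_count / total
                observed = counters[q][category]
                chi2 += (observed - expected) ** 2 / expected
            per_question.append(chi2)
        scores[group] = per_question
    return scores


def summarize(scores, threshold):
    """Per group: number of outlying questions and the average deviation."""
    return {
        group: {
            "n_outlying": sum(s > threshold for s in per_q),
            "mean_deviation": sum(per_q) / len(per_q),
        }
        for group, per_q in scores.items()
    }
```

Groups with many outlying questions or a high average deviation would be candidates for closer review. The cutoff is a judgment call; a real analysis would likely derive it from the chi-square distribution with the appropriate degrees of freedom.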

  4. Duplicate Analysis • Compare one interview record against all others, one at a time, to measure the length of duplicate sequences • Flag pairs with long duplicate sequences • Examine unusually long sequences (especially complete duplicates) • Identify clusters by interviewer and supervisor
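The pairwise comparison can be sketched as follows. The slides do not define "duplicate sequence" precisely, so this sketch assumes it means the longest run of consecutive questions on which two interviews give identical answers in the same positions. The names `longest_shared_run` and `flag_pairs`, the `min_run` parameter, and the dict-of-lists input are hypothetical.

```python
from itertools import combinations


def longest_shared_run(a, b):
    """Length of the longest run of consecutive positions where a and b agree."""
    best = current = 0
    for x, y in zip(a, b):
        current = current + 1 if x == y else 0
        best = max(best, current)
    return best


def flag_pairs(interviews, min_run):
    """Compare every pair of interviews and flag long duplicate sequences.

    interviews: dict mapping interview id -> list of answers in question order.
    Returns (id_a, id_b, run_length) tuples, longest run first.
    """
    flagged = []
    for (id_a, ans_a), (id_b, ans_b) in combinations(interviews.items(), 2):
        run = longest_shared_run(ans_a, ans_b)
        if run >= min_run:
            flagged.append((id_a, id_b, run))
    return sorted(flagged, key=lambda pair: -pair[2])
```

With 5,000+ interviews this is on the order of 12.5 million pairs, each compared over about 200 questions, which is still feasible as a batch job. A run equal to the full questionnaire length marks a complete duplicate.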

  5. Top Duplicates Survey A • “Top Duplicates”: pairs of surveys sharing half or more of their responses in sequence • Top Duplicates Clustered by Supervisor: • All other supervisors (15+) have no top duplicates
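Counting top duplicates per supervisor can be sketched in a few lines. This is an illustration under stated assumptions, not the presenter's method: the input is a list of hypothetical `(id_a, id_b, run_length)` tuples, where `run_length` is the number of responses the two surveys share in sequence, and `supervisor_of` is a hypothetical lookup from interview id to supervisor. A pair whose two interviews fall under different supervisors is counted once for each.

```python
from collections import Counter


def top_duplicates_by_supervisor(pairs, supervisor_of, n_questions):
    """Count top-duplicate pairs per supervisor.

    pairs: iterable of (id_a, id_b, run_length) tuples.
    supervisor_of: dict mapping interview id -> supervisor id.
    A pair is a "top duplicate" when run_length is at least half of n_questions.
    """
    threshold = n_questions / 2
    counts = Counter()
    for id_a, id_b, run in pairs:
        if run >= threshold:
            # A set, so a pair within one supervisor is counted only once.
            for supervisor in {supervisor_of[id_a], supervisor_of[id_b]}:
                counts[supervisor] += 1
    return counts
```

Supervisors missing from the result have no top duplicates, which matches how the slide summarizes the remaining supervisors.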

  6. Thank You! Questions?

  7. Top Duplicates Survey B • Top Duplicates Clustered by Supervisor: • All other supervisors (15+) have none of the top duplicates
