Multiple Indicator Cluster Surveys Data Processing Workshop
Structure Checking and Verification
Structure Checking Flow Chart
1. Main Data Entry (Data Entry Operator 1)
2. Structure Check (DP Supervisor)
3. Check ok?
   • No: Investigate Errors (DP Supervisor), then Correct Main Data File (Data Entry Operator 1), then return to step 2
   • Yes: Verification Data Entry (Data Entry Operator 2)
Structure Checking
• Check that the number of questionnaires in the data file matches the number of questionnaires on the cluster tracking form
• Check that the household information panel counts of women, under-fives and men (HH12-15) match the number of questionnaires in the data file
The Cluster Tracking Form
• Allows the supervisor to organize and track data entry progress
• Questionnaire totals entered by the questionnaire administrator
• Other information entered by the supervisor
Cluster Tracking Checks
• Check that the questionnaires in the data file equal the questionnaires on the cluster tracking form
• Check done for:
   • Complete, Incomplete, and Total Households
   • Complete, Incomplete, and Total Women
   • Complete, Incomplete, and Total Men
   • Complete, Incomplete, and Total Children
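The cluster tracking comparison above can be sketched in a few lines of Python. This is only an illustration, not the workshop's actual program: the dictionary layout, the category names, and all counts are invented for the example.

```python
# Hypothetical layout: counts[category][status], for illustration only.
CATEGORIES = ("households", "women", "men", "children")
STATUSES = ("complete", "incomplete", "total")


def tracking_mismatches(form_counts, file_counts):
    """Return (category, status, on_form, in_file) for every count that differs."""
    mismatches = []
    for category in CATEGORIES:
        for status in STATUSES:
            on_form = form_counts[category][status]
            in_file = file_counts[category][status]
            if on_form != in_file:
                mismatches.append((category, status, on_form, in_file))
    return mismatches


# Invented counts for one cluster: totals from the cluster tracking form ...
form_counts = {
    "households": {"complete": 18, "incomplete": 2, "total": 20},
    "women": {"complete": 21, "incomplete": 1, "total": 22},
    "men": {"complete": 15, "incomplete": 3, "total": 18},
    "children": {"complete": 12, "incomplete": 0, "total": 12},
}
# ... and totals counted in the data file (one women's questionnaire was not keyed).
file_counts = {
    "households": {"complete": 18, "incomplete": 2, "total": 20},
    "women": {"complete": 20, "incomplete": 1, "total": 21},
    "men": {"complete": 15, "incomplete": 3, "total": 18},
    "children": {"complete": 12, "incomplete": 0, "total": 12},
}

for row in tracking_mismatches(form_counts, file_counts):
    print(row)
```

An empty result means the data file agrees with the tracking form for all twelve counts; any row returned points to the questionnaire type to recount.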
Cluster-Level Checks
• Women’s questionnaires = Σ HH12
• Completed women’s questionnaires = Σ HH13
• Men’s questionnaires = Σ HH13A
• Completed men’s questionnaires = Σ HH13B
• Under-five questionnaires = Σ HH14
• Completed under-five questionnaires = Σ HH15
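As a rough illustration of the cluster-level sums above, the sketch below adds up each household information panel item over a cluster's households and compares the sum with the number of questionnaires counted in the data file. The record layout, the keys of `found`, and the numbers are assumptions made for the example.

```python
# Panel item -> hypothetical key holding the matching questionnaire count.
CLUSTER_CHECKS = [
    ("HH12", "women"),
    ("HH13", "women_completed"),
    ("HH13A", "men"),
    ("HH13B", "men_completed"),
    ("HH14", "under_fives"),
    ("HH15", "under_fives_completed"),
]


def cluster_check(households, found):
    """Return (item, sum over households, questionnaires found, match?) per check."""
    rows = []
    for item, key in CLUSTER_CHECKS:
        expected = sum(hh[item] for hh in households)
        rows.append((item, expected, found[key], expected == found[key]))
    return rows


# Invented household information panels for a two-household cluster.
households = [
    {"HH12": 2, "HH13": 2, "HH13A": 1, "HH13B": 1, "HH14": 1, "HH15": 1},
    {"HH12": 1, "HH13": 1, "HH13A": 2, "HH13B": 1, "HH14": 2, "HH15": 2},
]
# Invented questionnaire counts from the data file; one completed
# under-five questionnaire is missing.
found = {
    "women": 3, "women_completed": 3,
    "men": 3, "men_completed": 2,
    "under_fives": 3, "under_fives_completed": 2,
}

for item, expected, actual, ok in cluster_check(households, found):
    print(item, expected, actual, "ok" if ok else "MISMATCH")
```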
Sample Output - Cluster
MICS 5 Data Structure Check
Cluster: 3
    Households     |         Women         |           Men           |        Children
Total  Comp Incomp | Eligible  Interviewed | Eligible    Interviewed | Eligible  Interviewed
                   | HH12 FOUND HH13 FOUND | HH13A FOUND HH13B FOUND | HH14 FOUND HH15 FOUND
    2     1      1 |    5     5    4     4 |     6     6     5     5 |    4     4    3     3
• Compare totals to those on the cluster tracking form
• If the totals differ, check the cluster’s questionnaires and resolve the difference(s)
Household-Level Checks
• Women’s questionnaires = HH12
• Completed women’s questionnaires = HH13
• Men’s questionnaires = HH13A
• Completed men’s questionnaires = HH13B
• Under-five questionnaires = HH14
• Completed under-five questionnaires = HH15
Sample Output - Household
MICS5 Data Structure Check
Household: 1   Result: 1
         Women         |           Men           |        Children
 Eligible  Interviewed | Eligible    Interviewed | Eligible  Interviewed
 HH12 FOUND HH13 FOUND | HH13A FOUND HH13B FOUND | HH14 FOUND HH15 FOUND
    4     4    3     3 |     2     2     1     1 |    2     2    2     2
• Use the listing of households to identify the source of cluster-level problems
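Tracing a cluster-level problem back to a household could look roughly like the sketch below, which lists every household whose panel value differs from the number of questionnaires found. Household 1 uses the values from the sample output; household 2 and the dictionary layout are invented for illustration.

```python
ITEMS = ("HH12", "HH13", "HH13A", "HH13B", "HH14", "HH15")


def household_problems(panel, found):
    """Return (household, item, panel value, questionnaires found) for each mismatch.

    Both arguments map a household number to a dict of item -> count
    (a hypothetical layout chosen for this example).
    """
    problems = []
    for hh_number in sorted(panel):
        for item in ITEMS:
            expected = panel[hh_number][item]
            actual = found.get(hh_number, {}).get(item, 0)
            if expected != actual:
                problems.append((hh_number, item, expected, actual))
    return problems


panel = {
    1: {"HH12": 4, "HH13": 3, "HH13A": 2, "HH13B": 1, "HH14": 2, "HH15": 2},
    2: {"HH12": 1, "HH13": 1, "HH13A": 1, "HH13B": 1, "HH14": 0, "HH15": 0},  # invented
}
found = {
    1: {"HH12": 4, "HH13": 3, "HH13A": 2, "HH13B": 1, "HH14": 2, "HH15": 2},
    2: {"HH12": 1, "HH13": 1, "HH13A": 0, "HH13B": 0, "HH14": 0, "HH15": 0},  # invented
}

for problem in household_problems(panel, found):
    print(problem)
```

Here household 1 is consistent, while household 2 is missing a men's questionnaire, which is where the supervisor would start looking.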
Verification Flow Chart
1. Verification Data Entry (Data Entry Operator 2)
2. Verification (DP Supervisor)
3. Differences?
   • Yes: Determine Correct Values (Data Entry Operator 1 & 2), then Correct Both Data Files (Data Entry Operator 1 & 2), then return to step 2
   • No: Backup Raw Data File (DP Supervisor)
Verification
• Purpose: Eliminate data entry errors
• Method:
   1. Program compares main and verification data files
   2. Program produces a list of differences
   3. Data entry operators use the listing and the questionnaires to determine corrections
   4. Corrections are marked on the listing
   5. BOTH data files are corrected by the original operators
   6. Repeat steps 1-5 until the listing shows no differences
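The compare step in the method above can be sketched as follows. This is not the actual comparison program: it assumes each data file has already been read into a dictionary keyed by case id (a hypothetical in-memory layout), and it reuses the first case of the sample difference listing as example data.

```python
def compare_files(main, verification):
    """Return (case_id, item, main value, verification value) for every difference.

    Both arguments map case id -> {item: value}; a value missing from one
    file is reported as None.
    """
    differences = []
    for case_id in sorted(set(main) | set(verification)):
        main_case = main.get(case_id, {})
        veri_case = verification.get(case_id, {})
        for item in sorted(set(main_case) | set(veri_case)):
            main_value = main_case.get(item)
            veri_value = veri_case.get(item)
            if main_value != veri_value:
                differences.append((case_id, item, main_value, veri_value))
    return differences


main = {"00301": {"ED4(6)": "202", "ED4B(6)": "02", "HL4(9)": "2"}}
verification = {"00301": {"ED4(6)": "204", "ED4B(6)": "04", "HL4(9)": "1"}}

listing = compare_files(main, verification)
for row in listing:
    print(row)

# Suppose the questionnaires show the verification values are correct.
# BOTH files are corrected, then the comparison is repeated.
for case_id, item, _, veri_value in listing:
    main[case_id][item] = veri_value
    verification[case_id][item] = veri_value

print(compare_files(main, verification))  # an empty list ends the loop
```

In practice the correct value may be in either file, or in neither, which is why the operators go back to the questionnaire rather than copying one file over the other.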
Sample Difference Listing
Input File:     NETPROJ\DATA\M003.DAT
Reference File: NETPROJ\VERI\V003.DAT
------------------------------------------------------
Case Id      Item       Input File    Reference File
------------------------------------------------------
[00301 ]     ED4(6)     202           204
             ED4B(6)    02            04
             HL4(9)     2             1
[0030404]    IM1        1             2
             IM3B       31032006
             IM3BD      31
             IM3BM      03
             IM3BY      2006
Verification Application
• The file compare.cmp defines which items (i.e., dictionary items) will be compared
• It is strongly recommended that this file not be changed
Option I. Enter Global Positioning System (GPS) Data
• This option allows the data-processing supervisor to enter GPS location data with the gpsentry.ent application
• The application allows the data-processing supervisor to enter as many clusters at a time as he or she would like
• The application requires the data-processing supervisor to enter the GPS data twice as a check against keying errors
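The double-entry rule for GPS data follows the same idea as verification: a record is accepted only when both keyings agree. The sketch below illustrates that idea only; it is not how gpsentry.ent is implemented, and the field names and coordinates are invented.

```python
# Hypothetical field names; the real application's items may differ.
GPS_FIELDS = ("cluster", "latitude", "longitude")


def keying_differences(first_pass, second_pass):
    """Return the names of the fields whose two keyed values differ."""
    return [f for f in GPS_FIELDS if first_pass[f] != second_pass[f]]


# Invented entries: the second keying has a typo in the longitude.
first_pass = {"cluster": "003", "latitude": "-1.28333", "longitude": "36.81667"}
second_pass = {"cluster": "003", "latitude": "-1.28333", "longitude": "36.81677"}

differing = keying_differences(first_pass, second_pass)
if differing:
    print("Re-enter:", ", ".join(differing))
else:
    print("Entries match")
```

Values are compared as keyed strings rather than as numbers, so a difference in any digit is caught.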
Option J. Modify GPS Data
• This option allows the data-processing supervisor to modify GPS location data by executing the gpsentry.ent application