160 likes | 309 Views
Data Cleaning Netherlands: type of payments received and amounts mentioned. Manon de Groot Antwerp, 6 February 2008. EP module: situation. EP078 EP071: type(s) of payment(s) received. EP078: amount(s) received for those payment(s). EP094: EP089: type(s) of payment (s) received
E N D
Data Cleaning Netherlands: type of payments received and amounts mentioned Manon de Groot Antwerp, 6 February 2008
EP module: situation • EP078 • EP071: type(s) of payment(s) received. • EP078: amount(s) received for those payment(s). • EP094: • EP089: type(s) of payment (s) received • EP094: amount(s) received for those payment(s).
Situation • Respondents indicate which payment(s) they have received in the past year (e.g., EP071, EP089). • Respondents indicatethe amount they have received for all mentioned payments in EP071. • If payment mentioned in EP071 corresponding EP078 value > 0. • If payment NOT mentioned in EP071 EP078 value(s) missing. At the moment: • EP071: order given by respondent (1st mentioned, 2nd, 3rd, etc...) • EP078: order given by showcards. • Category mentioned in EP071_1 does not necessarily correspond to first category on showcard (EP078_1 ), etc. Thus, EP071 is NOT dummy coded.
Warning • Respondents are ONLY asked for amouns for payments mentioned in EP071. Consequenlty, EP078 values of zero are not allowed. Example: • Respondent mentions ‘public old-age pension’ in EP071 EP078_1 is assigned a value > 0. • Respondent DOES NOT mention ‘public old-age pension’ in EP071 EP078_1 is assigned ‘missing value’. But (at least for the Netherlands): • Some EP078 values are zero! • Payment NOT mentioned in EP071? • Zero should be coded ‘missing’?.
EP071: number of payments mentioned 1 2 3 4 5 6 7 People mentioned up to 7 payments (all countries together) . People mentioned up to 3 payments in the Netherlands. Note that these categories are NOT in showcard order. So, 2678 respondents mentioned a first payment, 65 a second payment, and 2 mentioned a third one.
EP078: number of categories to choose from 1 2 3 4 5 6 7 8 9 10 11 12 13 …from in total 13 categories (all countries together) In the Netherlands 7 categories to choose from. Note that these categories are in showcard order
EP078 problem: zero values 1 3 4 6 7 10 11 5 problems
What should be done Remember: • Zero values for EP078 are not allowed. EP078 are values for the payments mentioned in EP071. Thus, values for EP078 must be greater than zero (payment applicable) or missing (payment not applicable). What to do: • Select cases with zero values on EP078 variable(s). • For this selection and EP078: make a listing of the selected EP078 variable(s) and all EP071 values (that apply in your country). • Identity which EP071 variable(s) is (are) associated with the zero value(s).
Example EP078_1 Syntax (SPSS) temporary. select if (ep078_1 = 0). list variables ep078_1 ep071_1 ep071_2 ep071_3 ep071_4 ep071_5 ep071_6 ep071_7.
Example EP078_1 1st-mentioned payment, 2nd-mentioned payment, etc. EP078_1 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . Number of cases read: 7 Number of cases listed: 7 Zero euros for showcard category 1 (public old-age pension) 1st-mentioned payment is category 1 (public old-age pension) from showcard BUT…
Example EP078_3 EP078_3 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 ,00 3 . . . . . . ,00 3 . . . . . . ,00 3 . . . . . . Number of cases read: 3 Number of cases listed: 3 Zero euros for showcard category 3 (Public early retirement or pre-retirement pension). 1st-mentioned payment is category 3. BUT…
Example EP078_4 EP078_4 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 ,00 4 . . . . . . ,00 4 . . . . . . ,00 4 . . . . . . Number of cases read: 3 Number of cases listed: 3 Zero euros for showcard category 4 (Main public disability insurance pension, or sickness benefits). 1st-mentioned payment is category 4. BUT…
Example EP078_6 Zero euros for showcard category 6 (Public unemployment benefit or insurance). 1st-mentioned payment is category 6. BUT… EP078_6 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 ,00 1 6 . . . . . ,00 6 4 . . . . . Number of cases read: 2 Number of cases listed: 2 Zero euros for showcard category 6 (Public unemployment benefit or insurance). 1st-mentioned payment is category 6. BUT…
Wrong vs. Right approach • Wrong approach (assuming correspondence) • Wrong result temporary. select if (ep078_6 = 0). list variables ep078_6 ep071_6. EP078_6 EP071_6 .00 . .00 . Number of cases read: 2 Number of cases listed: 2 Wrong conclusion: Recode ‘.00’ to ‘missing’ (because category 6 was NOT mentioned)
Correct approach (not assuming correspondence) • Correct result temporary. select if (ep078_6 = 0). list variables ep078_6 ep071_1 ep071_2 ep071_3 ep071_4 ep071_5 ep071_6 ep071_7. EP078_6 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 .00 1 6 . . . . . .00 6 4 . . . . . Number of cases read: 2 Number of cases listed: 2 Correct conclusion: Assign non-zero values to EP078_6 (because category 6 was mentioned)
Conclusions • Do not confuse ‘showcard order’ (EP078, EP094) of payment categories with ‘respondent order’ (EP071, EP089) of mentioned payment categories. • Check ‘related’ variables in combination with each other. E.g: • Check EP071 icw. EP078. • Check EP089 icw. EP094.