html5
1 / 16

Data Cleaning Netherlands: type of payments received and amounts mentioned

Data Cleaning Netherlands: type of payments received and amounts mentioned. Manon de Groot Antwerp, 6 February 2008. EP module: situation. EP078 EP071: type(s) of payment(s) received. EP078: amount(s) received for those payment(s). EP094: EP089: type(s) of payment (s) received

bern
Télécharger la présentation

Data Cleaning Netherlands: type of payments received and amounts mentioned

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Cleaning Netherlands: type of payments received and amounts mentioned Manon de Groot Antwerp, 6 February 2008

  2. EP module: situation • EP078 • EP071: type(s) of payment(s) received. • EP078: amount(s) received for those payment(s). • EP094: • EP089: type(s) of payment (s) received • EP094: amount(s) received for those payment(s).

  3. Situation • Respondents indicate which payment(s) they have received in the past year (e.g., EP071, EP089). • Respondents indicatethe amount they have received for all mentioned payments in EP071. • If payment mentioned in EP071  corresponding EP078 value > 0. • If payment NOT mentioned in EP071  EP078 value(s) missing. At the moment: • EP071: order given by respondent (1st mentioned, 2nd, 3rd, etc...) • EP078: order given by showcards. • Category mentioned in EP071_1 does not necessarily correspond to first category on showcard (EP078_1 ), etc. Thus, EP071 is NOT dummy coded.

  4. Warning • Respondents are ONLY asked for amouns for payments mentioned in EP071. Consequenlty, EP078 values of zero are not allowed. Example: • Respondent mentions ‘public old-age pension’ in EP071  EP078_1 is assigned a value > 0. • Respondent DOES NOT mention ‘public old-age pension’ in EP071  EP078_1 is assigned ‘missing value’. But (at least for the Netherlands): • Some EP078 values are zero! • Payment NOT mentioned in EP071? • Zero should be coded ‘missing’?.

  5. EP071: number of payments mentioned 1 2 3 4 5 6 7 People mentioned up to 7 payments (all countries together) . People mentioned up to 3 payments in the Netherlands. Note that these categories are NOT in showcard order. So, 2678 respondents mentioned a first payment, 65 a second payment, and 2 mentioned a third one.

  6. EP078: number of categories to choose from 1 2 3 4 5 6 7 8 9 10 11 12 13 …from in total 13 categories (all countries together) In the Netherlands 7 categories to choose from. Note that these categories are in showcard order

  7. EP078 problem: zero values 1 3 4 6 7 10 11 5 problems

  8. What should be done Remember: • Zero values for EP078 are not allowed. EP078 are values for the payments mentioned in EP071. Thus, values for EP078 must be greater than zero (payment applicable) or missing (payment not applicable). What to do: • Select cases with zero values on EP078 variable(s). • For this selection and EP078: make a listing of the selected EP078 variable(s) and all EP071 values (that apply in your country). • Identity which EP071 variable(s) is (are) associated with the zero value(s).

  9. Example EP078_1 Syntax (SPSS) temporary. select if (ep078_1 = 0). list variables ep078_1 ep071_1 ep071_2 ep071_3 ep071_4 ep071_5 ep071_6 ep071_7.

  10. Example EP078_1 1st-mentioned payment, 2nd-mentioned payment, etc. EP078_1 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . ,00 1 . . . . . . Number of cases read: 7 Number of cases listed: 7 Zero euros for showcard category 1 (public old-age pension) 1st-mentioned payment is category 1 (public old-age pension) from showcard BUT…

  11. Example EP078_3 EP078_3 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 ,00 3 . . . . . . ,00 3 . . . . . . ,00 3 . . . . . . Number of cases read: 3 Number of cases listed: 3 Zero euros for showcard category 3 (Public early retirement or pre-retirement pension). 1st-mentioned payment is category 3. BUT…

  12. Example EP078_4 EP078_4 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 ,00 4 . . . . . . ,00 4 . . . . . . ,00 4 . . . . . . Number of cases read: 3 Number of cases listed: 3 Zero euros for showcard category 4 (Main public disability insurance pension, or sickness benefits). 1st-mentioned payment is category 4. BUT…

  13. Example EP078_6 Zero euros for showcard category 6 (Public unemployment benefit or insurance). 1st-mentioned payment is category 6. BUT… EP078_6 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 ,00 1 6 . . . . . ,00 6 4 . . . . . Number of cases read: 2 Number of cases listed: 2 Zero euros for showcard category 6 (Public unemployment benefit or insurance). 1st-mentioned payment is category 6. BUT…

  14. Wrong vs. Right approach • Wrong approach (assuming correspondence) • Wrong result temporary. select if (ep078_6 = 0). list variables ep078_6 ep071_6. EP078_6 EP071_6 .00 . .00 . Number of cases read: 2 Number of cases listed: 2 Wrong conclusion: Recode ‘.00’ to ‘missing’ (because category 6 was NOT mentioned)

  15. Correct approach (not assuming correspondence) • Correct result temporary. select if (ep078_6 = 0). list variables ep078_6 ep071_1 ep071_2 ep071_3 ep071_4 ep071_5 ep071_6 ep071_7. EP078_6 EP071_1 EP071_2 EP071_3 EP071_4 EP071_5 EP071_6 EP071_7 .00 1 6 . . . . . .00 6 4 . . . . . Number of cases read: 2 Number of cases listed: 2 Correct conclusion: Assign non-zero values to EP078_6 (because category 6 was mentioned)

  16. Conclusions • Do not confuse ‘showcard order’ (EP078, EP094) of payment categories with ‘respondent order’ (EP071, EP089) of mentioned payment categories. • Check ‘related’ variables in combination with each other. E.g: • Check EP071 icw. EP078. • Check EP089 icw. EP094.

More Related