MIS 2007 Analysis Methods
MIS 2007 Analysis Methods. Weighting and Data Analysis Allen HIghtower, CDC Kenya. Weighting and Data Analysis. MIS sample design and weighting Calculation of weights Adjustment for non-response Normalized/standardized weights Data analysis. MIS Multi-Stage Stratified Cluster Design.
MIS 2007 Analysis Methods
E N D
Presentation Transcript
MIS 2007Analysis Methods Weighting and Data Analysis Allen HIghtower, CDC Kenya RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Weighting and Data Analysis • MIS sample design and weighting • Calculation of weights • Adjustment for non-response • Normalized/standardized weights • Data analysis RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
MIS Multi-Stage Stratified Cluster Design RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Multi-stage cluster samples • Unequal probability of selection • difference in probability of selection • difference in behaviors across sub-groups • biased results • Simple random sample • equal probability of selection • self weighting RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Why Weight? • To correct for unequal probability of selection • To produce results that are representative of the larger population from which the sample was drawn • To adjust for non–response RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Steps in Calculating Sampling Weights • Calculate probabilities of selection • Convert the probabilities into weights • Normalize or standardize the weights in order to reflect the correct sample size RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Calculating Probabilities of Selection • The probability of selection depends upon the sample design used • For the MIS three-stage sample design the probability of selection is the joint probability of selection at the three stages of sample selection • Pi = Pa * Pb * Pc RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Probabilities of selection • Pa = P cluster sleeted in NASSEP IV # Cs selected / # Cs available • Pb = P cluster selected in MIS # Cs selected / # Cs available • Pc = P household selected in MIS # HHs selected / # HHs available • Pi = Overall P HH selected in MIS for cluster i RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Calculating the Sampling Weight • The sampling weight assigned to a household in a given cluster is the inverse of its probability of selection • Wi= 1/Pi • Wi = weight for households in the ith cluster • Pi = probability of selection for households in the ith cluster RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Standardized/Normalized Weights • Standardized weights are used to avoid generating incorrect standard errors and confidence intervals • Weights are standardized to the sample size and have a mean of 1.0. RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Calculating Standardized Weights • wi' = wi n / wi ni where • wi’ = standardized weight for cluster i • wi = sampling weight for cluster i • n = total sample size • ni = sample size for cluster i • The sum should equal the number sampled. RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Adjusting weights for non-response • Consider adjusting weights for non-response at the household and individual level - when non-response is higher than expected - when response rates differ by demographic characteristics – sex, education ... RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Calculating non-response adjustment • The non-response adjustment factor (A) is the inverse of the response rate (r) Ai = 1/ri The final weight adjusted for non-response is the product of the standardized weight and the adjustment factor Final Weight = wi' * Ai RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
MIS Weights • Household weights = HW • Individual weights = HW /HRR • Lab weights = IW */LRR • HRR= HH response rate • LRR= Lab response rate RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Data Analysis • Software programs designed for complex surveys should be used to analyze MIS data. • SUDAAN, SAS, SPSS, and Stata have procedures to account for the multi-stage stratified sampling design, and produce reliable standard errors and confidence intervals. RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Sample SUDAAN Code • PROC CROSSTAB DATA=MIS1 filetype=sas design=WR; • NEST strata1 qhnassep /psulev=2 missunit; • WEIGHT Weight; • TABLE Sex*Parasite; • CLASS Sex Parasite; • RUN; RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Sample SAS Code • PROC SURVEYFREQ data=MIS1; • WEIGHT weight; • STRATA strata1; • CLUSTER qhnassep; *** EA ID variable **; • TABLES parasite / cl row deff; *** Y/N Parasitemia **; • RUN; RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia
Sample Data RBM-MERG Malaria Indicator Training Workshop, 9th-12th September 2008, Lusaka, Zambia