
Matching of administrative data to validate the 2011 Census in England and Wales
NRS & RSS Edinburgh, October 2012


Presentation Transcript


  1. Matching of administrative data to validate the 2011 Census in England and Wales. NRS & RSS Edinburgh, October 2012

  2. AGENDA
     • Context: 2011 Census quality assurance and the role of administrative data
     • Data matching challenges and solutions
     • Data to be matched
     • Matching methods and interpretation
     • Substantive results so far...

  3. An overview of the methods
     [Diagram: three columns headed Method, Product and Quality assurance. Labels in extraction order: DSE; Bias adj; Overcount; 5 yr age/sex, CCS areas; Core checks; 5 yr age/sex, EA/LA level; Ratio estimator; Nat adj; Supplementary analysis; 1 yr age/sex, OA level; Coverage imputation; QA Review and sign-off; First Release; Main QA Panel; High Level QA Panel.]
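As a reading aid for the "DSE" label on this slide, taken here to mean dual system estimation, the following is a minimal sketch of the textbook two-list estimator. The function and parameter names are hypothetical, and the sketch leaves out the bias and overcount adjustments the slide lists; it is not a description of the estimation actually carried out.

```python
def dual_system_estimate(census_count, survey_count, matched_count):
    """Textbook two-list (Lincoln-Petersen) population estimate.

    census_count  : people counted by the census in an area
    survey_count  : people counted by the independent coverage survey
    matched_count : people found in both lists after matching
    All three names are illustrative assumptions, not taken from the slides.
    """
    if matched_count <= 0:
        raise ValueError("at least one matched record is needed")
    return census_count * survey_count / matched_count


if __name__ == "__main__":
    # Illustrative numbers only: 900 census, 100 survey, 90 in both.
    print(dual_system_estimate(900, 100, 90))
```

The estimate depends directly on the number of matched records, which is why match quality matters for coverage estimation.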

  4. Challenges and solutions

  5. Data to be matched

  6. Methods
     • Data cleaning, de-duplication, standardisation, quality analysis
     • Definitional alignment with Census enumeration base
     • Exact matching (dwelling: address; person: name, DoB, gender and postcode)
     • Score-based address matching
     • Probabilistic person matching
     • Clerical resolution of candidate pairs from automatch
     • Clerical search for unmatched residuals
     • Resolution of unmatched residuals against the Address Register History file and Census ‘associated addresses’
     • Evidence-based assessment of residuals
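The standardisation, exact matching and probabilistic matching steps listed above can be sketched roughly as follows. This is an illustration under stated assumptions, not the matching system used for the Census: the `Person` record, the `standardise` rules and the agreement probabilities in `M_U` are all hypothetical, and the scoring follows the general Fellegi-Sunter style of summing log-likelihood-ratio weights, which the slides do not confirm as the method used.

```python
import math
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Person:
    # Hypothetical record layout using the four fields named on the slide.
    name: str
    dob: str       # ISO date string, e.g. "1990-04-27"
    gender: str
    postcode: str


def standardise(p: Person) -> Person:
    """Normalise case, punctuation and whitespace before comparison."""
    name = re.sub(r"[^a-z ]", "", p.name.lower())
    name = " ".join(name.split())
    postcode = re.sub(r"\s+", "", p.postcode.upper())
    return Person(name, p.dob.strip(), p.gender.strip().upper()[:1], postcode)


def exact_match(census, register):
    """Pair records that agree on all four standardised fields.

    Census records with no hit, or with more than one hit, are returned
    as residuals for later probabilistic or clerical resolution.
    """
    index = {}
    for r in register:
        index.setdefault(standardise(r), []).append(r)
    pairs, residual = [], []
    for c in census:
        hits = index.get(standardise(c), [])
        if len(hits) == 1:
            pairs.append((c, hits[0]))
        else:
            residual.append(c)
    return pairs, residual


# Assumed (m, u) probabilities per field, for illustration only:
# m = P(fields agree | same person), u = P(fields agree | different people).
M_U = {
    "name": (0.95, 0.01),
    "dob": (0.97, 0.003),
    "gender": (0.99, 0.5),
    "postcode": (0.85, 0.001),
}


def match_score(a: Person, b: Person) -> float:
    """Sum of log2 agreement/disagreement weights across the four fields."""
    a, b = standardise(a), standardise(b)
    score = 0.0
    for field, (m, u) in M_U.items():
        if getattr(a, field) == getattr(b, field):
            score += math.log2(m / u)
        else:
            score += math.log2((1 - m) / (1 - u))
    return score
```

In a scheme like this, pairs scoring above an upper threshold would be accepted automatically, pairs below a lower threshold rejected, and those in between passed to clerical resolution. The thresholds themselves would need to be tuned on the data and are not given in the slides.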

  7. Interpretation: Who is actually present?

  8. Match rates in a ‘control’ LA

  9. Female outcomes in a ‘control’ LA

  10. Male outcomes in a ‘control’ LA

  11. Match results in university towns

  12. University town: female outcomes

  13. University town: male outcomes

  14. London: population churn

  15. London churn: female outcomes

  16. London churn: male outcomes

  17. London LA: implied sex ratios

  18. Data mining to address specific Census/PR anomalies

  19. Female students living in halls in April 2011 by NHS Authority acceptance date

  20. Male students living in halls in April 2011 by NHS Authority acceptance date

  21. LA summary: proportion of F4s and proportion unresolved, within CCS postcode clusters

  22. LA summary: concentration of Flag 4s in the PR residual

  23. LA summary: LA types, residual size and Flag 4s

  24. Further investigations
     • Planned analysis of the PR residuals’ addresses and households to identify ‘ghost’ records
     • Longitudinal matching of the 2012 Patient Register to 2011 data to identify registrations that have been cancelled by GP practices in the year following Census
     • Cluster analysis of all E&W LAs to see whether the typology of LAs identified through matching is mirrored in list inflation patterns nationally
     • Multi-level modelling to summarise results, with individual and area level explanatory variables
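The longitudinal matching idea above can be sketched as a comparison of two register extracts. This is a minimal illustration that assumes each record carries a stable identifier under a hypothetical `"id"` key; the slides do not describe the actual linkage key or file layout.

```python
def cancelled_registrations(pr_2011, pr_2012):
    """Return 2011 records whose identifier is absent from the 2012 extract.

    Both arguments are lists of dicts with a hypothetical "id" key.
    """
    still_registered = {rec["id"] for rec in pr_2012}
    return [rec for rec in pr_2011 if rec["id"] not in still_registered]


if __name__ == "__main__":
    # Illustrative identifiers only.
    pr_2011 = [{"id": "A1"}, {"id": "B2"}, {"id": "C3"}]
    pr_2012 = [{"id": "A1"}, {"id": "C3"}, {"id": "D4"}]
    print(cancelled_registrations(pr_2011, pr_2012))
```

Absence from the later extract is only a proxy for cancellation by a GP practice: a record may also drop out because of death, emigration or a move, so the result would need to be checked against other evidence.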
