1 / 14

Section 4.4: Contingency Tables and Association

Section 4.4: Contingency Tables and Association. Contingency table What and why a contingen cy table Marginal distribution Conditional distribution Simpson’s Paradox What is it? What causes it?. Contingency tables are for summarizing bivariate (or multivariate) qualitative data .

kineta
Télécharger la présentation

Section 4.4: Contingency Tables and Association

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Section 4.4:Contingency Tables and Association • Contingency table • What and why a contingency table • Marginal distribution • Conditional distribution • Simpson’s Paradox • What is it? • What causes it?

  2. Contingency tables are for summarizing bivariate (or multivariate) qualitative data. sex height shoe eyes hair hand male 70 9 brown brown right male 71 11 blue blond left male 73 11.5 blue blond right female 64 7 brown black right male 66 7.5 brown lightbrown right female 63 6.5 brown black right female 64 6.5 blue red right male 72 10 brown blond left male 66 8.5 green lightbrown right female 67 8 brown lightbrown right male 74 11.5 brown brown left male 72 12 blue brown right female 68 8.5 blue lightbrown right male 78 12 blue blond right male 70 12 green blond right female 68 8 blue red both female 68 9.5 green brown left female 66 7 blue blond right male 66 10 brown brown right :::: :: :: ::::: ::::: :::::

  3. Contingency table results:Rows: eyesColumns: hair

  4. Contingency table results:Rows: hairColumns: eyes Often it is arbitrary which variable gets to be the row variable.

  5. Displaying three variables (sex, eye color, hair color). We will focus on two variables. Contingency table results for sex=female:Rows: eyesColumns: hair Contingency table results for sex=male

  6. The 793 adult male passenger survival, by 1st class, 2nd class, and 3rd class fares: http://www.encyclopedia-titanica.org/titanic-statistics.html

  7. Relative Frequency marginal distribution: (in parentheses) • Margins show relative amount in each row or column • Add to one.

  8. Conditional Distribution Either rows or columns add to one (100%). Percentages conditioned on survival status

  9. Percentages conditioned on passenger class

  10. What proportion of passengers were women & children? What proportion of the passengers were lost? What proportion of the women & children were lost? Of the passengers who were lost, what proportion of the passengers were women and children?

  11. Simpson’s Paradox: Example Hypothetical graduate school acceptance data: Men do better

  12. But if a third variable is accounted for the story changes… Women actually do better

  13. Why the change?

  14. Simpson’s Paradox represents a situation in which an association between two variables inverts or goes away when a third variable is introduced to the analysis. See: http://users.humboldt.edu/rizzardi/Handouts.dir/SimpsonParadoxExample.xlsx

More Related