750 likes | 770 Vues
Epidemiology 719 Quantitative methods in genetic epidemiology Bhramar Mukherjee and Sebastian Zoellner bhramar@umich.edu szoellne@umich.edu. Acknowledgements. Peter Kraft (HSPH) Ken Rice (UW) Nilanjan Chatterjee (NCI) Stephen Channock (NCI) Lu Wang (UM) Nan Laird (HSPH)
E N D
Epidemiology 719 Quantitative methods in genetic epidemiology Bhramar Mukherjee and Sebastian Zoellner bhramar@umich.eduszoellne@umich.edu
Acknowledgements • Peter Kraft (HSPH) • Ken Rice (UW) • Nilanjan Chatterjee (NCI) • Stephen Channock (NCI) • Lu Wang (UM) • Nan Laird (HSPH) • Goncalo Abecasis (UM)
A brave new world Course Overview
Central Course Theme Genetic Association and Gene-Environment Interaction
Assigned Paper 1 • GWAS of Age-related macular degeneration • Initial GWAS identified four loci explaining one-half of the heritability. Appreciable predictive power. • Additional GWAS to explain remaining heritability. Combined scan vs replication. Meta-Analysis.
Assigned Paper 2 Collaborative Association Study of Psoriasis Examined ~1,500 cases / ~1,500 controls at ~500,000 SNPs • Examined 20 promising SNPs in extra ~5,000 cases / ~5,000 controls Outcome: 7 regions of confirmed association with psoriasis
Assigned Paper 3 • Meta-analysis of colorectal cancer (COGENT study) . • A thorough evaluation of ten confirmed loci for colorectal cancer. Very detailed. Supplementary material also available online. • Interesting combination of various study design.
Depends on study design • Case-control study • Family-based study: case-parent triad, case-sib pairs being popular choices • Longitudinal Cohort Study • Looking at a secondary outcome under case-control sampling
Primary Analysis • Single marker association tests • Genetic susceptibility model - Dominant, recessive, co-dominant • Which test to use • Multiple testing correction
Pros and Cons • Simple, Complete. • Robust to misspecification of the true dominance pattern • Less powerful. • Unreliable for sparse table
Pros and Cons • Test statistic has single df, so more powerful. • Simple to report. • Not robust to true mode of dominance • Does not present entire information in the data.
Armitage’s trend test • Test linear trend in log(OR) with # A allele • Test statistic still has single d.f. • Simplicity, use information from the 2 x 3 array • More robust than 2 x 2 tests, but less robust than the 2 d.f. test.
Allelic test • Previous tests were based on genotype • Can also treat allele as the unit of observation. • You have doubled the sample size!!
But… • Serious impact on Type 1 error under departures from HWE • Interpretation becomes trickier.
Example AIC: Akaike information criterion, lower the value, better is model fit
Using logistic regression • Trick: Just code genotype differently • Dominant: G=1 if AA or Aa, 0 otherwise • Recessive: G=1 if AA, 0 otherwise • Trend: G=# A alleles, thus G=2 if AA, =1 if Aa and 0 if aa • Two df test: Create two dummy variables: G1=1 if Aa and 0 otherwise G2=1 if AA and 0 otherwise Perform likelihood ratio test of full (G1 and G2) vs reduced model (No G1, G2). • Adjust for other variables, fit a multivariate model