290 likes | 431 Vues
HSRP 734: Advanced Statistical Methods July 31, 2008. Objectives. Describe the general form of the Cox proportional hazards model extended for time-dependent variables Describe the analysis for staggered entry Review for Final exam. Time-Dependent Variables.
E N D
Objectives • Describe the general form of the Cox proportional hazards model extended for time-dependent variables • Describe the analysis for staggered entry • Review for Final exam
Time-Dependent Variables • Time-dependent variable: covariate whose value may vary over time. • Two options if the proportional hazards assumption is not satisfied for one or more of the predictors in the model. • Use a stratified Cox model • Use time-dependent variables
Time-Dependent Variables • Time-dependent variables may be: • Inherently time-dependent • Internal – only have meaning when subject is alive • smoking status at time t • white blood count at time t • External – can be obtained whether or not subject is alive • Air pollution index at time t • Part internal and part ancillary • E.g., heart transplant status at time t • Defined to analyze a time-independent predictor not satisfying the PH assumption • E.g., RACE x time; RACE x log(time+1)
Internal Time-Dependent Variable • Internal time-dependent variables are particularly susceptible to be inappropriately controlled. • They often lie in the causal pathway about which we want to make inferences.
Internal Time-Dependent Variableexample • Clinical trial for treatment of metastatic colorectal cancer – do we adjust for most recent WBC?
Internal Time-Dependent Variableexample • Clinical trial for treatment of metastatic colorectal cancer – do we adjust for most recent WBC? • Treatment comparison among subjects with like prognosis at each time • But treatment might improve prognosis by improving depressed WBC over time • Adjustment for WBC over time might remove the apparent effect of treatment, since patients with the same WBC in either treatment group might have similar prognosis
Extended Cox Model for Time-Dependent Variables • Model • Assumption: • The effect of a time-dependent variable on the survival probability at time t depends on the value of this variable at that same time t. • Statistical inferences: • Wald, Score, Likelihood ratio tests
Extended Cox Model for Time-Dependent Variables • Even though the values of the time-dependent variable may change over time, the hazard model provides only one coefficient for each time-dependent variable in the model.
Hazard Ratio for the Extended Cox Model • Let X1= smoking yes/no; X2(t) = X1x t • The hazard ratio (or RR) is a function of time. • PH assumption is not satisfied for the extended Cox model
Hazard Ratio for the Extended Cox Model • Coefficient represents the “overall” effect of the corresponding time-dependent variable, considering all times at which this variable has been measured in the study. • Another model with a time-dependent variable • compares an exposed person to an unexposed person at time t.
Time-Dependent Variablesin SAS • Do not define the time-dependent variable in a data step • The variable will be time-independent • Use the programming statements in proc tphreg time depend example.sas
Left Truncation of Failure Times • Also know as staggered entry • Left truncation arises when individuals only come under observation some known time after the natural time origin of the phenomenon under study.
Left Truncation Examples • Ex 1 Atomic bomb survivors study • Time zero is August 1945 – time is time since radiation exposure • Observation of subjects begins with the 1950 census • People who died before 1950 are not in the sample - survival times are left truncated at 5 years
Left Truncation Examples Ex 2 Welsh nickel refiners • Time zero is employee’s start date - all were before 1925 • Observation of most subjects begin in 1934, some in 1939, 1944, or 1949 • In contrast to example 1, each subject has his own truncation time i.e. staggered entry
Left Truncation • The risk set just prior to an event time does not include individuals whose left truncation times exceed the given event time. Thus, any contribution to the likelihood must be conditional on the truncation limit having been exceeded.
Left Truncation • Please do not confuse this with left censoring • Recall – left censoring occurs when the true survival time is less than what we observed • We may not know a left censored participant's exact survival time, but at least we know he/she existed; i.e. he/she did get observed • In a staggered entry situation, we may not know how many participants we missed.
Implications of left truncation Ex. 1 • We have no way of making inferences about risk of death before 5 years • In a Cox model, if there are different relationships between the covariates and λ(t) when t<5 and when t>5, we have no way to detect this.
Implications of staggered entry Ex. 2 • Any subject in the cohort had to survive from initial employment to beginning of observation • If we ignore this in a Cox model, we will compare the covariates of subject 2 (for example) to all other subjects • This is not fair. There would be subjects in the denominator who could not possible be in the numerator
Solution • At each event time, include in the risk set only those subjects who have not yet died and who are under observation • Risk sets are not necessarily nested and can get bigger as time progresses • Every inferential statement we make must be made conditional on surviving to beginning of observation
Solution – main assumption • The sampling process leading to late entry into the sample does not preferentially select subjects with unusual risks or covariate values
? • How are the coefficient estimates from a Cox model for example 1 (atomic bomb survivors study) different if we correct for left truncation from those from an uncorrected model?
? • How are the coefficient estimates from a Cox model for example 1 (atomic bomb survivors study) different if we correct for left truncation from those from an uncorrected model? • Answer: They do not change. No one fails until after everyone has entered. The risk sets do not change.
? 2 • What changes in example 2 (Welsh nickel refiners) if instead of correcting for left truncation we change the time scale to be time since each subject’s entry into observation? • This is not the same as accounting for left truncation
? 2 • What changes in example 2 (Welsh nickel refiners) if instead of correcting for left truncation we change the time scale to be time since each subject’s entry into observation? • Answer The risk set compositions change. Thus, the coefficient estimates and hazard function changes.
Left Truncation • Coding example from SAS manual proc tphreg data=one; model t2*dead(0)=x1-x10/entry=t1; baseline out=out1 survival=s; run;