1 / 20

Implementation of Quasi-Least Squares using xtgee in Stata

Implementation of Quasi-Least Squares using xtgee in Stata. Justine Shults Assistant Professor of Biostatistics Department of Biostatistics University of Pennsylvania School of Medicine. Outline. Motivational Study -Traditional Approach (GEE) Describe quasi-least squares (QLS)

arodrigues
Télécharger la présentation

Implementation of Quasi-Least Squares using xtgee in Stata

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Implementation of Quasi-Least Squares using xtgee in Stata Justine Shults Assistant Professor of Biostatistics Department of Biostatistics University of Pennsylvania School of Medicine

  2. Outline • Motivational Study -Traditional Approach (GEE) • Describe quasi-least squares (QLS) -Advantages • Implementation of QLS using xtgee • Example • Conclusion Broad Overview of Work in Progress (Papers to be submitted to Biometrics and Stata journal)

  3. Motivational Study-Traditional approach (GEE) Study of Interstitial Cystitis (IC) in women IC is a disease of urinary tract Subjects recorded number of 24 hour voids

  4. Timeline for a subject with all measurements: 0 months 3 months 6 months 9 months 12 months xxx xxx xxx xxx __ xxx Bout 1 Bout 2 Bout 3 Bout 4 Bout 5 Time between bouts >>>>> time within bouts

  5. Goal of analysis: Relate covariates with expected number of 24 hour voids. • Should adjust for potential intra-subject correlation. • Traditional approach: generalized estimating equation approach (GEE) of Liang and Zeger (1986).

  6. Limitation of GEE: Can be difficult to implement complex correlation structures. • Banded Toeplitz structure (Mazurick and Landis, Journal of Urology, 2000): • [ 1 r1 r1 r2 r2 r2 r3 r3 r3 r4 r4 r4 r5 r5 r5] • [ r1 1 r1 r2 r2 r2 r3 r3 r3 r4 r4 r4 r5 r5 r5] • [ r1 r1 1 r2 r2 r2 r3 r3 r3 r4 r4 r4 r5 r5 r5] • [ r2 r2 r2 1 r1 r1 r2 r2 r2 r3 r3 r3 r4 r4 r4] • [ r2 r2 r2 r1 1 r1 r2 r2 r2 r3 r3 r3 r4 r4 r4] • [ r2 r2 r2 r1 r1 1 r2 r2 r2 r3 r3 r3 r4 r4 r4] • [ r3 r3 r3 r2 r2 r2 1 r1 r1 r2 r2 r2 r3 r3 r3] • [ r3 r3 r3 r2 r2 r2 r1 1 r1 r2 r2 r2 r3 r3 r3] • [ r3 r3 r3 r2 r2 r2 r1 r1 1 r2 r2 r2 r3 r3 r3] • [ r4 r4 r4 r3 r3 r3 r2 r2 r2 1 r1 r1 r2 r2 r2] • [ r4 r4 r4 r3 r3 r3 r2 r2 r2 r1 1 r1 r2 r2 r2] • [ r4 r4 r4 r3 r3 r3 r2 r2 r2 r1 r1 1 r2 r2 r2] • [ r5 r5 r5 r4 r4 r4 r3 r3 r3 r2 r2 r2 1 r1 r1] • [ r5 r5 r5 r4 r4 r4 r3 r3 r3 r2 r2 r2 r1 1 r1] • [ r5 r5 r5 r4 r4 r4 r3 r3 r3 r2 r2 r2 r1 r1 1]

  7. Allows for decline in correlation with time, but not as severe as for Markov structure • Mazurick and Landis (2000) could only implement in ad-hoc approach with GEE • Will directly implement using Quasi-Least Squares (QLS). • Developed in: Chaganty (JSPI, 1997), Shults and Chaganty (Biometrics,1998), Chaganty and Shults (JSPI, 1999)]

  8. Advantages of QLS • In framework of GEE- extends generalized linear models for correlated data • Guarantees feasible estimates for some structures, i.e. positive definite matrix • Allows for easier implementation of some complex correlation structures • e.g. Structure appropriate for data with multiple sources of correlation. Shults, Whitt, Kumanyika (Statistics in Medicine, in press) • Banded Toeplitz structure

  9. Two stage procedure • Based on GEE (Liang and Zeger, 1986). • Stage one: Alternates till convergence: • Estimate Β via GEE estimating equation. • Estimate p by minimizing an objective function (residuals). Solve stage one estimating equation. • Stage two: • Solve stage two equation to update estimate of p . • Obtain final estimate of Β by again solving GEE.

  10. Implementation of QLS using xtgee Basic Idea and Outline for program: important: xtgee allows for implementation of a correlation structure that is fixed and known

  11. Outline for program: • Let p = (r1,r2,r3,r4,r5). • To implement stage one of QLS: • Let p = 0 so that working structure = Identity. • Estimate Β by solving GEE estimating equation via xtgee, with identity correlation structure. • Repeat till convergence: • Obtain Pearson residuals. • Use algorithm based on method of bisection to solve stage one estimating equation for p. • Construct correlation matrix WORK based on updated estimate of p. • Update estimate of Β via xtgee, with correlation structure WORK treated as fixed and known.

  12. Stage two: Use algorithm based on method of bisection to solve stage two estimating equation for p . Construct correlation matrix FINAL based on updated estimate of p. Obtain final estimate of Β via xtgee with fixed and known structure = FINAL.

  13. Nice features of Stata: Matrix commands are helpful in setting up working structure. The xtgee procedure allows for implementation of a structure that is fixed and known. Programming, e.g. bisection method, is straightforward.

  14. Syntax of command • Under development- programs for stats paper done, but not “tied together” • Almost identical to xtgee Simplest syntax: xtqls depvar [varlist], family(family) link(link) corr(correlation) i(varname) t(varname)

  15. Description In Example Depvar = dependent variable 24hourvoid [varlist] = covariates age volumebladder family(family) Poisson link(link) log corr(correlation) Toepband(5,3) i(varname) subjectid t(varname) time

  16. What is different about syntax? Will use xtqls instead of xtgee The list of possible correlation structures will be expanded. Previous list: Independent Exchangeable Autoregressive Stationary Non-stationary Unstructured User-Specified

  17. Will add Toepband(b,v) = Toeplitz banded structure for b bouts with v visits per bout. • In future: will add additional structures.

  18. Grant Number: 1R01CA096885-01A2 PI Name: SHULTS, JUSTINE PI Email: jshults@cceb.upenn.edu Project Title: Longitudinal Analysis for Diverse Populations Aim 5 of Abstract: ….. (5) To implement the methods for analysis (Aim 1) and planning (Aim 2) in Stata programs, for use by other statisticians. Further, to widely disseminate the programs, and their documentation, on a web site developed for this project.

  19. Example • Estimated correlation matrix: r1 = 0.8478 r2 = 0.7165 r3 = 0.6831 r4 = 0.6653 r5 = 0.6654 This does suggest some decline, but not as severe as that indicated by a Markov structure. Note: This structure includes equicorrelated and Markov Banded as special cases.

  20. Conclusion • QLS is based on GEE, but allows for easier implementation of some complex structures. • Nice features of Stata allow for relatively straightforward programming of QLS. • Am building xtqls procedure based on xtgee. e-mail: jshults@cceb.upenn.edu

More Related