60 likes | 193 Vues
Psychology 202b Advanced Psychological Statistics, II. February 3, 2011. Overview. Multivariate data simulation Added variable plots (review) Partial correlation The problem of collinearity Regression diagnostics: Review of assumption checking Outliers Influence and leverage.
E N D
Psychology 202bAdvanced Psychological Statistics, II February 3, 2011
Overview • Multivariate data simulation • Added variable plots (review) • Partial correlation • The problem of collinearity • Regression diagnostics: • Review of assumption checking • Outliers • Influence and leverage
What does “control” mean? • Controlling or holding constant • Partial relationships and the added variable plot
Collinearity • The problem of collinearity • Formal definition: • Two predictors X1and X2 are collinear if there exist constants c1, c2, and c0 such that c1X1 + c2X2 = c0. • More generally, a set of k predictors is collinear if c1X1 + c2X2 + … + ckXk = c0. • Collinearity is not synonymous with correlation among the predictors.
Why is collinearity a problem? • Hocking and Pendelton’s picket fence. • Implication: when a set of predictors is approximately collinear, estimation becomes unstable and standard errors become large.
Diagnosing collinearity • Correlations may be diagnostic if the data are multivariate normal. • The condition number: • An alternative form removes column means from X. Other options eliminate the intercept column or use a correlation metric.