Mixture Modeling

Chongming Yang, Research Support Center, FHSS College

Presentation Transcript


  1. Mixture Modeling • Chongming Yang • Research Support Center, FHSS College

  2. Mixture of Distributions

  3. Mixture of Distributions

  4. Classification Techniques • Latent Class Analysis (categorical indicators) • Latent Profile Analysis (continuous indicators) • Finite Mixture Modeling (multivariate normal variables) • …

  5. Integrate Classification Models into Other Models • Mixture Factor Analysis • Mixture Regressions • Mixture Structural Equation Modeling • Growth Mixture Modeling • Multilevel Mixture Modeling

  6. Disadvantages of Multistep Practice • Multistep practice: (1) run the classification model, (2) save the membership variable, (3) model the membership variable together with other variables • Disadvantages: biases in parameter estimates; biases in standard errors, which affect significance tests and confidence intervals

  7. Latent Class Analysis (LCA) • Setting: the latent trait is assumed to be categorical and is measured with multiple categorical indicators (examples: drug addiction, schizophrenia) • Aims: identify heterogeneous classes/groups; estimate class probabilities; identify good indicators of the classes; relate covariates to the classes

  8. Graphic LCA Model • Categorical indicators u: u1, u2, u3, …, ur • Categorical latent variable C: C = 1, 2, …, K

  9. Probabilistic Model • Assumption: conditional independence of the indicators u, so that their interdependence is explained by C, as in a factor analysis model • Item probability (conditional on class) • Joint probability of all indicators
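The formulas on this slide did not survive in the transcript. Under the conditional independence assumption stated above, the model is commonly written as below. The symbols \pi_k (class probability) and \rho_{jk} (probability that binary item j is endorsed in class k) are notation assumed here, not taken from the slides, and binary items are assumed for simplicity.

```latex
% Item probability for binary indicator u_j within class k (assumed notation)
P(u_j = 1 \mid C = k) = \rho_{jk}

% Joint probability of a response pattern: a class-weighted sum of
% products, because the items are independent within each class
P(u_1, u_2, \ldots, u_r)
  = \sum_{k=1}^{K} \pi_k \prod_{j=1}^{r}
    \rho_{jk}^{\,u_j} \, (1 - \rho_{jk})^{1 - u_j},
\qquad \sum_{k=1}^{K} \pi_k = 1
```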

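  10. LCA Parameters • Class probabilities: number of classes - 1 free parameters (the probabilities sum to 1) • Item probabilities: number of response categories - 1 free parameters for each item in each class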
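The counting rule above can be sketched as a small helper. This assumes an unrestricted latent class model with no equality constraints; the function name `lca_free_parameters` is made up for illustration.

```python
def lca_free_parameters(n_classes, categories_per_item):
    """Count free parameters in an unrestricted latent class model.

    n_classes: number of latent classes K.
    categories_per_item: number of response categories of each indicator.
    """
    # K class probabilities sum to 1, so only K - 1 are free.
    class_params = n_classes - 1
    # Within each class, an item with m categories has m - 1 free probabilities.
    item_params = n_classes * sum(m - 1 for m in categories_per_item)
    return class_params + item_params


# Two classes measured by four binary indicators.
print(lca_free_parameters(2, [2, 2, 2, 2]))  # prints 9
```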

  11. Class Means (Logit) • Probability scale (logistic regression without any covariates x) • Logit scale • Mean of the last class (highest class number) = 0, so the last class serves as the reference
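The two scales named on this slide are related as follows. The symbol \alpha_k for the class mean (logit) is notation assumed here; the slide's own formulas are not preserved in the transcript.

```latex
% Probability scale: multinomial logistic form with no covariates
P(C = k) = \frac{\exp(\alpha_k)}{\sum_{m=1}^{K} \exp(\alpha_m)},
\qquad \alpha_K = 0

% Logit scale: log odds of class k relative to the last (reference) class
\log \frac{P(C = k)}{P(C = K)} = \alpha_k
```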

  12. Latent Class Analysis with Covariates • Covariates are related to Class Probability with multinomial logistic regression
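A minimal sketch of how covariates map to class probabilities through a multinomial logistic regression, assuming the last class is the reference with all coefficients fixed at zero. The function name and the argument layout are hypothetical.

```python
import math


def class_probabilities(x, intercepts, slopes):
    """Return P(C = k | x) for k = 1..K under a multinomial logistic model.

    x: covariate values for one case.
    intercepts: K - 1 intercepts, one per non-reference class.
    slopes: K - 1 coefficient lists, each aligned with x.
    The last class is the reference (intercept and slopes fixed at 0).
    """
    logits = [a + sum(b * xi for b, xi in zip(bs, x))
              for a, bs in zip(intercepts, slopes)]
    logits.append(0.0)  # reference class
    peak = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(v - peak) for v in logits]
    total = sum(weights)
    return [w / total for w in weights]


# One covariate, two classes: a positive slope raises P(C = 1) as x grows.
print(class_probabilities([1.0], intercepts=[0.0], slopes=[[1.0]]))
```

With an empty covariate list the function reduces to the covariate-free case, where each class probability depends only on its intercept.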

  13. Posterior Probability (membership/classification of cases)
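The posterior class probabilities of a case follow from Bayes' rule: the class probability times the likelihood of the case's responses in that class, divided by the sum of that product over all classes. The sketch below assumes binary indicators and conditional independence; the function name and the parameter values in the usage line are illustrative only.

```python
def posterior_class_probabilities(responses, class_probs, item_probs):
    """Return P(C = k | responses) for one case.

    responses: list of 0/1 answers, one per indicator.
    class_probs: P(C = k) for each class.
    item_probs: item_probs[k][j] = P(u_j = 1 | C = k).
    """
    joint = []
    for pi_k, rho_k in zip(class_probs, item_probs):
        likelihood = 1.0
        for u, rho in zip(responses, rho_k):
            likelihood *= rho if u == 1 else 1.0 - rho
        joint.append(pi_k * likelihood)
    total = sum(joint)  # marginal probability of the response pattern
    return [value / total for value in joint]


# A case endorsing both items is far more likely to belong to class 1.
print(posterior_class_probabilities([1, 1], [0.5, 0.5],
                                    [[0.9, 0.9], [0.1, 0.1]]))
```

Assigning each case to the class with the highest posterior probability gives the membership variable.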

  14. Estimation • Maximum likelihood estimation via the Expectation-Maximization (EM) algorithm • E (expectation) step: compute each case's posterior class probabilities given the current parameter estimates • M (maximization) step: re-estimate the class and item parameters, using the posterior probabilities as weights • Iterate the E and M steps until the likelihood stops increasing
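The E and M steps can be sketched in plain Python for binary indicators. This is an illustrative implementation under stated assumptions (binary items, user-supplied starting values, a single start), not the routine used by Mplus; the function name `fit_lca_em` and the toy data are made up.

```python
import math


def fit_lca_em(data, class_probs, item_probs, max_iter=500, tol=1e-8):
    """Fit a latent class model to 0/1 data with EM.

    data: list of response lists (0/1), one per case.
    class_probs, item_probs: starting values (class_probs must be > 0).
    Returns (class_probs, item_probs, loglik), where loglik is the
    log-likelihood evaluated at the last E step.
    """
    n, r, k = len(data), len(data[0]), len(class_probs)
    previous = -math.inf
    loglik = previous
    for _ in range(max_iter):
        # E step: posterior class probabilities for every case.
        posteriors = []
        loglik = 0.0
        for row in data:
            joint = []
            for c in range(k):
                value = class_probs[c]
                for j in range(r):
                    p = item_probs[c][j]
                    value *= p if row[j] == 1 else 1.0 - p
                joint.append(value)
            total = sum(joint)
            loglik += math.log(total)
            posteriors.append([v / total for v in joint])
        if loglik - previous < tol:  # stop when the likelihood stalls
            break
        previous = loglik
        # M step: posterior-weighted proportions.
        weights = [sum(post[c] for post in posteriors) for c in range(k)]
        class_probs = [w / n for w in weights]
        item_probs = [
            [min(max(sum(posteriors[i][c] * data[i][j] for i in range(n))
                     / weights[c], 1e-6), 1.0 - 1e-6)  # keep away from 0 and 1
             for j in range(r)]
            for c in range(k)
        ]
    return class_probs, item_probs, loglik


# Toy data (made up): two clearly separated response profiles.
toy = [[1, 1, 1]] * 40 + [[0, 0, 0]] * 40 + [[1, 1, 0]] * 10 + [[0, 0, 1]] * 10
print(fit_lca_em(toy, [0.5, 0.5], [[0.7] * 3, [0.3] * 3]))
```

In practice the likelihood can have local maxima, so estimation is usually repeated from several sets of starting values and the solution with the highest log-likelihood is kept.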

  15. Test against Data • O = observed frequency of each response pattern • E = model-estimated frequency of each response pattern • Pearson chi-square • Likelihood-ratio chi-square
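Both fit statistics compare observed and model-estimated response-pattern frequencies. The sketch below assumes the standard definitions, Pearson X² = Σ (O − E)² / E and likelihood-ratio G² = 2 Σ O log(O / E); the function name and the frequencies in the usage line are made up.

```python
import math


def fit_statistics(observed, expected):
    """Return (Pearson chi-square, likelihood-ratio chi-square).

    observed, expected: frequencies of the same response patterns,
    in the same order. Expected frequencies must be positive.
    """
    pearson = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Patterns that were never observed contribute 0 to the LR statistic.
    likelihood_ratio = 2.0 * sum(o * math.log(o / e)
                                 for o, e in zip(observed, expected) if o > 0)
    return pearson, likelihood_ratio


print(fit_statistics([10, 20], [15.0, 15.0]))
```

The degrees of freedom for either statistic are the number of possible response patterns minus the number of free parameters minus 1.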

  16. Determine Number of Classes • Substantive theory (parsimonious, interpretable) • Predictive validity • Auxiliary variables / covariates • Statistical information and tests • Bayesian Information Criterion (BIC) • Entropy • Testing K against K-1 Classes • Vuong-Lo-Mendell-Rubin likelihood-ratio test • Bootstrapped likelihood ratio test

  17. Bayesian Information Criterion (BIC) • BIC = -2 log L + h log N • L = likelihood • h = number of parameters • N = sample size • Choose the model with the smallest BIC • A BIC difference > 4 is appreciable
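A short sketch of the selection rule, assuming the usual definition BIC = -2 log L + h log N with the symbols defined on this slide. The log-likelihoods and parameter counts in the example are hypothetical values chosen only to show the comparison.

```python
import math


def bic(log_likelihood, n_parameters, sample_size):
    """Bayesian Information Criterion: -2 log L + h log N."""
    return -2.0 * log_likelihood + n_parameters * math.log(sample_size)


# Hypothetical fits of 1-, 2- and 3-class models to N = 500 cases:
# (number of classes, log-likelihood, number of free parameters).
candidates = [(1, -1250.0, 4), (2, -1180.0, 9), (3, -1175.0, 14)]
scores = {k: bic(ll, h, 500) for k, ll, h in candidates}
best = min(scores, key=scores.get)
print(scores, "smallest BIC:", best)
```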

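  18. Quality of Classification • Entropy • Summarizes how sharply individuals' posterior class probabilities separate the classes; the average of each individual's highest class probability is a related summary • A value close to 1 indicates good classification • No clear cutoff for acceptance or rejection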
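Two classification-quality summaries can be computed from the matrix of posterior class probabilities. The first function assumes the relative entropy definition commonly reported by mixture software, 1 - Σ_i Σ_k (-p_ik log p_ik) / (N log K); the second is the average highest class probability mentioned on the slide. Function names are illustrative.

```python
import math


def relative_entropy(posteriors):
    """Relative entropy in [0, 1]; 1 means perfectly separated classes.

    posteriors: one list of class probabilities per individual (K >= 2).
    """
    n, k = len(posteriors), len(posteriors[0])
    uncertainty = -sum(p * math.log(p)
                       for row in posteriors for p in row if p > 0)
    return 1.0 - uncertainty / (n * math.log(k))


def average_highest_probability(posteriors):
    """Mean of each individual's largest posterior class probability."""
    return sum(max(row) for row in posteriors) / len(posteriors)


sharp = [[0.98, 0.02], [0.03, 0.97]]
fuzzy = [[0.55, 0.45], [0.40, 0.60]]
print(relative_entropy(sharp), relative_entropy(fuzzy))
print(average_highest_probability(sharp), average_highest_probability(fuzzy))
```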

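  19. Testing K against K-1 Classes • Bootstrapped likelihood ratio test • LRT = 2[logL(model 1) - logL(model 2)], where model 2 (K-1 classes) is nested in model 1 (K classes) • Bootstrap steps: • Fit both models to the data and compute the observed LRT • Generate bootstrap samples under the K-1 class model, refit both models to each sample, and compute the LRT each time to build its reference distribution • Compare the observed LRT with that distribution to get the p value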
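The final comparison step can be sketched as follows, assuming the bootstrap LRT values have already been obtained by refitting both models to samples generated under the K-1 class model. The function names and the numbers in the usage line are hypothetical.

```python
def likelihood_ratio(loglik_k, loglik_k_minus_1):
    """LRT = 2 [logL(K classes) - logL(K-1 classes)]."""
    return 2.0 * (loglik_k - loglik_k_minus_1)


def bootstrap_p_value(observed_lrt, bootstrap_lrts):
    """Share of bootstrap LRT values at least as large as the observed one."""
    exceed = sum(1 for value in bootstrap_lrts if value >= observed_lrt)
    return exceed / len(bootstrap_lrts)


observed = likelihood_ratio(-1180.0, -1250.0)      # made-up log-likelihoods
print(bootstrap_p_value(observed, [3.1, 5.4, 2.2, 7.9, 4.0]))
```

A small p value suggests that the K-1 class model is unlikely to have produced an improvement in fit as large as the one observed, which favors K classes.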

  20. Testing K against K-1 Classes • Vuong-Lo-Mendell-Rubin likelihood-ratio test

  21. Determine Quality of Indicators • Good indicators • Item response probability is close to 0 or 1 in each class • Bad indicators • Item response probability is high in more than one class, like cross-loading in factor analysis • Item response probability is low in all classes, like low loading in factor analysis

  22. LCA Examples • LCA • LCA with covariates • Class predicts a categorical outcome

  23. Save Membership Variable • Variable: idvar = id; • Output: • Savedata: File = cmmber.txt; Save = cprob;

  24. Latent Profile Analysis • Covariances among the continuous variables, conditional on class, are fixed at zero • Variances of the continuous variables are constrained to be equal across classes and minimized • Mean differences are maximized across classes
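Under these constraints the density of one case is a class-weighted sum of products of univariate normal densities, with class-specific means and variances shared across classes. The sketch below assumes exactly that specification; the function names and the values in the usage line are illustrative.

```python
import math


def normal_pdf(y, mean, variance):
    """Density of a univariate normal distribution."""
    return math.exp(-(y - mean) ** 2 / (2.0 * variance)) / math.sqrt(
        2.0 * math.pi * variance)


def latent_profile_density(y, class_probs, means, variances):
    """Mixture density of one case under a latent profile model.

    y: observed values of the continuous indicators.
    class_probs: P(C = k) for each class.
    means: means[k][j] = mean of indicator j in class k.
    variances: variance of each indicator, equal across classes.
    Zero within-class covariances let the class density factor into
    a product over indicators.
    """
    total = 0.0
    for pi_k, mu_k in zip(class_probs, means):
        density = pi_k
        for yj, mean, variance in zip(y, mu_k, variances):
            density *= normal_pdf(yj, mean, variance)
        total += density
    return total


print(latent_profile_density([0.0, 0.0], [0.5, 0.5],
                             [[-1.0, -1.0], [1.0, 1.0]], [1.0, 1.0]))
```

Summing the logarithm of this density over all cases gives the log-likelihood that estimation maximizes.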

  25. Finite Mixture Modeling (multivariate normal variables) • Finite = finite number of subgroups/classes • Variables are normally distributed in each class • Means differ across classes • Variances are the same across classes • Covariances can differ across classes without restrictions, or be constrained equal across classes • Latent profile analysis is a special case with covariances fixed at zero

  26. Mixture Factor Analysis • Allows one to examine measurement properties of items in heterogeneous subgroups/classes • Measurement invariance is not required, assuming heterogeneity • Factor structure can change • See Mplus outputs

  27. Factor Mixture Analysis • Parental Control • Parental Acceptance

  28. Two dimensions of Parenting

  29. Mixture SEM • See mixture growth modeling

  30. Mixture Modeling with Known Classes • Identify hidden classes within known groups • Under nonrandomized experiments • Impose equality constraints on covariates to identify similar classes from known groups • Compare classes that differ in covariates
