550 likes | 575 Vues
Explore the discrimination problem of classifying new elements in populations with known distributions, such as bones as human or not, or patients as ill or healthy. Understand model formulation and interpretation using Fisher's approach for various groups. Learn about discriminant analysis, cluster discrimination, and classification problems. Discover the identification of features through gene analysis and pattern recognition.
E N D
Two classification problems • Discrimination • Cluster
The discrimination problem • Given two populations with known distributions, classify a new element in one of the two populations
Examples Classify: • Bones as human or not • Consumer as reliable or not (credit scoring) • A patient as ill or healthy • An art work as made by author A or B. • Automatic classification (letters, coins, bills, ...)
Basic Data Data Matrix Element 1st Element 1st Element n2th Element n1th Group A Group B
Identification of features .23 …. Pattern Recognition Matrix Classify as known or unknown
Classification problems A ? 4 100 euros? 1000 dracmas?
Particular case: Normal Populations Classify P2
Interpretation Classify B B Classify A A
Fisher Clasificar en población B Clasificar en A B A
Problemas del modelo lineal • No hay garantía de que las probabilidades estén entre cero y uno, pueden tomar valores negativos o mayores que uno. • Es heterocedástico. Si estimamos el modelo lineal con variable de clasificación –1 +1 se obtiene la función lineal discriminante.