120 likes | 433 Vues
What is Data Clustering. Classification of objects into different groups.Objects in each subset share some common trait.Useful technique for Data Analysis and Data Mining.. Clustering Examples. Biology: clustering is used to group homologous (similar) DNA sequences into gene families. Market Re
E N D
1. Clustering in Data Mining CS 157B, spring 2007
Phuong Tran
2. What is Data Clustering Classification of objects into different groups.
Objects in each subset share some common trait.
Useful technique for Data Analysis and Data Mining.
3. Clustering Examples Biology: clustering is used to group homologous (similar) DNA sequences into gene families.
Market Research:partition the general population into market segments and to better understand the relationships between different groups of consumers.
WWW Search: division of web pages/documents into genres.
4. Types of Clustering Hierarchichal: Successively determine new clusters from previously determined clusters (parent/child clusters).
Partitional: Establish all clusters at once, at the same level.
5. Creating Clusters: Break Up vs Build Up Break Up: start from the bottom of the tree, divide the general population into smaller and smaller clusters.