180 likes | 295 Vues
Discover how to effectively apply clustering techniques using WEKA Explorer to identify groups of similar instances in your datasets. This tutorial covers various clustering algorithms including k-Means, EM, Cobweb, X-means, and FarthestFirst. Learn how to visualize clusters and compare them against true clusters if available. Additionally, we will discuss evaluation metrics, particularly focusing on log-likelihood for clustering schemes that output probability distributions. Enhance your machine learning skills and improve your data analysis with WEKA!
E N D
Explorer: clustering data • WEKA contains “clusterers” for finding groups of similar instances in a dataset • Implemented schemes are: • k-Means, EM, Cobweb, X-means, FarthestFirst • Clusters can be visualized and compared to “true” clusters (if given) • Evaluation based on loglikelihood if clustering scheme produces a probability distribution University of Waikato