Unsupervised Evolutionary Clustering Algorithm for Mixed Type Data

Zhi Zheng, Maoguo Gong, Jingjing Ma, Licheng Jiao, Qiaodi Wu
CEC 2010
Presented by Chien-Hao Kung, 2011/12/1

Outline
• Motivation
• Objectives
• Methodology
• Experiments
• Conclusions
• Comments

Motivation
• K-prototype (KP) is a well-known partitional clustering algorithm for mixed type data.
• However, it is sensitive to initialization and easily converges to a local optimum.

Objectives
• In this study, KP is applied as a local search strategy that runs inside a global evolutionary search, which helps KP overcome these flaws.

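Methodology
• K-prototype algorithm
  • Step 1. Initialize the cluster prototypes.
  • Step 2. For each data item, calculate its distance to every prototype and assign it to the nearest cluster.
  • Step 3. Retest every data item against the current prototypes and reassign it if another cluster is nearer.
  • Step 4. Repeat Step 3 until no item changes its cluster.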
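The four steps above can be sketched in plain Python. This is a minimal illustration, not the presenters' code: it assumes each item stores its numeric attributes first and its categorical attributes after them, uses squared Euclidean distance plus a weighted mismatch count, and updates prototypes in batch after each pass. The names `kp_distance`, `update_prototype`, `k_prototypes`, `n_num` and `gamma` are hypothetical; `gamma` plays the role of the weight that balances the two attribute types.

```python
import random
from collections import Counter

def kp_distance(item, proto, n_num, gamma):
    """Squared Euclidean distance on the numeric part plus gamma times
    the number of mismatches on the categorical part."""
    num = sum((a - b) ** 2 for a, b in zip(item[:n_num], proto[:n_num]))
    cat = sum(a != b for a, b in zip(item[n_num:], proto[n_num:]))
    return num + gamma * cat

def update_prototype(members, n_num):
    """Mean of each numeric attribute, mode of each categorical attribute."""
    cols = list(zip(*members))
    means = [sum(c) / len(c) for c in cols[:n_num]]
    modes = [Counter(c).most_common(1)[0][0] for c in cols[n_num:]]
    return means + modes

def k_prototypes(data, k, n_num, gamma, max_iter=100, seed=0):
    rng = random.Random(seed)
    # Step 1: pick k data items as the initial prototypes.
    protos = [list(p) for p in rng.sample(data, k)]
    labels = [None] * len(data)
    for _ in range(max_iter):
        # Steps 2-3: assign every item to its nearest prototype.
        new_labels = [
            min(range(k), key=lambda j: kp_distance(x, protos[j], n_num, gamma))
            for x in data
        ]
        # Step 4: stop once no item changes its cluster.
        if new_labels == labels:
            break
        labels = new_labels
        for j in range(k):
            members = [x for x, lab in zip(data, labels) if lab == j]
            if members:  # keep the old prototype if a cluster went empty
                protos[j] = update_prototype(members, n_num)
    return labels, protos
```

Because the result depends on which items `rng.sample` happens to pick, rerunning with a different `seed` can give a different partition on harder data, which is the initialization sensitivity the Motivation slide refers to.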

Methodology
• Evolutionary k-prototype (EKP)
  • Step 1. Initialization
  • Step 2. Crossover
  • Step 3. Mutation
  • Step 4. KP search
  • Step 5. Evaluation and selection
  • Step 6. Termination test
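The six EKP steps can be read as a generic evolutionary loop with a local search applied to every offspring. The skeleton below is only a sketch under stated assumptions: the operators are passed in as callables, selection keeps the best individuals from parents plus offspring, and termination is a fixed generation budget. The slides do not specify the selection or termination rules, so both choices, and the name `evolve`, are hypothetical.

```python
import random

def evolve(init_population, crossover, mutate, local_search, cost,
           generations, pc, pm, seed=0):
    """Generic loop: lower cost is better; returns the best individual."""
    rng = random.Random(seed)
    population = list(init_population)            # Step 1: initialization
    size = len(population)
    for _ in range(generations):                  # Step 6: assumed fixed budget
        offspring = []
        for _ in range(size):
            a, b = rng.sample(population, 2)
            # Step 2: recombine two parents with probability pc.
            child = crossover(a, b, rng) if rng.random() < pc else a
            # Step 3: mutate with probability pm.
            if rng.random() < pm:
                child = mutate(child, rng)
            # Step 4: refine the child with the local search (KP in EKP).
            offspring.append(local_search(child))
        # Step 5: evaluate and keep the best `size` individuals (assumed rule).
        population = sorted(population + offspring, key=cost)[:size]
    return population[0]
```

In EKP an individual would be a set of K prototypes, `local_search` would run KP from those prototypes, and `cost` would be the clustering objective; here they are left abstract so the control flow stays visible.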

Methodology
• Initialization
  • Eight parameters have to be set before evolution:
    • Cluster number
    • Weight r, which balances the influence of numerical and categorical attributes on clustering
    • Population size
    • Proportion of initial individuals generated by choosing items randomly from the dataset (IP)
    • Crossover probability
    • Mutation probability
    • Distribution index of simulated binary crossover (SBX)
    • Distribution index n of polynomial mutation

Methodology
• Initialization
  • Two random initialization schemes are used:
    • The first randomly chooses K data items as the cluster prototypes.
    • The second randomly generates K prototypes.
  • Ex: from the numerical ranges [2.23,5.63], [6.56,5.13] and the categorical value sets {1,2,3,4,5,6}, {2,4}
    • => {3.21,6.23,2,4}
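Both initialization schemes can be sketched as follows. This is an illustration only: it assumes that the bracketed pairs in the example are the bounds of the two numerical attributes, that the braces are the value sets of the two categorical attributes, and that values are drawn uniformly. The function names and the use of IP as a simple fraction of the population are likewise assumptions.

```python
import random

def init_from_data(data, k, rng):
    """Scheme 1: pick K distinct data items as the prototypes."""
    return [list(item) for item in rng.sample(data, k)]

def init_from_ranges(num_ranges, cat_domains, k, rng):
    """Scheme 2: generate K prototypes, drawing each numeric value from its
    range and each categorical value from its domain (uniformly, by assumption)."""
    protos = []
    for _ in range(k):
        # Bounds may be listed in either order, as in the slide's example.
        nums = [rng.uniform(min(a, b), max(a, b)) for a, b in num_ranges]
        cats = [rng.choice(sorted(dom)) for dom in cat_domains]
        protos.append(nums + cats)
    return protos

def init_population(data, num_ranges, cat_domains, k, pop_size, ip, seed=0):
    """Build pop_size individuals; a fraction ip of them uses scheme 1."""
    rng = random.Random(seed)
    n_from_data = round(ip * pop_size)
    population = []
    for i in range(pop_size):
        if i < n_from_data:
            population.append(init_from_data(data, k, rng))
        else:
            population.append(init_from_ranges(num_ranges, cat_domains, k, rng))
    return population

# Ranges and domains copied from the example on the slide.
example = init_from_ranges(
    [(2.23, 5.63), (6.56, 5.13)], [{1, 2, 3, 4, 5, 6}, {2, 4}], 1, random.Random(0)
)
```

Each entry of `example` has the same shape as the slide's {3.21,6.23,2,4}: two numeric values followed by two categorical values.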

Methodology
• Crossover
  • Numerical type: simulated binary crossover (SBX)
  • Categorical type: single-point crossover
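A sketch of the two crossover operators, applied to one pair of parent prototypes. It uses the standard unbounded form of SBX with distribution index `eta`; whether the paper uses a bounded variant, and how it pairs prototypes between parents, is not stated on the slide. The helper names and the `n_num` split between numeric and categorical positions are assumptions.

```python
import random

def sbx_pair(x1, x2, eta, rng):
    """Simulated binary crossover for one numeric gene (unbounded form)."""
    u = rng.random()
    if u <= 0.5:
        beta = (2.0 * u) ** (1.0 / (eta + 1.0))
    else:
        beta = (1.0 / (2.0 * (1.0 - u))) ** (1.0 / (eta + 1.0))
    c1 = 0.5 * ((1.0 + beta) * x1 + (1.0 - beta) * x2)
    c2 = 0.5 * ((1.0 - beta) * x1 + (1.0 + beta) * x2)
    return c1, c2

def single_point(a, b, rng):
    """Single-point crossover: swap the tails after a random cut point."""
    if len(a) < 2:
        return list(a), list(b)
    point = rng.randrange(1, len(a))
    return list(a[:point]) + list(b[point:]), list(b[:point]) + list(a[point:])

def crossover_prototype(p1, p2, n_num, eta, rng):
    """SBX on the first n_num (numeric) genes, single-point on the rest."""
    pairs = [sbx_pair(a, b, eta, rng) for a, b in zip(p1[:n_num], p2[:n_num])]
    cat1, cat2 = single_point(p1[n_num:], p2[n_num:], rng)
    return [c[0] for c in pairs] + cat1, [c[1] for c in pairs] + cat2
```

SBX keeps the children centred on the parents (the two children of a gene always sum to the same value as the two parents), and a larger `eta` keeps them closer to the parents. Categorical values are only exchanged, never blended, so no invalid category can appear.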

Methodology
• Mutation
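The parameter list names polynomial mutation for this step. A minimal sketch of the basic polynomial mutation on numeric genes is given below, assuming each gene has known bounds and that out-of-range results are clipped. How the paper mutates categorical genes is not recoverable from the slide, so that part is left out. The function names and the per-gene use of the mutation probability `pm` are assumptions.

```python
import random

def polynomial_mutation(x, lo, hi, eta, rng):
    """Perturb x by a polynomially distributed step scaled to (hi - lo)."""
    u = rng.random()
    if u < 0.5:
        delta = (2.0 * u) ** (1.0 / (eta + 1.0)) - 1.0
    else:
        delta = 1.0 - (2.0 * (1.0 - u)) ** (1.0 / (eta + 1.0))
    # Clip so the mutated value stays inside the attribute's range.
    return min(hi, max(lo, x + delta * (hi - lo)))

def mutate_numeric(proto, bounds, eta, pm, rng):
    """Mutate each numeric gene independently with probability pm.
    bounds[i] is the (lo, hi) range of gene i; other genes are untouched."""
    out = list(proto)
    for i, (lo, hi) in enumerate(bounds):
        if rng.random() < pm:
            out[i] = polynomial_mutation(out[i], lo, hi, eta, rng)
    return out
```

A larger `eta` concentrates the perturbation near zero, so mutated values stay close to the original.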

Methodology
• KP search
• Evaluation and selection
• Termination test

Experiments
• Parameter setting

Experiments

Experiments
• Dataset

Experiments

Conclusions
• This paper proposes a novel unsupervised clustering algorithm for mixed type data, named evolutionary k-prototype (EKP).
• The experimental results show that the evolutionary framework markedly improves the original algorithm.
• An EKP variant that adjusts the weight r automatically remains to be studied.

Comments
• Drawback
  • The method requires too many parameters.
• Application
  • Clustering
