Fast global k-means clustering using cluster membership and inequality

Fast global k-means clustering using cluster membership and inequality Presenter : Lin, Shu-Han • Authors : Jim Z.C. Lai, Tsung-Jen Huang Pattern Recognition(PR, 2010)

Outline • Motivation • Objective • Methodology • Experiments • Conclusion • Comments

Motivation • FGKM and • MGKM Have the same computational complexity MGKM Claims that it is moreeffectivethanFGKM (see 2008.PR.8.書漢.1027.Modified global k-means algorithm for minimum sum-of-squares clustering problems)

Objectives , th=.9999 • Develop a set of inequalities to • Speed up FGKM and MGKM, called MFGKM • Using Karhunen-Loeve Transform (KLT) • closely related to the Principal Component Analysis (PCA)

Methodology–MFGKM Red = proposed (or s Yj’ , called MCS)

Methodology– cluster center selection algorithm (Speed up)

Methodology– Candidate set construction algorithm

Methodology– Candidate set construction algorithm (Cont.) 1. r10=2,r10=d(x10,c) |8.2-7.2|=1 1+|2.2-4.2|=3>r10,deletex10, x10cannotbethenearestneighborofx8 l+p m 1 2

Methodology– Candidate set construction algorithm (Cont.) 2. rmax=2 m

Methodology– Candidate set construction algorithm (Cont.) 3.

Methodology– Candidate set construction algorithm (Cont.) 4. Diff(distortion) Diff=(r9-d(x8,x9))+(r10-d(x8,x10)) =2-1+2-1 m Return2andcenterofx9andx7

Methodology– Candidate set construction algorithm (Cont.)

Methodology– MCS

Experiments–Computingtime 16

Experiments–Distortion Leastdistortion Faster,butdistortion 17

Conclusions • GKM • FGKM:faster,butlocal • MGKM:betterperformancethenFGKM,butneedsmorecomputationalcomplexity • MFGKM:faster,andbetterthenMGKM • MFGKM+MCS:fastestmethod,andperformanceiscomparabletoMGKM

Comments • Advantage • Improvebothperformanceandspeed • Drawback • … • Application • …

Methodology– k-Means sensitive to the choice of a starting point 20

Methodology– The GKM algorithm Objectivefunction 21

Methodology– Objectivefunction • Oldversion • Reformulatedversion 22

Methodology– fast GKM algorithm • Oldversion • Proposedversion(auxiliaryclusterfunction) k-1 k-1 j y i i 23

Methodology– modifiedGKM algorithm • Proposedversion S2 k-1 S2 S2 ci i S2 S2 24

Methodology– modifiedGKM algorithm 25

Fast global k-means clustering using cluster membership and inequality

Fast global k-means clustering using cluster membership and inequality

Presentation Transcript

k -means Clustering

K-means Clustering

K-means Clustering

K means Clustering ( Weka )

Canopy Clustering and K-Means Clustering

k NN , K- Means, Clustering and Bayesian Inference

K-MEANS CLUSTERING

K-Means Clustering

K-means clustering

K-means Clustering

Initial K-Means Clustering :

Fast modified global k-means algorithm for incremental cluster construction

K-means Clustering

K-means Clustering

Clustering Beyond K -means

Clustering: K-Means

K Means Clustering , Nearest Cluster and Gaussian Mixture

A Fast PTAS for k-Means Clustering

K-means clustering

Social Media Analysis using Optimized K Means Clustering