A general grid-clustering approach Presenter : Chun-Ping Wu Authors : Shinong Yue, Miaomiao Wei, Jeen-shing Wang, Huaxiang Wang 國立雲林科技大學 National Yunlin University of Science and Technology PRL 2008
Outline • Motivation • Objective • Methodology • Experiments • Conclusion • Comments
Motivation • Hard c-means (HCM) requires the number of clusters to be predetermined. • Its result is crucially influenced by the choice of initial cluster centers. • HCM is often too slow to apply to large datasets.
Objective • Determining an optimal grid size by a designed partitioning index. • Breaking the curse of dimensionality in high-dimensional datasets.
Methodology • (1) Solve the minimal grid GRID that encloses all data objects in M. • (2) Successively bisect GRID; in the jth round of bisecting, solve an optimal grid size OPT with the designed partitioning index. • The bisecting stops when the number of bisected rounds equals OPT + q, where q is a constant whose optimal value is derived in the paper. • (3) Find all core grids. • (4) Merge each group of core grids into a cluster. • (5) Assign each non-core grid to a cluster. (A simplified sketch of these steps follows below.)
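The following is a minimal Python sketch of the grid-clustering pipeline in steps (1)-(5), not the authors' GGCA: the function name grid_cluster, the fixed number of bisection rounds, the mean-occupancy threshold for core grids, and the nearest-cell assignment of non-core grids are all illustrative assumptions; the paper's partitioning index, OPT, and the constant q are not reproduced here.

```python
# Minimal grid-clustering sketch (assumed simplification, not the paper's GGCA).
import numpy as np
from collections import defaultdict, deque

def grid_cluster(X, rounds=4, core_threshold=None):
    """Cluster X (n x d) by bisecting its bounding box `rounds` times per axis."""
    X = np.asarray(X, dtype=float)
    # (1) minimal grid (bounding box) that encloses all data objects
    lo, hi = X.min(axis=0), X.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)        # avoid zero-width axes
    cells_per_axis = 2 ** rounds                  # each round bisects every cell

    # (2) map every point to its grid cell after `rounds` bisections
    idx = np.floor((X - lo) / span * cells_per_axis).astype(int)
    idx = np.clip(idx, 0, cells_per_axis - 1)
    counts = defaultdict(list)
    for i, cell in enumerate(map(tuple, idx)):
        counts[cell].append(i)

    # (3) core grids: cells denser than the mean occupancy (assumed threshold,
    #     standing in for the paper's criterion)
    if core_threshold is None:
        core_threshold = len(X) / max(len(counts), 1)
    core = {c for c, members in counts.items() if len(members) >= core_threshold}

    # (4) merge face-adjacent core grids into clusters via breadth-first search
    labels_of_cell, cluster_id = {}, 0
    for cell in core:
        if cell in labels_of_cell:
            continue
        queue = deque([cell])
        labels_of_cell[cell] = cluster_id
        while queue:
            c = queue.popleft()
            for axis in range(X.shape[1]):
                for step in (-1, 1):
                    nb = list(c); nb[axis] += step; nb = tuple(nb)
                    if nb in core and nb not in labels_of_cell:
                        labels_of_cell[nb] = cluster_id
                        queue.append(nb)
        cluster_id += 1

    # (5) assign non-core cells to the nearest core cell's cluster
    labels = np.full(len(X), -1, dtype=int)
    core_cells = list(labels_of_cell)
    for cell, members in counts.items():
        if cell in labels_of_cell:
            labels[members] = labels_of_cell[cell]
        elif core_cells:
            d = [sum((a - b) ** 2 for a, b in zip(cell, cc)) for cc in core_cells]
            labels[members] = labels_of_cell[core_cells[int(np.argmin(d))]]
    return labels

# Toy usage: two well-separated Gaussian blobs should yield two cluster labels.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.3, (100, 2)), rng.normal(3, 0.3, (100, 2))])
print(np.unique(grid_cluster(X, rounds=3)))
```

Because points are binned and only cells are merged, the cost of one pass is linear in the number of points, which is the property the paper exploits to avoid HCM's repeated center updates on large datasets.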
Experiments • Clustering results of the five artificial datasets.
Experiments • Clustering results of the three benchmark datasets.
Conclusion • The GGCA integrates the advantages of both divisive and agglomerative clustering algorithms. • The GGCA solves two critical problems of conventional grid-clustering algorithms: • Grid size • Merging condition • The GGCA is a fully non-parametric algorithm.
Comments • Advantage • Improves iteration speed on large datasets. • Drawback • The paper's presentation is not clear. • Application • Clustering