APSCAN: A parameter free algorithm for clustering

APSCAN: A parameter free algorithm for clustering Presenter: Cheng-Hui Chen Author: Xiaoming Chen, Wanquan Liu, HuiningQiu, Jianhuang Lai PRL 2011

Outlines • Motivation • Objectives • Methodology • Experiments • Conclusions • Comments

Motivation • There are two distinct drawbacks for DBSCAN: • It has two parameters are difficult to be determined. • DBSCAN does not perform well to datasets with varying densities

Objectives • This paper propose a novel parameter free clustering algorithm named as APSCAN • As a parameter free clustering method, APSCAN is different from AP and DBSCAN. It is not only suitable for a single density data like DBSCAN but also can be used to cluster density varying datasets.

Methodology

Methodology MinPts = 2 Noise Class ID DBSCAN algorithm

Methodology Responsibility Availabilities Candidate k Candidate examplar k Supporting data point I’ Competing Candidate k’ r(i, k) a(i, k) Data point i Affinity propagation clustering algorithm

APSCAN Normalized density list generation

APSCAN • The Double-Density Based SCAN (DDBCAN)

Synthesize the result by Label Update Rule • If all the points in Cjof Resultj are noise points in Resulti, we give a new class label for all points in cluster Cjin the updated clustering result. j • If p is in the cluster Cjin Resultjand p is in the cluster Ci in Resulti, then mark p with label Ci in the updated clustering result. • If p belongs to Cj in Resultj and p is a noise point in Resulti, but not all the points in Cj are noise points in Resulti , we give p a label as j j i i If p is a noise point both in Resulti and Resultj , it is labeled as a noise point in the updated clustering result.

Experiments

Experiments Dataset Three Dataset One Dataset Two

Experiments Toy dataset

Experiments

Conclusions • In this paper can conclude that the proposed APS-CAN has the following three advantages: • It is a parameter free clustering method. • Itis suitable for clustering datasets with varying densities. • Itcan preserve the irregular structure of a dataset.

Comments • Advantages • It has achieved satisfactory performance on clustering datasets with varying densities. • Applications • Clustering

APSCAN: A parameter free algorithm for clustering

APSCAN: A parameter free algorithm for clustering

Presentation Transcript

Efficient Parameter-free Clustering Using First Neighbor Relations

ROCK: A ROBUST CLUSTERING ALGORITHM FOR CATEGORICAL ATTRIBUTES

Local Clustering Algorithm

Linear Clustering Algorithm

SCAN : A Structural Clustering Algorithm for Networks

HCS Clustering Algorithm

Parameter-free Hierarchical Co-Clustering by n -Ary Splits

ROCK: A Robust Clustering Algorithm for Categorical Attributes

A novel genetic algorithm for automatic clustering

A Hierarchical Clustering Algorithm for Categorical Sequence Data

Genetic Algorithm for Parameter Optimization

A Secure Clustering Algorithm for Distributed Data Streams

Clustering Algorithm

RIC: Parameter-Free Noise-Robust Clustering

SCAN: A Structural Clustering Algorithm for Networks

Algorithm design for MAPS clustering

A New Gravitational Clustering Algorithm

Boosting Algorithm for Clustering

Local Clustering Algorithm

A novel genetic algorithm for automatic clustering

SCAN: A Structural Clustering Algorithm for Networks

A Combined Clustering and Placement Algorithm for FPGAs