Nonlinear Adaptive Distance Metric Learning for Clustering Jianhui Chen, Zheng Zhao, Jieping Ye, Huan Liu KDD, 2007 Reported by Wen-Chung Liao, 2010/01/19
Outline
• Motivation
• Objective
• ADAPTIVE DISTANCE METRIC LEARNING: THE LINEAR CASE
• ADAPTIVE DISTANCE METRIC LEARNING: THE NONLINEAR CASE
• NAML
• Experiments
• Conclusions
• Comments

In distance metric learning, the goal is to achieve better
• compactness (reduced dimensionality) and
• separability (larger inter-cluster distance)
on the data, in comparison with standard distance metrics such as the Euclidean distance.
Motivation
• Traditionally, dimensionality reduction and clustering are applied in two separate steps.
• If distance metric learning (via dimensionality reduction) and clustering are performed together, the cluster separability of the data can be better maximized in the dimensionality-reduced space.
• Many real-world applications involve data with nonlinear and complex patterns.
• Kernel methods can capture such nonlinear structures.
Objectives
• Propose NAML for simultaneous distance metric learning and clustering.
• NAML
  • first maps the data to a high-dimensional feature space through a kernel function;
  • next applies a linear projection to find a low-dimensional manifold;
  • and then performs clustering in the low-dimensional space.
• The key idea of NAML is to integrate
  • kernel learning,
  • dimensionality reduction, and
  • clustering
  in a joint framework, so that the separability of the data is maximized in the low-dimensional space.
ADAPTIVE DISTANCE METRIC LEARNING: THE LINEAR CASE
• Data set: {x_1, …, x_n} ⊂ R^d
• Data matrix: X = [x_1, …, x_n] ∈ R^{d×n}
• Linear transformation W ∈ R^{d×ℓ}: x → Wᵀx
• Mahalanobis distance measure: d(x_i, x_j) = √((x_i − x_j)ᵀ M (x_i − x_j)), with M = W Wᵀ
• Minimizing the within-cluster sum of squared errors under this metric is equivalent (≡) to a trace maximization over W and the weighted cluster indicator matrix L = F(FᵀF)^{−1/2}, where F is the 0/1 cluster indicator matrix.
ADAPTIVE DISTANCE METRIC LEARNING: THE NONLINEAR CASE
• Kernel method: a nonlinear mapping sends the data into a Hilbert space (feature space); ψ_K(X) denotes the data matrix in the feature space.
• Symmetric kernel function K: K(x_i, x_j) = ⟨ψ_K(x_i), ψ_K(x_j)⟩ (inner product in the feature space)
• Kernel Gram matrix G: G_{ij} = K(x_i, x_j)
• For a given kernel function K, the nonlinear adaptive metric learning problem can be formulated with G restricted to a convex combination of p given kernel matrices: G = Σ_{i=1}^p θ_i G_i, with θ_i ≥ 0 and Σ_i θ_i = 1.
ADAPTIVE DISTANCE METRIC LEARNING: THE NONLINEAR CASE
3.1 The computation of L for given Q and G
• With Q and G fixed, the problem reduces (≡) to a trace maximization over the weighted cluster indicator matrix L; relaxing its 0/1 structure to orthonormal columns turns this into an eigenvalue problem, solved by the top eigenvectors.
ADAPTIVE DISTANCE METRIC LEARNING: THE NONLINEAR CASE
3.2 The computation of Q for given L and G
• With L and G fixed, the maximization (Max) over the projection Q reduces (≡) to a regularized discriminant-analysis-style problem, solved by a generalized eigen-decomposition.
ADAPTIVE DISTANCE METRIC LEARNING: THE NONLINEAR CASE
3.3 The computation of G for given Q and L
• With Q and L fixed, learning the kernel combination coefficients {θ_i} reduces (≡) to a convex minimization (Min) problem, solved numerically with the MOSEK optimization package.
NAML
• The NAML algorithm alternates the three steps above (update L, then Q, then G) iteratively.
• Time complexity: O(pk³n³).
Experiment
• K-means algorithm as the baseline for comparison
• Three representative unsupervised distance metric learning algorithms: Principal Component Analysis (PCA), Locally Linear Embedding (LLE), and Laplacian Eigenmap (Leigs)
• Performance measures, defined in terms of:
  • c_i: the obtained cluster indicator
  • y_i: the true class label
  • C: the set of cluster indicators
  • Y: the set of class labels
Experimental Results (10 RBF kernels for NAML)
Sensitivity Study: the effect of the input kernels and the regularization parameter λ
• Comparison: K-means using each of the 10 kernels individually vs. NAML using all 10.
• NAML provides a way to learn a metric from multiple input kernels; with this metric, an unsupervised learning algorithm such as K-means is more likely to perform as well as with the best input kernel, even when the quality of the initial kernel is low.
• A series of different λ values, ranging from 10⁻⁸ to 10⁵, was tested.
• A λ value in the range [10⁻⁴, 10²] is helpful in most cases.
Conclusion
• NAML: the joint kernel learning, metric learning, and clustering can be formulated as a trace maximization problem, which can be solved iteratively in an EM framework.
• NAML is effective in learning a good distance metric and improving clustering performance.
• A promising application is integrating multiple biological data sources, e.g., amino acid sequences, hydropathy profiles, and gene expression data.
• Future work: study how to combine a set of pre-specified Laplacian matrices to achieve better performance in spectral clustering.
Comments
• Advantage: a joint framework for kernel learning, distance metric learning, and clustering.
• Shortcoming: the demonstrated applications are limited to clustering.