Direct Convex Relaxations of Sparse SVM. Antoni B. Chan, Nuno Vasconcelos, and Gert R. G. Lanckriet. The 24th International Conference on Machine Learning (ICML 2007). Presented by Shuiwang Ji.
Outline • Introduction; • Quadratically Constrained Quadratic Programming (QCQP) formulation; • Semidefinite Programming (SDP) formulation; • Experiments.
Sparsity of SVM • Given input features x1, …, xd, the SVM solution is sparse with respect to the data points (only the support vectors contribute), but not sparse with respect to the features.
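To see why, recall the standard form of the SVM solution (conventional notation, supplied here for context): the hyperplane is a combination of training points,

$$w = \sum_{i=1}^{n} \alpha_i y_i x_i,$$

where most dual variables $\alpha_i$ are zero (only support vectors have $\alpha_i > 0$), so the solution is sparse in the data points; the resulting $w \in \mathbb{R}^d$, however, generally has all entries nonzero, so it is dense in the features.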
Motivations & Related Work • Features may be noisy or redundant; • Sparsity enhances interpretability; • Sparse PCA (Zou et al.; d'Aspremont et al.); • Sparse Eigen Methods by D.C. Programming (ICML 2007).
Vector Norm • The 0-norm of a vector counts its nonzero entries: $\|x\|_0 = |\{i : x_i \neq 0\}|$.
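For reference, the 1-norm and 2-norm used on the following slides are the standard ones (supplied here for context):

$$\|x\|_1 = \sum_{i=1}^{d} |x_i|, \qquad \|x\|_2 = \Big(\sum_{i=1}^{d} x_i^2\Big)^{1/2}.$$

The 0-norm is not a true norm (it is not positively homogeneous), and minimizing it directly is combinatorial; the 1-norm is its convex envelope on the unit box, which is why it serves as the usual sparsity-inducing relaxation.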
2-norm C-SVM Primal and Dual
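For reference, the standard 2-norm C-SVM primal and dual in conventional notation (the symbols $w, b, \xi, \alpha, C$ are the usual ones, not taken from the slides):

$$\min_{w,b,\xi}\ \tfrac{1}{2}\|w\|_2^2 + C\sum_{i=1}^{n}\xi_i \quad \text{s.t.}\quad y_i(w^\top x_i + b) \ge 1 - \xi_i,\ \ \xi_i \ge 0,$$

$$\max_{\alpha}\ \sum_{i=1}^{n}\alpha_i - \tfrac{1}{2}\sum_{i,j}\alpha_i\alpha_j\, y_i y_j\, x_i^\top x_j \quad \text{s.t.}\quad 0 \le \alpha_i \le C,\ \ \sum_{i=1}^{n}\alpha_i y_i = 0.$$

The 2-norm penalty maximizes the margin, and the dual depends on the data only through inner products $x_i^\top x_j$.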
1-norm LP-SVM Primal and Dual
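For reference, the standard 1-norm SVM in conventional notation (again, symbols supplied here for context): the primal replaces the 2-norm penalty with the sparsity-inducing 1-norm,

$$\min_{w,b,\xi}\ \|w\|_1 + C\sum_{i=1}^{n}\xi_i \quad \text{s.t.}\quad y_i(w^\top x_i + b) \ge 1 - \xi_i,\ \ \xi_i \ge 0,$$

which becomes a linear program after splitting $w = w^{+} - w^{-}$ with $w^{+}, w^{-} \ge 0$. Its dual is also an LP; since the dual norm of the 1-norm is the $\infty$-norm, it reads

$$\max_{\alpha}\ \sum_{i=1}^{n}\alpha_i \quad \text{s.t.}\quad \Big\|\sum_{i=1}^{n}\alpha_i y_i x_i\Big\|_\infty \le 1,\ \ 0 \le \alpha_i \le C,\ \ \sum_{i=1}^{n}\alpha_i y_i = 0.$$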
Interpretations of QCQP-SSVM • Problems 6 and 7 are equivalent; • QCQP-SSVM combines C-SVM and LP-SVM: the 1-norm encourages sparsity, while the 2-norm encourages a large margin (an illustrative form of this combination follows).
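The precise Problems 6 and 7 are stated in the paper; purely as an illustration of the combination described above (not the paper's exact formulation), an objective mixing the two penalties looks like

$$\min_{w,b,\xi}\ \tfrac{1}{2}\|w\|_2^2 + \lambda\|w\|_1 + C\sum_{i=1}^{n}\xi_i \quad \text{s.t.}\quad y_i(w^\top x_i + b) \ge 1 - \xi_i,\ \ \xi_i \ge 0,$$

where $\lambda \ge 0$ (a symbol introduced here for illustration) trades off sparsity against margin: $\lambda = 0$ recovers C-SVM, while dropping the 2-norm term recovers LP-SVM.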
QCQP-SSVM • QCQP-SSVM automatically learns an adaptive soft-threshold on the original SVM hyperplane.
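The soft-threshold operation referred to here is the standard one (notation supplied for context): applied elementwise to a hyperplane $\hat{w}$ with threshold $\varepsilon \ge 0$,

$$w_k = \operatorname{sign}(\hat{w}_k)\,\max\big(|\hat{w}_k| - \varepsilon,\ 0\big), \qquad k = 1, \dots, d,$$

so small coefficients are set exactly to zero while large ones are shrunk; "adaptive" indicates that the effective threshold is determined by the optimization rather than fixed in advance.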
SDP-SSVM Dual • The optimal weighting matrix increases the influence of the relevant features while down-weighting the less relevant ones; • SDP-SSVM learns a weighting on the inner product such that the hyperplane in the feature space is sparse.
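One way to picture the learned weighting (an illustration in conventional notation, not the paper's exact dual): replace the inner product $x_i^\top x_j$ in the SVM dual with a weighted one,

$$\langle x_i, x_j\rangle_D = x_i^\top D\, x_j, \qquad D = \operatorname{diag}(d_1, \dots, d_d) \succeq 0,$$

and optimize over $D$ as well, under a normalization such as a bound on $\operatorname{tr}(D)$ (an assumption made here for illustration). Features receiving weight $d_k \approx 0$ are effectively removed, so a sparse weighting yields a sparse hyperplane, and optimizing jointly over the dual variables and the positive semidefinite matrix $D$ is what places the relaxation in the semidefinite programming family.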