220 likes | 321 Views
Learning Larger Margin Machine Locally and Globally. Dept. of Computer Science and Engineering The Chinese University of Hong Kong Shatin, NT. Hong Kong Kaizhu Huang February 9, 2004. Contributions.
E N D
Learning Larger Margin Machine Locally and Globally Dept. of Computer Science and Engineering The Chinese University of Hong Kong Shatin, NT. Hong Kong Kaizhu Huang February 9, 2004 The Chinese University of Hong Kong
Contributions • Theory: A unified model of Support Vector Machine (SVM), Minimax Probability Machine (MPM), and Linear Discriminant Analysis (LDA). • Practice: A sequential Conic Programming Problem. The Chinese University of Hong Kong
Outline • Background And Motivation • Maxi-Min Margin Machine(M4) • Model Definition • Geometrical Interpretation • Solving Methods • Connections With Other Models • M4: Non-separable case • Experimental Results • Future Work • Conclusion The Chinese University of Hong Kong
Background: Classifier The Chinese University of Hong Kong
SVM A more reasonable decision plane Support Vectors Background: SVM The Chinese University of Hong Kong
Maxi-Min Margin Machine(M4) The Chinese University of Hong Kong
M4:Geometrical Interpretation The Chinese University of Hong Kong
M4:Solving Method • Basic Technique: Divide and Conquer • If we fix to a specific , the problem changes to check whether this satisfies the following constraints: • If yes, we increase ; otherwise, we decrease it. Second Order Cone Programming Problem!!! The Chinese University of Hong Kong
M4:Solving Method (Continue) Iterate the following two steps to solve M4: The Chinese University of Hong Kong
Yes No can it satisfy the constraints? M4:Solving Method (Continue) The Chinese University of Hong Kong
Span all the data points and add them together Connection with MPM Exactly MPM Optimization Problem!!! + The Chinese University of Hong Kong
M4 Connection with MPM • Remarks: • The procedure is not reversible: MPM is a special case of M4 • MPM focuses on building decision boundary GLOBALLY, i.e., it exclusively depends on the means and covariances. However, means and covariances may not be accurately estimated. MPM The Chinese University of Hong Kong
Connection With SVM SVM with a further assumption: The magnitude of w can scale up without influencing the optimization The Chinese University of Hong Kong
M4 M4 M4 SVM SVM SVM Connection With SVM M4 The Chinese University of Hong Kong
Connection With SVM SVM assumes The Chinese University of Hong Kong
Links With LDA Perform the similar procedure as in MPM LDA The Chinese University of Hong Kong
Link With LDA The Chinese University of Hong Kong
Non-separable Case The Chinese University of Hong Kong
Experimental Results-Synthetic Toy example The Chinese University of Hong Kong
Experimental Results-Benchmark Datasets The Chinese University of Hong Kong
Future Work • Kernelization? • Nonlinear extension of M4 • Speed-up algorithms? • Is critical in large-scale applications • Generation error bound? • SVM and MPM have both error bounds. • Multi-way classification extension? The Chinese University of Hong Kong
Conclusion • Propose a unified model of MPM and SVM • Propose feasible solving methods based on sequential Second Order Cone Programming. The Chinese University of Hong Kong