340 likes | 354 Views
Toric Modification on Machine Learning. Keisuke Yamazaki & Sumio Watanabe Tokyo Institute of Technology. Agenda. Learning theory and algebraic geometry Two forms of the Kullback divergence Toric modification Application to a binomial mixture model Summary. Agenda.
E N D
Toric Modificationon Machine Learning Keisuke Yamazaki & Sumio Watanabe Tokyo Institute of Technology
Agenda • Learning theory and algebraic geometry • Two forms of the Kullback divergence • Toric modification • Application to a binomial mixture model • Summary
Agenda • Learning theory and algebraic geometry • Two forms of the Kullback divergence • Toric modification • Application to a binomial mixture model • Summary
What is the generalization error? The true model Predictive model Generalization Error MLE MAP Bayes i.i.d. Learning model Data
Algebraic geometry connected to learning theory in the Bayes method. • The formal definition of the generalization error. The true model Predictive model Data Learning model
Algebraic geometry connected to learning theory in the Bayes method. • The formal definition of the generalization error. The true model Predictive model Another Kullback divergence : Data Learning model
Algebraic geometry connected to learning theory in the Bayes method. • The formal definition of the generalization error. The true model Predictive model Another Kullback divergence : Data Learning model Algebraic Geometry Machine Learning / Learning Theory
h(w) The zeta function has an important role for the connection. Analytic func. Zeta function : C-infinity func. with compact support h(w) holomorphic function Algebraic Geometry Machine Learning / Learning Theory
The zeta function has an important role for the connection. Zeta function : Kullback divergence Prior distribution holomorphic function Algebraic Geometry Machine Learning / Learning Theory
The largest pole of the zeta function determines the generalization error. Asymptotic Bayes generalization error [ Watanabe 2001] Algebraic Geometry Machine Learning / Learning Theory
Agenda • Learning theory and algebraic geometry • Two forms of the Kullback divergence • Toric modification • Application to a binomial mixture model • Summary
Resolution of singularities Calculation of the zeta function requires well-formed H(w). : Factorized form : Polynomial form
Blow-ups The resolution of singularities with blow-ups is an iterative method.
Blow-ups The resolution of singularities with blow-ups is an iterative method. Kullback divergence in ML is complicated and has high dimensional w :
Blow-ups The bottleneck is the iterative method. Applicable function Any function Computational cost Large
Blow-ups Toric Modification Toric modification is a systematic method for the resolution of singularities. Applicable function Any function Computational cost Large
Agenda • Learning theory and algebraic geometry • Two forms of the Kullback divergence • Toric modification • Application to a binomial mixture model • Summary
Toric Modification Newton diagram is a convex hull in the exponent space
Toric Modification The borders determine a set of vectors in the dual space.
Toric Modification Add some vectors subdividing the spanned area.
Toric Modification Selected vectors construct the resolution map.
Toric Modification Non-degenerate Kullback divergence • The condition to apply the toric modification to the Kullback divergence For all i :
Toric Modification Blow-ups Toric modification is “systematic”. Blow-up The search space will be large. We cannot know how many iterations we need. The number of vectors is limited.
Blow-ups Toric Modification Toric modification can be an effective plug-in method. Non-degenerate function Applicable function Any function Computational cost Limited Large
Agenda • Learning theory and algebraic geometry • Two forms of the Kullback divergence • Toric modification • Application to a binomial mixture model • Summary
An application to a mixture model • Mixture of binomial distributions The true model: Learning model:
The generalization error of the mixture of binomial distributions : Polynomial form : Factorized form The zeta function: : Generalization error
Agenda • Learning theory and algebraic geometry • Two forms of the Kullback divergence • Toric modification • Application to a binomial mixture model • Summary
Summary • The Bayesian generalization error is derived on the basis of the zeta function. • Calculation of the coefficients requires the factorized form of the Kullback divergence. • Toric modification is an effective method to find the factorized form. • The error of a binomial mixture is derived as the application.
Thank you !!! Toric Modificationon Machine Learning Keisuke Yamazaki & Sumio Watanabe Tokyo Institute of Technology
Asymptotic form of G.E. has been derived in regular models. Fisher information matrix Essential assumption: