Machine Learning for Signal Processing: Eigenfaces and Eigen Representations
Class 6. 12 Feb 2015
Instructor: Bhiksha Raj
Recall: Representing images
• The most common element in the image: background
• Or rather, large regions of relatively featureless shading
• Uniform sequences of numbers
Adding more bases
(Figure: checkerboard bases B1-B6, checkerboards with different variations)
• Getting closer at 625 bases!
“Bases”
• “Bases” are the “standard” units such that all instances can be expressed as weighted combinations of these units
• Ideal requirement: bases must be orthogonal
• Checkerboards are one choice of bases
• Orthogonal
• But not “smooth”
• Other choices of bases: complex exponentials, wavelets, etc.
Data-specific bases?
• Issue: all the bases we have considered so far are data agnostic
• Checkerboards, complex exponentials, wavelets..
• We use the same bases regardless of the data we analyze
• Image of a face vs. image of a forest
• Segment of speech vs. seismic rumble
• How about data-specific bases?
• Bases that consider the underlying data
• E.g. is there something better than checkerboards to describe faces?
• Something better than complex exponentials to describe music?
The Energy Compaction Property
• Define “better”?
• The description: a weighted combination of bases, possibly terminated after only a few terms
• The ideal: if the description is terminated at any point, we should still get most of the information about the data
• The error should be small
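This quality measure is easy to compute. The following minimal numpy sketch (the toy random signal and the random orthonormal basis are assumptions for illustration) expands a signal in an orthonormal basis, terminates the description after k terms, and checks the residual error; a basis with good energy compaction makes this error fall quickly as k grows.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(64)           # a toy "signal"

# Any orthonormal basis supports this expansion; here, a random one via QR.
B, _ = np.linalg.qr(rng.standard_normal((64, 64)))

w = B.T @ x                           # expansion coefficients
for k in (4, 16, 64):                 # terminate the description at k terms
    xk = B[:, :k] @ w[:k]             # truncated reconstruction
    print(k, np.linalg.norm(x - xk))  # truncation error; small is better
```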
Data-specific description of faces
• A collection of images
• All normalized to 100x100 pixels
• What is common among all of them?
• Do we have a common descriptor?
A typical face
• Assumption: there is a “typical” face that captures most of what is common to all faces
• Every face can be represented by a scaled version of a typical face
• We will denote this face as V
• Approximate every face f as f = wf V
• Estimate V to minimize the squared error
• How? What is V? (For a fixed V, the weights are easy; see the sketch below)
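A hedged aside: for any fixed candidate V, the optimal weight has a closed form, wf = (VTf)/(VTV) for each face f; the hard part, finding the best V itself, is derived over the next slides. A minimal numpy sketch with toy random stand-in data:

```python
import numpy as np

rng = np.random.default_rng(0)
F = rng.standard_normal((10000, 30))  # toy data: one unrolled face per column
V = rng.standard_normal(10000)        # some candidate "typical face"

# For a fixed V, the least-squares weight for each face f is (V.f)/(V.V)
w = (V @ F) / (V @ V)
total_err = np.sum((F - np.outer(V, w))**2)  # squared error, to be minimized over V
```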
A collection of least squares typical faces
• Assumption: there is a set of K “typical” faces that captures most of all faces
• Approximate every face f as f = wf,1 V1 + wf,2 V2 + wf,3 V3 + .. + wf,K VK
• V2 is used to “correct” errors resulting from using only V1, so that on average the two-term approximation is better
• V3 corrects errors remaining after correction with V2
• And so on..
• V = [V1 V2 V3 ... VK]
• Estimate V to minimize the squared error
• How? What is V?
A recollection
• Finding the best explanation of music M in terms of notes N
• Also finds the score S of M in terms of N: S = Pinv(N) M
• The approximation is U = N S
How about the other way?
• Finding the notes N given music M and score S: N = M Pinv(S)
• Also finds the best explanation of M in terms of S
• The approximation is U = N S
Finding Everything
• Find the four notes N and their score S that generate the closest approximation U = N S to M
• Now both N and S are unknown (one alternating solution is sketched below)
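One plausible way to attack this chicken-and-egg problem is to alternate the two pinv solutions from the previous two slides. The sketch below is illustrative only, not necessarily the scheme used in class; the toy shapes and random initialization are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
M = np.abs(rng.standard_normal((1025, 200)))  # toy "music" spectrogram
N = np.abs(rng.standard_normal((1025, 4)))    # initial guess for the four notes

for _ in range(100):
    S = np.linalg.pinv(N) @ M   # best score for the current notes
    N = M @ np.linalg.pinv(S)   # best notes for the current score

U = N @ S                       # closest approximation to M found
```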
The same problem
• Given the collection of faces F, find the typical faces V and the weights W such that the approximation U = V W is closest to F
Abstracting the problem: Finding the FIRST typical face
• Each “point” represents a face in “pixel space” (a plot with axes Pixel 1, Pixel 2, ...)
• Any “typical face” V is a vector in this space
• The approximation wf V for any face f is the projection of f onto V
• The distance between f and its projection wf V is the projection error for f
Abstracting the problem: Finding the FIRST typical face
• Every face in our data will suffer some error when approximated by its projection on V
• The total squared length of all the error lines is the total squared projection error
Abstracting the problem: Finding the FIRST typical face
• The problem of finding the first typical face V1: find the V for which the total projection error is minimum!
• This “minimum squared error” V is our “best” first typical face
• It is also the first Eigen face
• Not really, but we’ll get to the correction later
Formalizing the Problem: Error from approximating a single vector
• Approximating: x ≈ wv
• Projection of x onto a unit vector v: w = vTx, so the approximation is vvTx
• The error vector is x − vvTx
• Error from projecting x onto the unit vector v: e(x) = (x − vvTx)T(x − vvTx) = xTx − 2 xTv vTx + xTv (vTv) vTx
• Since vTv = 1: e(x) = xTx − xTv vTx
• xTv vTx is the squared length of the projection, so this is the very familiar Pythagoras’ theorem!!
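A quick numeric check of this identity, as a minimal numpy sketch with arbitrary toy vectors:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(5)
v = rng.standard_normal(5)
v /= np.linalg.norm(v)                       # unit length, as assumed above

direct = np.linalg.norm(x - v * (v @ x))**2  # ||x - vvTx||^2
formula = x @ x - (x @ v) * (v @ x)          # xTx - xTv vTx
print(np.isclose(direct, formula))           # True
```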
Error for many vectors
• Error for one vector: e(x) = xTx − xTv vTx
• Error for many vectors: E = Σx [xTx − xTv vTx]
• Goal: estimate v to minimize this error!
Error for many vectors
• Total error: E = Σx xTx − Σx xTv vTx = Σx xTx − vT (Σx x xT) v
• Add the constraint: vTv = 1
• Constrained objective to minimize: L(v) = Σx xTx − vT (Σx x xT) v + λ (vTv − 1)
Two Matrix Identities
• d(vT R v)/dv = (R + RT) v = 2 R v for symmetric R
• d(vT v)/dv = 2 v
Minimizing error
• Differentiating the objective w.r.t. v and equating to 0:
• −2 (Σx x xT) v + 2 λ v = 0, i.e. (Σx x xT) v = λ v
The correlation matrix
• Arrange the data as the columns of a data matrix X; then Σx x xT = X XT
• This term, X XT, is the correlation matrix R
The best “basis”
• The minimum-error basis is found by solving R v = λ v
• v is an Eigen vector of the correlation matrix R
• λ is the corresponding Eigen value
What about the total error?
• E = Σx xTx − Σx xTv vTx
• xTv = vTx (inner product), so Σx xTv vTx = vT (Σx x xT) v = vT R v = λ vTv = λ
• Hence E = Σx xTx − λ (a numeric check follows below)
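As a sanity check, the following sketch (toy random data is an assumption) verifies numerically that, for the principal eigenvector, the total projection error equals Σx xTx − λ:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((5, 100))        # columns are the data vectors x
R = X @ X.T                              # correlation matrix

lam, V = np.linalg.eigh(R)               # eigh returns ascending eigenvalues
v = V[:, -1]                             # principal eigenvector (unit length)
E = np.sum((X - np.outer(v, v @ X))**2)  # total projection error
print(np.isclose(E, np.sum(X * X) - lam[-1]))  # E = sum_x xTx - lambda
```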
Minimizing the error
• The total error is E = Σx xTx − λ
• We already know that the optimal basis is an Eigen vector
• The total error depends on the negative of the corresponding Eigen value
• To minimize the error, we must maximize λ
• I.e. select the Eigen vector with the largest Eigen value
The typical face
• Compute the correlation matrix for your data
• Arrange the (unrolled) images as columns of a matrix X and compute R = XXT
• Compute the principal Eigen vector of R
• The Eigen vector with the largest Eigen value
• This is the typical face (a sketch of the full recipe follows)
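A minimal numpy sketch of this recipe, with random stand-in data of a smaller toy size (real 100x100 faces would unroll into 10000-dimensional columns):

```python
import numpy as np

rng = np.random.default_rng(0)
# One unrolled face per column; a small toy size keeps the sketch light.
X = rng.standard_normal((1024, 300))

R = X @ X.T                   # correlation matrix R = XXT
lam, V = np.linalg.eigh(R)    # symmetric eigendecomposition, ascending order
typical_face = V[:, -1]       # eigenvector with the largest eigenvalue
```

In practice the same vector is usually obtained from the SVD of X, which avoids forming the large matrix R explicitly.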
The approximation with the first typical face
• The first typical face models some of the characteristics of the faces
• Simply by scaling its grey level
• But the approximation has error
• The second typical face must explain some of this error
The second typical face
• Approximation with only the first typical face has error
• The second face must explain this error
• How do we find this face?
Solution: Iterate
• Get the “error” faces by subtracting the first-level approximation from the original images
• Repeat the estimation on the “error” images (a sketch follows below)
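A minimal sketch of this iteration (toy random data again; the helper name principal_face is ours):

```python
import numpy as np

def principal_face(X):
    """Principal eigenvector of the correlation matrix of X."""
    lam, V = np.linalg.eigh(X @ X.T)   # ascending eigenvalues
    return V[:, -1]

rng = np.random.default_rng(0)
X = rng.standard_normal((1024, 300))   # toy stand-in: one unrolled face per column

v1 = principal_face(X)                 # first typical face
E1 = X - np.outer(v1, v1 @ X)          # first-level "error" faces
v2 = principal_face(E1)                # second typical face
E2 = E1 - np.outer(v2, v2 @ E1)        # second-level error, and so on
```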
Abstracting the problem: Finding the second typical face
• Each “point” now represents an error face in “pixel space”
• Find the vector V2 such that the projection of these error faces on V2 results in the least error
Minimizing error
• The same math applies, but now to the set of error data points
• Differentiating w.r.t. v and equating to 0: Re v = λ v, where Re is the correlation matrix of the error faces
Minimizing error
• The same math applies, but now to the set of error data points
• The minimum-error basis is found by solving Re v2 = λ v2
• v2 is an Eigen vector of the correlation matrix Re corresponding to the largest Eigen value λ of Re
• Which gives us our second typical face
• But approximation with the two faces will still result in error
• So we need more typical faces to explain this error
• We can do this by subtracting the appropriately scaled version of the second “typical” face from the error images and repeating the process
Solution: Iterate
• Get the second-level “error” faces by subtracting the scaled second typical face from the first-level error faces
• Repeat the estimation on the second-level “error” images
An interesting property
• Each “typical face” will be orthogonal to all other typical faces
• Because each of them is learned to explain what the rest could not
• None of these faces can explain one another!
To add more faces
• We can continue the process, refining the error each time
• This procedure is an instance of “Gram-Schmidt” orthogonalization (a sketch follows below)
• OR… we can do it all at once
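For reference, a minimal sketch of classical Gram-Schmidt orthogonalization (our own illustrative helper, not code from the course):

```python
import numpy as np

def gram_schmidt(vectors):
    """Orthonormalize vectors by subtracting, from each one, its
    projections onto the previously accepted directions."""
    basis = []
    for v in vectors:
        for b in basis:
            v = v - (b @ v) * b        # remove what b already explains
        norm = np.linalg.norm(v)
        if norm > 1e-12:               # skip vectors the basis already explains
            basis.append(v / norm)
    return np.array(basis)
```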
With many typical faces
• Approximate every face f as f = wf,1 V1 + wf,2 V2 + ... + wf,K VK
• In matrix form: the approximation is U = V W, where M collects the faces, V the typical faces, and W the weights
• Here W, V and U are ALL unknown and must be determined
• Such that the squared error between U and M is minimum
With multiple bases
• Assumption: all bases v1, v2, v3, .. are unit length
• Assumption: all bases are orthogonal to one another: viTvj = 0 if i != j
• We are trying to find the optimal K-dimensional subspace onto which to project the data
• Any set of vectors spanning this subspace will define the subspace
• Constraining them to be orthogonal does not change this
• I.e. if V = [v1 v2 v3 … ], VTV = I
• Pinv(V) = VT
• Projection matrix for V = V Pinv(V) = VVT (a numeric check follows below)
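The sketch below checks these three facts numerically on a small random orthonormal V (the toy sizes are assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)
V, _ = np.linalg.qr(rng.standard_normal((10, 3)))  # orthonormal columns

print(np.allclose(V.T @ V, np.eye(3)))             # VTV = I
print(np.allclose(np.linalg.pinv(V), V.T))         # Pinv(V) = VT
P = V @ V.T                                        # projection matrix VVT
print(np.allclose(P @ P, P))                       # projecting twice = projecting once
```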
With multiple bases
• V represents a K-dimensional subspace
• Projection of a vector: VVTx
• Error vector: x − VVTx
• Error length: ||x − VVTx||2 = xTx − xTV VTx
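Putting it all together, a hedged all-at-once sketch (toy random data; the eigenfaces helper name is ours): the K eigenvectors of R with the largest eigenvalues are the K typical faces, and W = VTX gives the weights.

```python
import numpy as np

def eigenfaces(X, K):
    """Top-K eigenfaces of X (one unrolled face per column)."""
    lam, V = np.linalg.eigh(X @ X.T)   # eigenvalues in ascending order
    return V[:, ::-1][:, :K]           # the K with the largest eigenvalues

rng = np.random.default_rng(0)
X = rng.standard_normal((1024, 300))   # toy stand-in data

VK = eigenfaces(X, K=5)                # typical faces, all at once
W = VK.T @ X                           # weights: projections onto the subspace
U = VK @ W                             # approximation; error is ||X - U||^2
```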