210 likes | 357 Views
Principal Component Analysis. Consider a collection of points. Suppose you want to fit a line. Project onto the Line. Consider variance of distribution on the line. Different line. different variance. Maximum Variance. Minimum Variance. Given by eigenvectors of covariance matrix
E N D
Project onto the Line Consider variance of distribution on the line
Different line . . . different variance
Given by eigenvectors of covariance matrix of coordinates of original points
PCA notes… • Input data set • Subtract the mean to get data set with 0-mean • Compute the covariance matrix • Compute the eigenvalues and eigenvectors of the covariance matrix • Choose components and form a feature vector. Order by eigenvalues – highest to lowest
PCA • To compress, ignore components of lesser significance • The feature vector F is a matrix is the matrix of ordered eigenvectors • Derive the data set in the new coordinates: • new_data = FT old_data
Covariance • C, of 2 random variables X and Y where
OOBB Choose bounding box oriented this way
OOBB: Fitting Covariance matrix of point coordinates describes statistical spread of cloud. OBB is aligned with directions of greatest and least spread (which are guaranteed to be orthogonal).
OOBB Good Box
OOBB Add points: worse Box
OOBB More points: terrible box