Classical Scaling

Classical Scaling 기계학습 연구실 박성인 2010.02.25

Contents • Introduction • Finding Coordinates in Classical Scaling • A Numerical Example for Classical Scaling • Choosing a Different Origin • Advanced Topics

Introduction • The basic idea of classical scaling is to assume that the dissimilarities are distances and then find coordinates that explain them • Classical scaling uses the same procedure (In Section 7.9) but operates on squared dissimilarities instead of • This method is popular because it gives an analytical solution, requiring no iterations

Finding Coordinates in Classical Scaling • (from 7.5 Section) • cis the vector with the diagonal elements of XX’ • Multiplying the left and the right sides by the centering matrix and by the factor -1/2 • Double centering • The centering around B can be removed because X is column centered, and hence so is B

Double Centering • A way of normalizing the data matrix • Types • Row centering • Each row of the row-centered data matrix sums to zero • Column centering • Each column of the column-centered data matrix sums to zero • Double centering • Consists of the successive application of both row centering and column centering • Example A = AB = (Column centering) ABA = (Double centering) B = BA = (Row centering)

Finding Coordinates in Classical Scaling • To find the MDS coordinates from B, we factor B by eigendecompositionQΛQ′ • The procedure for classical scaling • Compute the matrix of squared dissimilarities • Apply double centering to this matrix • Compute the eigendecomposition of QΛQ′ • Let the matrix of the first meigenvaluesgreater than zero be and the first m columns of Q. Then, the coordinate matrix of classical scaling is given by • If happens to be a Euclidean distance matrix, then classical scaling finds the coordinates up to a rotation

Finding Coordinates in Classical Scaling • In step 4, negative eigenvaluescan occur but not if is a Euclidean distance matrix (see Chapter 19) • In classical scaling, the negative eigenvalues (and its eigenvectors) are simply ignored as error • Classical scaling minimizes the loss function (called Strain) • A nice property of classical scaling is that the dimensions are nested • MDS by minimizing Stress does not give nested solutions

A numerical Example for Classical Scaling • Using the faces data from Table 4.4

Choosing a Different Origin • X is constructed so that its columns sum to zero • The origin of configuration X coincides with the center of gravity of its points (centriod) • However, choosing this origin is not necessarily the best choice • Example • Some objects may be less familiar to the respondents, and thus lead to less reliable distance estimates than others • It is wiser to pick as an origin a point that is based more on the points associated with less error • For general solution, consider picking some arbitrary point s as the new origin, with restriction that s lies in the space of the other points

Choosing a Different Origin • In terms of algebra, point s should lie in the row space of X • The coordinate vector of s is a weighted sum of the rows of X, s’ = w’X • w’ is an m-element row vector of weights • With s as the new origin, the point coordinates • If the weight vector w is chosen such that w’1 = 1, then is a projector • If B=XX’, one obtains after projecting X to a new origin s

Choosing a Different Origin • If one chooses a particular object i as the origin • w’=[0,…,1,…,0] • the 1 is in the ith position • If one picks the centroid as the origin, then w’ = [1/n,…,1/n] • Another choice is to pick the weights in w so that they reflect the reliability of the objects • Unreliable elements should have a weight close to zero and reliable elements a high value • The origin will be attracted more towards the reliable points

Advanced Topics • The linear constraints imposed on X require X = YC • Yis an n x rmatrix of r external variables • C are weights to be optimized by classical scaling • The method of computing C • P is orthonormal(P’P =I) • The Strain loss function used by classical scaling

Advanced Topics • Example • We reanalyze the constrained MDS of the facial expression data in Section 10.3 • The external constraint matrix Y is defined as in Table 10.2 • The optimal weight matrix C obtained by constrained classical scaling is

Advanced Topics • Example • To get the coordinates X • We compute X = YC • This solution does not differ much from the constrained MDS solution in 10.4 • The main difference lies in the location of point 8

Advanced Topics • Classical scaling can even be used to study or discover the “intrinsic geometry” of highly nonlinear structures contained in high-dimensional spaces • To study such geometries and to unroll them into low-dimensional Euclidean geometries, Tenenbaum, De Silva, and Langford(2000) first define the radius of a small neighborhood • Applying classical scaling to dissimilarities generated in such a way from nonlinear structures allowed Tenenbaum et al.(2000) to unroll these structures successfully

Classical Scaling