Multivariate Distance and Similarity

Robert F. Murphy, Cytometry Development Workshop 2000

General Multivariate Dataset • We are given values of p variables for n independent observations • Construct an n × p matrix M whose rows are the vectors $X_1$ through $X_n$, each of length p

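Multivariate Sample Mean • Define the mean vector $\bar{X}$ of length p • In matrix notation: $\bar{X}_j = \frac{1}{n} \sum_{i=1}^{n} M_{ij}$ • or, in vector notation: $\bar{X} = \frac{1}{n} \sum_{i=1}^{n} X_i$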
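
As a minimal sketch of the sample mean, assuming the data matrix M is stored as a plain Python list of n rows with p values each (the name `mean_vector` and the example values are illustrative and do not come from the slides):

```python
def mean_vector(M):
    """Return the column-wise mean of an n x p matrix given as a list of rows."""
    n = len(M)        # number of observations
    p = len(M[0])     # number of variables
    # Average each variable j over all n observations
    return [sum(row[j] for row in M) / n for j in range(p)]


# Hypothetical data: 3 observations of 2 variables
M = [[1.0, 2.0], [3.0, 6.0], [5.0, 10.0]]
xbar = mean_vector(M)
print(xbar)  # [3.0, 6.0]
```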

Multivariate Variance • Define the variance vector $s^2$ of length p • In matrix notation: $s_j^2 = \frac{1}{n-1} \sum_{i=1}^{n} (M_{ij} - \bar{X}_j)^2$

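Multivariate Variance • or, in vector notation, with the square taken element by element: $s^2 = \frac{1}{n-1} \sum_{i=1}^{n} (X_i - \bar{X})^2$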
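
The variance vector can be sketched the same way. This example assumes the usual sample divisor n - 1 (the slides' own divisor did not survive extraction) and uses a hypothetical helper name, `variance_vector`:

```python
def variance_vector(M):
    """Return the per-variable sample variance of an n x p matrix (list of rows)."""
    n = len(M)
    p = len(M[0])
    # Column means, needed to form the deviations
    means = [sum(row[j] for row in M) / n for j in range(p)]
    # Assumption: divide the summed squared deviations by n - 1
    return [
        sum((row[j] - means[j]) ** 2 for row in M) / (n - 1)
        for j in range(p)
    ]


# Hypothetical data: 3 observations of 2 variables
M = [[1.0, 2.0], [3.0, 6.0], [5.0, 10.0]]
s2 = variance_vector(M)
print(s2)  # [4.0, 16.0]
```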

Covariance Matrix • Define a p × p matrix cov (called the covariance matrix) analogous to $s^2$: $\mathrm{cov}_{jk} = \frac{1}{n-1} \sum_{i=1}^{n} (M_{ij} - \bar{X}_j)(M_{ik} - \bar{X}_k)$

Covariance Matrix • Note that the covariance of a variable with itself is simply the variance of that variable: $\mathrm{cov}_{jj} = s_j^2$
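
A short sketch of the covariance matrix, again assuming a list-of-rows data matrix and the n - 1 divisor (both assumptions, as is the name `covariance_matrix`). The diagonal entries reproduce the per-variable variances:

```python
def covariance_matrix(M):
    """Return the p x p sample covariance matrix of an n x p matrix (list of rows)."""
    n = len(M)
    p = len(M[0])
    means = [sum(row[j] for row in M) / n for j in range(p)]
    # Entry (j, k) sums the products of deviations of variables j and k
    return [
        [
            sum((row[j] - means[j]) * (row[k] - means[k]) for row in M) / (n - 1)
            for k in range(p)
        ]
        for j in range(p)
    ]


# Hypothetical data: 3 observations of 2 variables
M = [[1.0, 2.0], [3.0, 6.0], [5.0, 10.0]]
cov = covariance_matrix(M)
print(cov)  # [[4.0, 8.0], [8.0, 16.0]]
```

The matrix is symmetric, so only the upper triangle needs computing in a tuned implementation; the full double loop is kept here for readability.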

Univariate Distance • The simple distance between the values of a single variable j for two observations i and l is $d_{il} = |M_{ij} - M_{lj}|$

Univariate z-score Distance • To measure distance in units of standard deviation between the values of a single variable j for two observations i and l, we define the z-score distance $z_{il} = \frac{|M_{ij} - M_{lj}|}{s_j}$
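
To illustrate the z-score distance, the sketch below divides the absolute difference on one variable by that variable's sample standard deviation. The function name `z_score_distance` and the data are hypothetical. `statistics.stdev` uses the n - 1 divisor.

```python
import statistics


def z_score_distance(M, i, l, j):
    """Distance between observations i and l on variable j, in standard deviations."""
    column = [row[j] for row in M]
    s_j = statistics.stdev(column)  # sample standard deviation of variable j
    return abs(M[i][j] - M[l][j]) / s_j


# Hypothetical data: the second variable is the first one doubled
M = [[1.0, 2.0], [3.0, 6.0], [5.0, 10.0]]
z_first = z_score_distance(M, 0, 2, 0)   # raw difference 4, s = 2
z_second = z_score_distance(M, 0, 2, 1)  # raw difference 8, s = 4
print(z_first, z_second)
```

The raw differences are 4 and 8, but both z-score distances come out at 2 standard deviations, which is the point of rescaling by $s_j$.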

Bivariate Euclidean Distance • The most commonly used measure of distance between two observations i and l on two variables j and k is the Euclidean distance $d_{il} = \sqrt{(M_{ij} - M_{lj})^2 + (M_{ik} - M_{lk})^2}$

Multivariate Euclidean Distance • This can be extended to more than two variables: $d_{il} = \sqrt{\sum_{j=1}^{p} (M_{ij} - M_{lj})^2}$
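
A minimal sketch of the Euclidean distance for any number of variables, with observations passed as equal-length sequences (the name `euclidean_distance` is illustrative):

```python
import math


def euclidean_distance(x_i, x_l):
    """Euclidean distance between two observations with the same p variables."""
    if len(x_i) != len(x_l):
        raise ValueError("observations must have the same number of variables")
    # Sum squared differences over all variables, then take the square root
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x_i, x_l)))


print(euclidean_distance([0.0, 0.0], [3.0, 4.0]))            # 5.0 (two variables)
print(euclidean_distance([1.0, 2.0, 2.0], [0.0, 0.0, 0.0]))  # 3.0 (three variables)
```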

Effects of variance and covariance on Euclidean distance • The ellipse shows the 50% contour of a hypothetical population. • Points A and B have similar Euclidean distances from the mean, but point B is clearly “more different” from the population than point A.

Mahalanobis Distance • To account for differences in variance between the variables, and to account for correlations between variables, we use the Mahalanobis distance $d_{il}^2 = (X_i - X_l)\,\mathrm{cov}^{-1}\,(X_i - X_l)^T$
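
The sketch below uses the standard definition, the square root of $(X_i - X_l)\,\mathrm{cov}^{-1}\,(X_i - X_l)^T$, and avoids forming the inverse by solving a linear system instead. The helper names `solve` and `mahalanobis`, the covariance values, and the two points are assumptions chosen to mimic the A and B situation described above, not values from the slides.

```python
import math


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    aug = [list(A[i]) + [b[i]] for i in range(n)]  # augmented copy of A | b
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(aug[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def mahalanobis(u, v, cov):
    """Mahalanobis distance between u and v for covariance matrix cov."""
    diff = [a - b for a, b in zip(u, v)]
    w = solve(cov, diff)  # w = cov^{-1} diff, without an explicit inverse
    return math.sqrt(sum(d * wi for d, wi in zip(diff, w)))


# Hypothetical population: mean at the origin, positively correlated variables
mean = [0.0, 0.0]
cov = [[4.0, 3.0], [3.0, 4.0]]
A = [2.0, 2.0]    # lies along the direction of correlation
B = [2.0, -2.0]   # lies against the direction of correlation

d_a = mahalanobis(A, mean, cov)  # about 1.07
d_b = mahalanobis(B, mean, cov)  # about 2.83
print(d_a, d_b)
```

Both points are the same Euclidean distance from the mean, yet B is more than twice as far in Mahalanobis terms. With an identity covariance matrix the function reduces to the ordinary Euclidean distance.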
