Multivariate Analysis of Variance, Part 1

Multivariate Analysis of Variance, Part 1 BMTRY 726 2/14/14

>2 Independent Samples What happens if we have more than two independent populations we want to compare?

Review ANOVA Given a random sample We rewrite ml as the overall mean plus the treatment effect of treatment l Thus we end up testing Using the model Grand mean Treatment effect Error Sample mean Est. treatment effect Residual

What we want to answer… is the contribution of the treatment large relative to our residuals? In ANOVA we do this using the sums of squares:

>2 Samples for MVN We can use one-way MANOVA to compare mean vectors for our g populations. Assumptions: (1) Independent random samples from g groups (2) Homogeneity of covariance matrices (3) Normality Summary Statistics:

One-way MANOVA Pooled estimate of the common covariance matrix: One-way MANOVA model:

One-way MANOVA Compute summary statistics for each sample SAS constraint: Vector of random errors that are ~NID(0,S) Mean vector for the lth population ml = m + tl Vector of obs for jth unit Selected from lth population

One-way MANOVA We can write the multivariate version of the sums of squares now… So if we want to test if all treatment effect are equal: Total SS and cross products Between (trt) SS and cross products Within (resid) SS and cross products

One-way MANOVA We reject out null hypothesis of Wilk’s lambda is too small. Under certain conditions, the exact distribution of lambda is known: -see table 6.3 on page 303 For other cases and when n is large, we can use the Bartlett modification

Distribution of L* • Sampling distribution for MVN data No. of No. of Sampling distribution for MVN data variables groups

Example: Infant Right Ventricular Failure An investigator examines the at the effect of giving H2S to rabbits with banded right ventricles on heart tissue “floppiness” and amount of oxidized cystine residues in mitochondria collected from heart tissue. There are three groups of animals (1) Banded and untreated (2) Banded but dosed with H2S (3) Banded and over-express CES to produce H2S There are 3 rabbits in each group.

Example

Confidence Intervals If we reject the null hypothesis, we want to determine which treatments are different. In this case, the Bonferroni approach applies We use this to develop an expression for the CI difference between treatment k and l

Example Estimate the 95% CIs for each treatment difference

MANOVA as Linear Model If we think about is as a linear model…

Model Features • Each row of the error matrix (or each row of X) -is independent of any other row -has covariance matrix S • Each column of X -corresponds to a different trait -has a model for the mean of the same form -has a corresponding column of parameters in the design matrix

MANOVA MLE’s Maximum likelihood estimates of parameters in b: For the PROC GLM version of the one-way MANOVA

The jth column of contains parameter estimates for the model fit to the data for the jth trait:

Kronecker product:

Inference We can make certain inferences about the elements of our parameter matrix Generalized LRT procedure (Wilks, 1932) Make comparison across groups (rows of b) Make comparisons across traits Matrix of zeros

Compute the corrected sums of squares and cross-products matrix for the model: • This reduces to: • The diagonal elements of H are model sums of squares whenM = I:

Compute a matrix of residual sums of squares and cross-products: When M = I: Diagonal elements of E are sums of the squared residuals Off-diagonal elements are sums of cross-products of residuals

Likelihood Ratio Test Reject the null hypothesis if Wilk’s criterion is too small Large sample chi-square approximation, reject H0 if

A more accurate approximation given by Rao (C.R. Rao (1951) Bull Int Stat Inst. 33(2), 177-180.) When is true: where

Exact distribution of Wilk’s criterion: (1) Independence (2) Homogeneity of covariance matrices (3) Multivariate normality When is true: When either These are the cases in Table 6.3 on page 303 In J & W

Other Criteria Wilk’s Lambda: Lawely-Hotelling Trace Pillai Trace Roy’s Maximum Root:

Example Recall our example right ventricular failure example…

Example Our estimate of b … Say we want to test the hypothesis of equal treatment effects:

Example We have to find E and H …

Example We can use E and H to find lambda… Does this match what we calculated using our MANOVA table? NOTE: If pand gmatch conditions in table 6.2, we can compare this to the F distribution. Otherwise we can use the c2 approximation if nis large.

Multivariate Analysis of Variance, Part 1