Rotation Tests - Computing exact adjusted p-values in multiresponse experiments

Rotation Tests - Computing exact adjusted p-values in multiresponse experiments Øyvind Langsrud, MATFORSK, Norwegian Food Research Institute.

Campylobacter experiment • Three biological replicates (block variable) • 312 FT-IR wavelengths as multiple responses • polysaccharide region [1200-900 cm-1]

Analysis with MINITAB - first wavelength

ANALYSIS with 50-50 MANOVA

Adjusted means as curves

Effect of Day - 312 single response p-values • Ordinary significance tests are not longer suitable • A lot of significant results cased by random variation (since several tests/responses) • The p-values need to be adjusted • So that they are interpretable

Adjusted p-values • So that experimentwise (or familywise) error rate is controlled • Bonferroni correction (classical method) • pAdj = #responses • pRaw • Conservative upper bound (in practice often too conservative ) • Dependence among responses not investigated • Modern methods • Makes active use of dependence among responses • Permutation tests • Rotation tests

Assume a regression model(simplified model without constant term) • Separate F-tests for each response • Random variables: F1, F2 … , Fq • Observed values: f1, f2 … , fq • Maximal F-value (= minimum p-value) obtained for response number k • Raw p-value: • Adjusted p-value:

Adjusting the minimum p-value by permutations • For m=1,2 …. M • permute data (Y P(m)Y ) • compute maximal F-statistic from these data • Compute p-value as

How is dependence handled? • Estimate of covariance matrix under H0: • Estimate based on permuted data: • The permutation test is a conditional test • Conditioned on the covariance matrix estimate • Conditioned on sufficient statistics for the unknown parameters • Fisher's exact test for 22 contingency tables • is the most famous conditional test

Conditional test under multivariate normality? • Need distribution of Y conditioned on YTY • Answer • Y is distributed as RYobs • where Yobs is the observed matrix • and where R is an uniformly distributed orthogonal matrix (random rotation matrix) • Relation to well-known tests • t-test, F-tests, Hotelling T2, Wilks’  are special cases of rotation testing • But these test statistics do not depend on YTY • Conditioning not needed • Simulations not necessary

Adjusting the minimum p-value by rotations • For m=1,2 …. M • simulate rotated data (Y R(m)Y ) • where R(m) is a simulated random rotation matrix • compute maximal F-statistic from these data • Compute p-value as • In practice: a much more efficient algorithm is applied

Adjusting the other p-values (permutations or rotations) • Remove response with minimum p-value • Adjust minimum p-value in new data set • and so on • Enforce monotonicity • All calculations can be done simultaneously

Permutation test or rotation test • Exact permutation testing • The only assumption: independent observations • Useless for few observations • Useless for complex ANOVA and regression models • Exact rotation testing • Assumes multivariate normality • Does not need as many observations as permutation testing • Can be use for complex ANOVA and regression models • F-test  rotation test

Adjusted p-values (FWE) • non-adjusted p-values (RAW) • False significance at 1% level is expected in 1% of all the investigated responses • If you have 5000 responses ….. • “Classically ” adjusted p-values (FWE) • False significance at 1% level is expected in not more that 1% of all experiments where the method is applied. • The experimentwise (or familywise) error rate is controlled

False Discovery Rate (FDR) • Adjusted p-values according to False Discovery Rate • False significance at 1% level is expected in 1% of all cases (responses) reported as significant at 1% level. • If you have 5000 responses and 200 are reported as significant at 1% level, one will expect two of these as false. • “q-values” is proposed instead of “adjusted p-values”

Calculation of FDR adjusted p-values • Several methods exist • Most of them do not handle the dependence among the responses • but OK if the “weak dependence requirement” is met • New variant • based on rotations (or alternatively permutations) • handles any kind of dependence • conservative compared to other methods • since the method does not involve an estimate of the amount of responses with true null hypotheses

Adjusted p-values (first 30 wavelengths)

Adjusted p-values (30 most significant wavelengths)

p-values

Rotation Tests - Conclusion • Simulation principle for computing exact Monte Carlo p-value for any test statistic. • Based on multivariate normal distribution. • Generalisation of classical tests. • Related to permutation testing. • Useful for computing adjusted p-values (F-tests) • FWE, FDR • General linear models (ANOVA and regression) • Implemented in the 50-50 MANOVA program (www.matforsk.no/ola)

Rotation Tests - Computing exact adjusted p-values in multiresponse experiments

Rotation Tests - Computing exact adjusted p-values in multiresponse experiments

Presentation Transcript

Three Common Misinterpretations of Significance Tests and p-values

Tutorial on Risk Adjusted P-chart

Statistical tests for replicated experiments

p-values and Discovery

P-Values

Inference: Fisher’s Exact p-values

Intelligence Tests and Psychological Experiments

6B Symmetry and Exact Values – exact values

Tutorial on Risk Adjusted P-chart

P Values

VARIATIONS IN REFERENCE VALUES UTILIZED IN PULMONARY FUNCTION TESTS

Dilepton cocktail in p +A experiments

p-values and Discovery

Finding Exact Values of Trig Ratios

Introduction Wald tests p – values Likelihood ratio tests

Approximation and Exact Values

Understanding p-values

-log 10 (p- values )

Hypothesis Tests with Critical Values

P Values