1 / 21

Small Sample Behavior of Re-Sampling Methods Dongmei Li & Jason C. Hsu Department of Statistics

Small Sample Behavior of Re-Sampling Methods Dongmei Li & Jason C. Hsu Department of Statistics The Ohio State University Columbus, OH 43210. Outline. Motivation Description of re-sampling methods Assumption for validity of re-sampling methods

selene
Download Presentation

Small Sample Behavior of Re-Sampling Methods Dongmei Li & Jason C. Hsu Department of Statistics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Small Sample Behavior of Re-Sampling Methods Dongmei Li & Jason C. Hsu Department of Statistics The Ohio State University Columbus, OH 43210

  2. Outline • Motivation • Description of re-sampling methods • Assumption for validity of re-sampling methods • Discreteness of estimated test statistics null distribution for two independent samples case. • Discreteness of estimated test statistics null distribution for one sample case. • Conditions of getting 0 P-values for post-pivot and pre-pivot re-sampling methods for one sample case.

  3. Motivation • Permutation: large P-values • Post-pivot re-sampling: 0 P-values • Pre-pivot re-sampling: 0 P-values

  4. General microarray study set-up • Data Format: are K dimension vector for ith subject for treatment 1 and are K dimension vector for jth subject for treatment 2. • Model: and where and can be arbitrary distribution. • Null Hypothesis: • Test Statistics:

  5. Popular re-sampling methods • Permutation • Resample raw data between groups • Build distribution of max T • Post-pivot re-sampling • Resample raw data within groups • Build distribution of max of pivoted T • Pre-pivot (linear mixed effects model) re-sampling • Resample predicted values & residuals • Build distribution of max T Post-pivot re-sampling reference: Pollard, K.S. and van der Laan, M (2003) Re-sampling based multiple testing: asymptotic control of type I error and applications to gene expression data. Berkeley Electronic Press.

  6. P-value calculation formula • Raw P-value • Single step maxT adjusted P-value B is the number of bootstrap samples.

  7. Assumption for validity • Permutation method: • Post-pivot re-sampling method • Pre-pivot re-sampling method (re-sample residualsfrom fixed effects general linear model)

  8. Discreteness of estimated test statistics null distribution (two independent samples case) • Maximum unique number of test statistic values. • Permutation method: 20 for m=n=3 • Post-pivot re-sampling method: 100for m=n=3 • Pre-pivot re-sampling method: 3081 for m=n=3

  9. Simulation Set-Up for two independent samples case

  10. Estimated & true null distribution (realization example) Unique max|T| <= 20 Unique max|T| <= 100 Unique max|T| <= 3081

  11. One sample case (Paired data case) • Test statistics

  12. Estimated test statistics null distribution by permutation • One sample case with n=3

  13. Estimated test statistics null distribution by post-pivot re-sampling • Paired data with m=n=3 for k genes • Estimated null distribution matrix

  14. Estimated test statistics null distribution by pre-pivot re-sampling • Paired data with m=n=3 for k genes • Null distribution matrix

  15. Discreteness of estimated test statistics null distribution (one sample case) • Unique number of test statistic values. • Permutation method: 8 for n=3 • Post-pivot re-sampling method: 10for n=3 • Pre-pivot re-sampling method: 10 for n=3

  16. One realization example for n=3

  17. Conditions of getting 0 raw P-values with n=2 for post-pivot and pre-pivot re-sampling

  18. Conditions of getting 0 adjusted P-values with n=2 for post-pivot and pre-pivot re-sampling

  19. Conditions of getting 0 raw P-values with n=3 for post-pivot and pre-pivot re-sampling

  20. Conditions of getting 0 adjusted P-values with n=3 for post-pivot and pre-pivot re-sampling

  21. Discussion • Permutation: large P-values • Post-pivot re-sampling: 0 P-values • Pre-pivot re-sampling: 0 P-values • In one sample case: • Same test statistics null distribution for post-pivot & pre-pivot re-sampling • In two independent samples case: • Pre-pivot re-sampling has less discrete and more smooth max|T| null distribution than post-pivot and permutation.

More Related