1 / 7

MV Stats News

MV Stats News. Vol. 1, Number 4, October 20, 2011. Bringing multivariate data analysis and data visualization to your breakfast table. Today’s topic: Early geometric proof rediscovered Fisher uses beautiful n-D interpretation to derive distribution of s and independence from mean.

crosemary
Download Presentation

MV Stats News

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. MV Stats News Vol. 1, Number 4, October 20, 2011 Bringing multivariate data analysis and data visualization to your breakfast table Today’s topic: Early geometric proof rediscovered Fisher uses beautiful n-D interpretation to derive distribution of s and independence from mean Michael Friendly, Staff Reporter Filed: 1/6/2020 12:10 AM

  2. Student (1908): the t distribution • In 1908, W.S. Gosset (“Student”) published “The probable error of the mean” (Biometrika, 6, 1-25) • Established the t-test for small samples • Gave a derivation of the sampling distribution of • Proof required showing:

  3. Student (1908): the t distribution • What Student did: • Computed the first 4 moments of the dist of s2 • Showed that these agreed with those of χ2 • “hence, it is probable that the curve found represents the theoretical distribution of s2; so that we have no actual proof, we shall assume it in what follows” • Independence: Showed only that mean and s2 were uncorrelated (not strictly independent) • Fisher (1939): “This was the most striking gap in his argument” (Why: depends on joint distribution). Independence: requires showing Pr(A,B) ~ Pr(A)*Pr(B)

  4. Fisher (1915): Geometric proof Student → Stratton → Fisher → Pearson In 1912, Fisher wrote to Student, suggesting a geometric proof of the independence of mean, M, and s2 “... the form establishes itself instantly, when the distribution of the sample is viewed geometrically.” Essential idea: In observation space (đn), the 1D line of the mean is orthogonal to the n-1 D space of (xi – M) Pearson initially refused to publish in Biometrika: “I do not follow Mr. Fisher’s proof, and it is not the kind which appeals to me.”

  5. Fisher (1915): these quantities have “an exceedingly beautiful interpretation in generalised space” Note: Independence is established if can show that

  6. From: JA Hanley etal (2008), Student’s z, t, and s: What if Gosset had R, The American Statistician, 62(1), 64-69.

  7. Not enough for Pearson? Corresponding proof for correlation coefficient: n pairs regarded as coords of a point in 2n-D space sample means, variances & covariance have “a beautiful interpretation”

More Related