Math 3680 Lecture #15 Hypothesis Testing: The t Test

Math 3680 Lecture #15 Hypothesis Testing: The t Test

In all of the previous examples, we assumed that we knew the population standard deviation s. • In practice, this is an extremely rare situation! • Instead, the sample standard deviation S is used in lieu of s. • This approximation is called the bootstrap estimate.

Example. The manufacturer of a new fiberglass tire claims that its average life will be at least 40,000 miles. To verify this claim, a sample of 12 tires is tested, and the lifetimes were found to be 36,100 40,200 33,800 38,500 42,000 35,800 37,000 41,000 36,800 37,200 33,000 36,000 Test the manufacture’s claim using a = 0.05.

Solution. H0: The average life is at least 40,000 miles. Ha: The average life is less than 40,000 miles. We choose a = 0.05. Notice that we have to compute the sample SD this time; it’s not given.

We now can compute the test statistic But there’s a catch: since we used S instead of s, the test statistic t does NOT follow the normal curve.

Since we used S instead of s, the test statistic t does NOT follow the normal curve. Instead, there’s a theorem which says that the test statistic t follows the Student t distribution with n - 1 degrees of freedom. (There’s a slight catch with this theorem that we’ll discuss later.) For the current problem, instead of using the normal curve to compute the observed significance level, we will use the Student t distribution with 11 degrees of freedom.

For the sake of completeness, here’s the pdf of the Student’s t-distribution with r degrees of freedom: If you find this intimidating, don’t worry: we will never use it.

Red: t distribution Blue: standard normal curve

For the t distribution with 11 df, we can compute the critical value for a = 0.05 using Table 4 (p. 510). 5% In Excel, be careful: the command is TINV(0.1,11). (The default is for two tails, not one tail.)

Observed Significance Level 0.002724 In Excel, the command is TDIST(3.448,11,1). (The third entry specifies the number of tails.)

Conclusion: We reject the null hypothesis. There is good reason to believe that the average lifespan of the tires is less than 40,000 miles. Note: It is possible to compute power with the Student’s t-distribution, but the computations are much, much more complicated than the normal case (Larsen & Marx, 3rd ed., p. 447). Many statistical software packages are able to compute power for the t-test automatically.

Excel: Use the command • =TTEST(A1:A12, B1:B12,1, 1) • This is silly, I know, but you have to list the claimed average once for each entry in the list. • The blue 1 stands for a one-tailed test; the second 1 is required.

Remember: If the sample is small (n < 30) and the population variance s is unknown, then we use the t-test and not the z-test. On the other hand, if either s is known or the sample is sufficiently large (n > 30), then we may safely use the z-test instead. Also, we must be careful about stating the null and alternative hypotheses so that we correctly choose whether to use a left-tail, a right-tail, or both tails.

Example. Before a substance can be deemed safe for landfilling, its chemical properties must be assessed. In a sample of six replicates of sludge from a New Hampshire wastewater treatment plant, the mean pH was 6.68 with a standard deviation of 0.20. Can we conclude than the mean pH is less than 7.0? J. Benoit, T. Eighmy and B. Crannell, Journal of Geotechnical and Geoenvironmental Engineering 1999, pp. 877--888.

Example. Certain rectangles appear more pleasing to the eye than others. The ancient Greeks called a rectangle with the golden rectangle, and this ratio was called the golden ratio. The golden ratio has been claimed to be a deliberate design of various art and architecture.

The data below shows the width-to-length ratios of beaded rectangles used by the Shoshone Native Americans to decorate their leather goods. Does it appear that the golden rectangle is also an aesthetic standard for the Shoshones? C. Dubois, ed., Lowie’s Selected Papers in Anthropology (UC Press, Berkeley, 1960), pp. 137--142

Robustness of the t Test

The t statistic is defined by If X1, X2, …, Xn follow a normal distribution, then there’s a theorem that says that this t statistic follows the Student t-distribution with n - 1 degrees of freedom. But, in real life, this assumption is almost certainly not true. Models are idealized; real data are, well, real. Now what? (FYI: The exact answer is 0.00330.)

The good news is that the underlying pdf doesn’t have to be very close to normal in order for the test statistic to be close to the Student t-distribution. The following graphs are empirical histograms of the t statistic computed from 10,000 data sets drawn from a “triangular” distribution with pdf which is not too far off from bell shaped. Even for very small samples, the t distribution is accurate.

The following graphs are empirical histograms of the t statistic computed from 10,000 data sets drawn from a Uniform(0,1) distribution, which is symmetric but decisively not bell-shaped. Notice that convergence does not occur as quickly.

The following graphs are empirical histograms of the t statistic computed from 10,000 data sets drawn from an Exponential(1) distribution, which is neither symmetric nor bell-shaped. This time, the sample has to be of size 40 or so for the t distribution to be accurate… that’s mostly due to the Central Limit Theorem. (FYI: The exact answer is 0.00330.)

Math 3680 Lecture #15 Hypothesis Testing: The t Test

Math 3680 Lecture #15 Hypothesis Testing: The t Test

Presentation Transcript

Math 3680 Lecture #17 Two-Sample Inference

Math 3680 Lecture #3 Probability

Math 3680 Lecture #19 Correlation and Regression

Math 3680 Lecture #4 Discrete Random Variables

Math 3680 Lecture #15 Confidence Intervals

Math 3680 Lecture #8 Continuous Random Variables

Math 3680 Lecture #15 Confidence Intervals

Math 3680 Lecture #5 Important Discrete Distributions