1 / 30

Identify the I.V. and D.V. variables as you read the story:

This article explores the relationship between intensive early training in tennis and performance outcomes using nonparametric statistical tests. The null and alternative hypotheses are examined, and data is gathered to make a decision about the null hypothesis. The construction of a sampling distribution based on ranks is discussed. The Mann-Whitney U test and the Wilcoxon Matched-Pairs Signed-Ranks test are introduced as techniques for analyzing the data. The results of the analysis suggest that there is no significant difference in tennis performance between players with intensive early training and those without.

randydavis
Download Presentation

Identify the I.V. and D.V. variables as you read the story:

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. EDUCATIONAL STATISTICS EDU5950 WEEK14 Identify the I.V. and D.V. variables as you read the story: Two researchers were talking shop over coffee one morning. The topic was intensive early training in athletics. Both researchers were convinced that such training made the child less sociable as an adult, but one researcher went even further. “I think that really intensive training of young kids is ultimately detrimental to their performance in the sport. Why, I’ll bet that, among the top ten men’s single tennis players, those with intensive early training are not in the highest ranks.”

  2. EDUCATIONAL STATISTICS EDU5950 WEEK14 Continued … “Well, I certainly wouldn’t go that far,” said the second researcher. “I think all that early intensive training is quite helpful.” “Good. In fact, great. We disagree and we may be able to decide who is right. Let’s get the ground rules straight. For tennis players, how early is early and what is intensive?” “Oh, I’d say early is starting by age 7 and intensive is playing every day for 2 or more hours.”

  3. EDUCATIONAL STATISTICS EDU5950 WEEK14 “That seem reasonable. Now, let’s see, our proportion is ‘excellent tennis players’ and these top ten will serve as our representative sample.” Can you identify the I.V. and D.V.? This is a new category of tests that can be used to analyze experiments in which the dependent variable is ranks. This category is called nonparametric statistics.

  4. EDUCATIONAL STATISTICS EDU5950 WEEK14 Doing nonparametric statistics are also using the hypothesis-testing steps. Write a paragraph about: - The null hypothesis - The alternative hypothesis • Gathering data • Using a sampling distribution • Making a decision about the null hypothesis • Telling a story that the data support

  5. EDUCATIONAL STATISTICS EDU5950 WEEK14 The different in the nonparametric statistics are those on sampling distribution of ranks. How a sampling distribution based on ranks might be constructed? Suppose you drew two samples of equal size (for example, N1 = N2 = 10) from the same population. You then arranged all the scores from both samples into one overall ranking, from 1 to 20.

  6. EDUCATIONAL STATISTICS EDU5950 WEEK14 Because the samples are from the same population, the sum of the ranks of one group should be equal to the sum of the ranks of the second group. After repeated sampling, all the sums could be arranged into a sampling distribution, which would allow you to determine the likelihood of any sum (if both samples come from the same population). We’ll going to learn four new techniques – Mann-Whitney U, Wilcoxon matched-pairs signed-ranks test, Wilcoxon-Wilcox multiple-comparisons test, and Spearman rs(formely known as Spearman rho (ρ)).

  7. EDUCATIONAL STATISTICS EDU5950 WEEK14 Table 1. The function of some nonparametric and parametric tests

  8. EDUCATIONAL STATISTICS EDU5950 WEEK14 The Mann-Whitney U test Nonparametric test for data from an independent-samples design. Produces a statistics, U – that is evaluated by consulting the sampling distribution of U. When sample sizes are relatively small, critical value of U can be found in Table H.

  9. EDUCATIONAL STATISTICS EDU5950 WEEK14 Use Table H if N1 = 20 or less and N2 = 20 or less. If the number of scores in one of the samples is greater than 20, the statistic U is distributed approximately as a normal. In this case, a z score is calculated, and the values of ±1.96 and ±2.58 are used as critical values for α = .05 and α = .01 Table 2 illustrate the intensive early training of the top ten male singles tennis players.

  10. EDUCATIONAL STATISTICS EDU5950 WEEK14 Table 2 ΣRyes = 4 + 6 + 8 + 9 = 27 ΣRno = 1 + 2 + 3 + 5 + 7 + 10

  11. EDUCATIONAL STATISTICS EDU5950 WEEK14 The sums of the ranks are used to calculate two U values. The smaller of the two U is used to enter Table H, which yields a probability figure. For yes group, For no group, The sum of the 2 U values = the product of (N1)(N2)

  12. EDUCATIONAL STATISTICS EDU5950 WEEK14 Refer to Table H, the lightface type gives a critical values for α levels of .01 for a one tailed test and .02 for a two tailed test. the boldface type are critical values for α = .005 for a one-tailed test and α = .01 for a two-tailed test. In similar way, the other page gives larger α values for both one- and two-tailed tests.

  13. EDUCATIONAL STATISTICS EDU5950 WEEK14 Back to our problem earlier, test the U = 7 value from the tennis data. From Table H, by looking for critical value of U for a two-tailed test with α = .05. The critical value is 2. Because the obtained value of U is 7, you must retain the null hypothesis and concluded that there is no evidence from the sample that the distribution of players trained early and intensively is significantly different from the distribution of those without such training.

  14. EDUCATIONAL STATISTICS EDU5950 WEEK14 The Wilcoxon Matched-Pairs Signed-Ranks T Test Is appropriate for testing difference between two correlated samples. The result of a Wilcoxon matched-pairs signed-ranks test is a T value – interpreted using Table J. Based on data in Table 3, the following steps lead to a T value for a Wilcoxon matched-pairs signed-ranks T:

  15. EDUCATIONAL STATISTICS EDU5950 WEEK14 Find a difference, D, for every pair of scores. The order of subtraction doesn’t matter, but it must be the same for every pair. Using the absolute value for each difference, rank the differences. The rank of 1 is given to the smallest difference, 2 goes to the next smallest, and so on. Attach to each rank the sign of its difference. Thus, if a difference produces a negative value, the rank for that pair is negative.

  16. EDUCATIONAL STATISTICS EDU5950 WEEK14 Sum the positive ranks and sum the negative ranks. T is the smaller of the absolute values of the two sums. Table 3 Σ(positive ranks) = 6 Σ(negative ranks) = -4 T = 4

  17. EDUCATIONAL STATISTICS EDU5950 WEEK14 Table J shows the critical values for both one- and two-tailed tests for several α levels. To enter the table, use N, the number of pairs of subjects. Reject H0 when the obtained T is equal to or less than the critical in the table. Note: Ties among the D scores – each tied score is assigned the mean of the ranks that would have assigned if there had been no ties

  18. EDUCATIONAL STATISTICS EDU5950 WEEK14 The Wilcoxon-Wilcox Multiple-Comparisons Test Allows to compare all possible pairs of treatments. This is like running several Mann-Whitney tests on each pair of treatments. This test requires independent samples. The following steps walk us through this test: Ordering the scores from the K treatments into one overall ranking.

  19. EDUCATIONAL STATISTICS EDU5950 WEEK14 Within each sample, add the ranks, which gives a ΣR for each sample. For each pair of treatments, subtract one ΣR from the other, which gives a difference. Finally, compare the absolute size of the difference to a critical value in Table K. This test can be used only when the N’s for all groups are equal.

  20. EDUCATIONAL STATISTICS EDU5950 WEEK14 The data in Table 4 represent the results of an experiment conducted on a solar collector. Ranks are given to each temperature, ignoring the group the temperature is in. The ranks of all those in a group are summed, producing ΣR values that range from 20 to 84 for flow rates of 14 to 17 gallons per hour.

  21. EDUCATIONAL STATISTICS EDU5950 WEEK14 Table 4 Check: 20 + 35 + 71 + 84 = 210

  22. EDUCATIONAL STATISTICS EDU5950 WEEK14 The next step is to make pairwise comparisons. With four group, six pairwise comparisons are possible: The rate of 14 gallons per hour can be paired with 15, 16, and 17; The rate of 15 can be paired with 16 and 17; and The rate of 16 can be paired with 17. For each pair, a difference in the sum of ranks is found and the absolute value of that difference is compared with the critical value in Table K.

  23. EDUCATIONAL STATISTICS EDU5950 WEEK14 The upper half of Table K has critical value for α = .05; the lower half has values for α = .01. For the experiment in Table 4, where K = 4 and N = 5, the critical values are 48.1 (α = .05) and 58.2 (α = .01). To reject H0, a difference in rank sums must be equal to or greater than the critical values. It is convenient to arrange the differences in rank sums into a matrix summary table as in Table 5.

  24. EDUCATIONAL STATISTICS EDU5950 WEEK14 Table 5

  25. EDUCATIONAL STATISTICS EDU5950 WEEK14 The Spearman rs (formely known as Spearman ρ (Rho)) Spearman rsis a special case of the Pearson product-moment correlation coefficient and is most often used when the number of pairs of scores is small (less than 20). Spearman’s name is attached to the coefficient that is used to show the degree of correlation between two sets of ranked data. The formula for rs is, Where; D = difference in ranks of a pair of scores N = number of pairs of scores

  26. EDUCATIONAL STATISTICS EDU5950 WEEK14 Let say we have an actual data for female tennis player. What is the progression in women’s professional tennis? Do they work their way up through the ranks, advancing their ranking year by year? Or do the younger player flash to the top and then gradually lose their ranking year by year? An rs might lend support to one of these hypotheses.

  27. EDUCATIONAL STATISTICS EDU5950 WEEK14 If the rank of 1 is assigned to the oldest player, a positive rs means that the older the player, the higher her rank (support first hypothesis). A negative rs means that the older the player, the lower her rank (supporting the second hypothesis). A zero rs means no relationship between age and rank. How to testing the significance of rs?

  28. EDUCATIONAL STATISTICS EDU5950 WEEK14 Table 6 shows the ten top ranked women tennis players for 1998, their age as a rank score among the ten, and rs of -.20.

  29. EDUCATIONAL STATISTICS EDU5950 WEEK14 For a sample rs, you can test the null hypothesis that the population correlation coefficient is .00. That is, a sampling distribution of rs values taken from a population in which the true correlation is zero will show you the probability of various rs‘s. If the probability for your sample rs is small, reject the null hypothesis and conclude that the relationship in the population that rs came from is not zero. Table L gives critical values for rs for the .05 and .01 levels of significance (when the number of pairs is 16 or fewer).

  30. EDUCATIONAL STATISTICS EDU5950 WEEK14 Reject the hypothesis that the population correlation coefficient is zero if the obtained rs is equal to or greater than the tabled value. Refer to Table 1, rs = -.20 based on ten pairs. Table L shows that a correlation of .648 (either positive or negative) is required for significance at the .05 level. Thus, a correlation of -.20 is not statistically significant.

More Related