1 / 27

Major Points

Major Points. An example Sampling distribution Hypothesis testing The null hypothesis Test statistics and their distributions The normal distribution and testing Important concepts. Distribution of M&M’s in the population. Yellow 20%. Brown 30%. Orange 10%. Blue 10%. Red 20%.

arlais
Download Presentation

Major Points

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Major Points • An example • Sampling distribution • Hypothesis testing • The null hypothesis • Test statistics and their distributions • The normal distribution and testing • Important concepts

  2. Distribution of M&M’s in the population Yellow 20% Brown 30% Orange 10% Blue 10% Red 20% Green 10% Original Distribution

  3. Testing one sample against population • There are normally 5.65 red M&M’s in bag (a population parameter) • Mean number of red M&M’s in a halloween candy bag = 4.56 (a sample statistic) • Are there sufficiently more red M&M’s to conclude significant differences / the two numbers come from different populations.

  4. Understanding the theoretical and statistical question • Theoretical Question • Did M&M use a different proportion of red ones for Halloween? • Statistical Question: • Is the difference between 5.65 and 4.25 large enough to conclude that it is a real (significant) difference? • Would we expect a similar kind of difference with a repeat of this experiment? • Or... • Is the difference due to “sampling error?”

  5. Sampling Error • Often differences are due to sampling error • Sampling Error does not imply doing a mistake • Sampling Error simply refers to the normal variability that we would expect to find from one sample to another, or one study to another

  6. How could we assess Sampling Error? • Take many bags of regular M&M candy. • Record the number of red M&M’s. • Plot the distribution and record its mean and standard deviation. • This distribution is a “Sampling Distribution” of the Mean

  7. Sampling Distribution Number of Red M&M’s in the population 1400 1200 1000 800 Frequency 600 400 Std. Dev = .45 200 Mean = 5.65 0 3.75 4.00 4.25 4.50 4.75 5.00 5.25 5.50 5.75 6.00 6.25 6.50 6.75 7.00 7.25 Means of various samples of Brown M&M’s

  8. What is Sampling Distribution • The distribution of a statistic over repeated sampling from a specified population. • Can be computed for many different statistics

  9. Number of Red M&M’s 1400 1200 1000 800 Frequency 600 400 Std. Dev = .45 200 Mean = 5.65 N = 10000.00 0 3.75 4.00 4.25 4.50 4.75 5.00 5.25 5.50 5.75 6.00 6.25 6.50 6.75 7.00 7.25 Mean Number Aggressive Associates • Distribution ranges between 3.76 and 7.25 • Mean is 5.65, SD is .45 • Mean of 4.00 is not likely, mean of 5 is more likely

  10. How likely is it to get score as low as 4.25 by chance. • Convert score to z score (z distribution has a mean of 0 and standard deviation of 1) • score - mean / standard deviation • Look up Table E10, smaller portion Score Z Score Probability 4.25 -3.1 .0006 5.60 -.11 .45 7.5 3.5 .0002

  11. Hypothesis Testing • A formal way of testing if we should accept results as being significantly different or not • Start with hypothesis that halloween M&M’s are from normal distribution • The null hypothesis • Find parameters of normal distribution • Compare halloween candy to normal distribution

  12. The Null Hypothesis (H0) • Is the hypothesis postulating that there is no difference, that two things are from the same distribution. • The hypothesis that Halloween candy came from a population of normal M&M’s • The hypothesis we usually want to reject. • Alternative Hypothesis: Halloween and Regular M&M’s are from different distributions

  13. It is easier to prove Alternative Hypothesis than Null Hypothesis • Hypothesis: All crows are black • Sample: 3000 crows • Results: all are black • Conclusion: Are all crows black?

  14. Observation: One white crow • Conclusion: Statement :Every crow is black” is false • It is easier to prove alternative hypothesis (all crows are not black) than null hypothesis (all crows are black)

  15. Another example of Null & Alternative Hypothesis • Hypothesis: Shopping on Amazon.com is different than on BN.com • Study: 100 usability tests comparing the two shopping process on both • Results: Both sites performed similarly • Conclusion: ?

  16. Observation: On Test No:101 Amazon did better • Conclusion: Amazon.com and BN are different.

  17. Steps in Hypothesis Testing • Define the null hypothesis. • Decide what you would expect to find if the null hypothesis were true. • Look at what you actually found. • Reject the null if what you found is not what you expected.

  18. Important Concepts • Concepts critical to hypothesis testing • Decision • Type I error • Type II error • Critical values • One- and two-tailed tests

  19. Decisions • When we test a hypothesis we draw a conclusion; either correct or incorrect. • Type I error • Reject the null hypothesis when it is actually correct. • Type II error • Retain the null hypothesis when it is actually false.

  20. Possible Scenarios

  21. M&M candy example

  22. Type I Errors • Assume Halloween and Regular candies are same (null hypothesis is true) • Assume our results show that they are not same (we reject null hypothesis) • This is a Type I error • Probability set at alpha () •  usually at .05 • Therefore, probability of Type I error = .05

  23. Type II Errors • Assume Halloween and Regular Candies are different (alternative hypothesis is true) • Assume that we conclude they are the same (we accept null hypothesis) • This is also an error • Probability denoted beta () • We can’t set beta easily. • We’ll talk about this issue later. • Power = (1 - ) = probability of correctly rejecting false null hypothesis.

  24. Critical Values • These represent the point at which we decide to reject null hypothesis. • e.g. We might decide to reject null when (p|null) < .05. • Our test statistic has some value with p = .05 • We reject when we exceed that value. • That value is the critical value.

  25. One- and Two-Tailed Tests • Two-tailed test rejects null when obtained value too extreme in either direction • Decide on this before collecting data. • One-tailed test rejects null if obtained value is too low (or too high) • We only set aside one direction for rejection.

  26. One- & Two-Tailed Example • One-tailed test • Reject null if number of red in Halloween candies is higher • Two-tailed test • Reject null if number of red in Halloween candies is different (whether higher or lower)

  27. Designing an Experiment: Feature Based Product Advisors Identify some good online product advisors Feature Based Filtering: CNET Digital Camera Advisor Basic Sony Decision Guide Dealtime Feature Based Choice Multi-Dimension (including features) based filtration Sony Advanced Decision Guide Review Based Choosing Epinions CNET Computers / Cameras / both; Sample Size; Experimental Questions

More Related