1 / 54

Statistics

Statistics. Design of Experiment. Contents. An Introduction to Experimental Design Completely Randomized Designs Randomized Block Design Factorial Experiments. An Introduction to Experimental Design. Statistical studies can be classified as being either experimental or observational.

vahe
Download Presentation

Statistics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Statistics Design of Experiment

  2. Contents • An Introduction to Experimental Design • Completely Randomized Designs • Randomized Block Design • Factorial Experiments

  3. An Introduction to Experimental Design • Statistical studies can be classified as being either experimental or observational. • In an experimental study, one or more factors are controlled so that data can be obtained about how the factors influence the variables of interest. • In an observational study, no attempt is made to control the factors.

  4. An Introduction to Experimental Design • Cause-and-effect relationships are easier to establish in experimental studies than in observational studies.

  5. An Introduction to Experimental Design • A factor is a variable that the experimenter has selected for investigation. • A treatment is a level of a factor. • Experimental unitsare the objects of interest in the experiment.

  6. An Introduction to Experimental Design • Example: • Suppose we want to examine if there is a difference among working methods A, B, and C. 18 workers are randomly chosen. • factor is the working methods. • treatment is one of A, B, and C. • Experimental unitsare the workers chosen

  7. An Introduction to Experimental Design • A completely randomized designis an experimental design in which the treatments are randomly assigned to the experimental units. • If the experimental units are heterogeneous, blocking can be used to form homogeneous groups, resulting in a randomized block design.

  8. Completely Randomized Design • Recall Simple Random Sampling • Finite populations are often defined by lists such as: • Organization membership roster • Credit card account numbers • Inventory product numbers

  9. Completely Randomized Design • A simple random sample of size n from a finite population of size N is a sample selected such that each possible sample of size n has the same probability of being selected. • Replacing each sampled element before selecting subsequent elements is called sampling with replacement. • Sampling without replacement is the procedure used most often.

  10. Completely Randomized Design • In large sampling projects, computer-generated random numbers are often used to automate the sample selection process. • Excel provides a function for generating random numbers in its worksheets. • Infinite populations are often defined by an ongoing process whereby the elements of the population consist of items generated as though the process would operate indefinitely.

  11. Completely Randomized Design • A simple random sample from an infinite population is a sample selected such that the following conditions are satisfied. • Each element selected comes from the same population. • Each element is selected independently.

  12. Completely Randomized Design • Random Numbers: the numbers in the table are random, these four-digit numbers are equally likely.

  13. Completely Randomized Design • Most experiments have critical error on random sampling. • Ex: sampling 8 samples from a production line in one day • Wrong method: • Get one sample every 3 hours not random!

  14. Completely Randomized Design • Ex: sampling 8 samples from a production line • Correct method: • You can get one sample at each 3 hours interval but not every 3 hours correct but not a simple random sampling • Get 8 samples in 24 hours • Maximum population is 24, getting 8 samples two digits • 63, 27, 15, 99, 86, 71, 74, 45, 10, 21, 51, … • Larger than 24 is discarded • So eight samples are collected at: 15, 10, 21, … hour

  15. Completely Randomized Design • In Completely Randomized Design, samples are randomly collected by simple random sampling method. • Only one factor is concerned in Completely Randomized Design, and k levels in this factor.

  16. Completely Randomized Design • Data: Treatment Observations sample sample (level) 12i…nj mean SD 1x11 x12 x1i… s1 2x21 x22 x1i… s2 . jxj1 xj2 xji… sj . . kxk1 xk2 xki… sk

  17. Completely Randomized Design • Between-Treatments Estimate of Population Variance • The between-samples estimate of σ2 is referred to as the mean square due to treatments(MSTR). denominator is the degrees of freedom associated with SSTR numerator is called the sum of squares due to treatments (SSTR)

  18. Completely Randomized Design • Within-Treatments Estimate of Population Variance. • The second estimate of σ2, the within-samples estimate, is referred to as the mean square due to error (MSE). denominator is the degrees of freedom associated with SSE numerator is called the sum of squares due to error(SSE)

  19. Completely Randomized Design • ANOVA Table Source of Variation Sum of Squares Degrees of Freedom Mean Squares F SSTR Treatments k - 1 Error SSE nT - k Total SST nT - 1

  20. Wax vpe 2 Wax Type 3 Wax Type 1 Completely Randomized Design--Example • Example: AutoShine, Inc. • AutoShine, Inc. is considering marketing a long-lasting car wax. Three different waxes (Type 1, Type 2, and Type 3) have been developed.

  21. Completely Randomized Design--Example Wax vpe 2 Wax Type 3 Wax Type 1 • In order to test the durability of these waxes, 5 new cars were waxed with Type 1, 5 with Type 2, and 5 with Type 3. Each car was then repeatedly run through an automatic carwash until the wax coating showed signs of deterioration.

  22. Completely Randomized Design--Example Wax vpe 2 Wax Type 3 Wax Type 1 • The number of times each car went through the carwash is shown on the next slide. AutoShine, Inc. must decide which wax to market. Are the three waxes equally effective?

  23. Completely Randomized Design--Example Wax vpe 2 Wax Type 3 Wax Type 1 Wax Type 1 Wax Type 2 Wax Type 3 Observation 1 2 3 4 5 27 30 29 28 31 33 28 31 30 30 29 28 30 32 31 Sample Mean 29.0 30.4 30.0 Sample Variance 2.5 3.3 2.5

  24. Completely Randomized Design--Example Wax vpe 2 Wax Type 3 Wax Type 1 • Hypotheses H0: 1=2=3 Ha: Not all the means are equal where: μ1= mean number of washes for Type 1 wax μ2= mean number of washes for Type 2 wax μ3= mean number of washes for Type 3 wax

  25. Completely Randomized Design--Example = (29 + 30.4 + 30)/3 = 29.8 Wax vpe 2 Wax Type 3 Wax Type 1 • Mean Square Between Treatments Because the sample sizes are all equal: SSTR = 5(29–29.8)2 + 5(30.4–29.8)2 + 5(30–29.8)2 = 5.2 MSTR = 5.2/(3 - 1) = 2.6 • Mean Square Error SSE = 4(2.5) + 4(3.3) + 4(2.5) = 33.2 MSE = 33.2/(15 - 3) = 2.77

  26. Completely Randomized Design--Example Wax vpe 2 Wax Type 3 Wax Type 1 • Rejection Rule p-Value Approach: RejectH0ifp-value < .05 Critical Value Approach: RejectH0ifF > 3.89 where F.05= 3.89 is based on an F distribution with 2 numerator degrees of freedom and 12 denominator degrees of freedom

  27. Completely Randomized Design--Example Wax vpe 2 Wax Type 3 Wax Type 1 • Test Statistic F = MSTR/MSE = 2.6/2.77 = .939 • Conclusion The p-value is greater than .10, where F = 2.81. (Excel provides a p-value of .42.) Therefore, we cannot reject H0. There is insufficient evidence to conclude that the mean number of washes for the three wax types are not all the same.

  28. Completely Randomized Design--Example Wax vpe 2 Wax Type 3 Source of Variation Sum of Squares Degrees of Freedom Wax Type 1 Mean Squares F 5.2 Treatments 2 2.60 .939 Error 33.2 12 2.77 Total 38.4 14 • ANOVA Table

  29. Completely Randomized Design--Example Wax vpe 2 Wax Type 3 Wax Type 1 • Minitab Outputs One-way ANOVA: Observation versus Type Source DF SS MS F P Type 2 5.20 2.60 0.94 0.418 Error 12 33.20 2.77 Total 14 38.40 S = 1.663 R-Sq = 13.54% R-Sq(adj) = 0.00%

  30. In Completely Randomized Experimental Design, the test for a difference among treatment means, we compute a F value by using the ratio A problem can arise whenever differences due to extraneous factors (not considered in the experiment) cause the MSE become large. Randomized Block Design

  31. In previous example, the time after wax is produced is not considered, but it could make differences. To examine the effects on shift in a production line, shift is a factor but gender is not considered. It could also make differences. Randomized Block Design

  32. To isolate the effects of a known effect caused by extraneous factors, one can “block” it by using the Randomized Block Design. After blocking the extraneous effect, the value of MSE becomes smaller and the ability of F test increases. Randomized Block Design

  33. Randomized Block Design • ANOVA Procedure • For a randomized block design the sum of squares total (SST) is partitioned into three groups: sum of squares due to treatments, sum of squares due to blocks, and sum of squares due to error. • The total degrees of freedom, nT - 1, are partitioned such that k - 1 degrees of freedom go to treatments, b - 1 go to blocks, and (k - 1)(b - 1) go to the error term. SST = SSTR + SSBL + SSE

  34. Randomized Block Design • ANOVA Table Source of Variation Sum of Squares Degrees of Freedom Mean Squares F SSTR Treatments k - 1 Blocks SSBL b - 1 Error SSE (k – 1)(b – 1) Total SST nT - 1

  35. rescent Randomized Block Design • Example: Crescent Oil Co. • Crescent Oil has developed three new blends of gasoline and must decide which blend or blends to produce and distribute. A study of the miles per gallon ratings of the three blends is being conducted to determine if the mean ratings are the same for the three blends.

  36. rescent Randomized Block Design • Five automobiles have been tested using each of the three gasoline blends and the miles per gallon ratings are shown on the next slide.

  37. rescent Randomized Block Design Type of Gasoline (Treatment) Automobile (Block) Block Means Blend X Blend Y Blend Z 1 2 3 4 5 31 30 29 33 26 30 29 29 31 25 30 29 28 29 26 30.333 29.333 28.667 31.000 25.667 Treatment Means 29.8 28.8 28.4

  38. rescent Randomized Block Design • Mean Square Due to Treatments • Mean Square Due to Error The overall sample mean is 29. Thus, • SSTR = 5[(29.8 - 29)2 + (28.8 - 29)2 + (28.4 - 29)2] = 5.2 MSTR = 5.2/(3 - 1) = 2.6 • Mean Square Due to Blocks SSBL = 3[(30.333 - 29)2 + . . . + (25.667 - 29)2] = 51.33 MSBL = 51.33/(5 - 1) = 12.8 SSE = 62 - 5.2 - 51.33 = 5.47 MSE = 5.47/[(3 - 1)(5 - 1)] = .68

  39. rescent Randomized Block Design • ANOVA Table Source of Variation Sum of Squares Degrees of Freedom Mean Squares F Treatments 5.20 2.60 3.82 2 Blocks 51.33 4 12.80 Error 5.47 8 .68 Total 62.00 14

  40. rescent Randomized Block Design • Rejection Rule p-Value Approach: Reject H0 if p-value < .05 Critical Value Approach: Reject H0 if F > 4.46 For = .05, F.05= 4.46 (2 d.f. numerator and 8 d.f. denominator)

  41. rescent Randomized Block Design • Test Statistic • Conclusion F = MSTR/MSE = 2.6/.68 = 3.82 The p-value is greater than .05 (where F = 4.46) and less than .10 (where F = 3.11). (Excel provides a p-value of .07). Therefore, we cannot reject H0. There is insufficient evidence to conclude that the miles per gallon ratings differ for the three gasoline blends.

  42. Factorial Experiments • In some experiments we want to draw conclusions about more than one variable or factor. • Factorial experiments and their corresponding ANOVA computations are valuable designs when simultaneous conclusions about two or more factors are required.

  43. Factorial Experiments • The term factorial is used because the experimental conditions include all possible combinations of the factors. • For example, for a levels of factor A and b levels of factor B, the experiment will involve collecting data on ab treatment combinations.

  44. Two-Factor Factorial Experiment • ANOVA Procedure • The ANOVA procedure for the two-factor factorial experiment is similar to the completely randomized experiment and the randomized block experiment • We again partition the sum of squares total (SST) into its sources. SST = SSA + SSB + SSAB + SSE

  45. Two-Factor Factorial Experiment SST = SSA + SSB + SSAB + SSE • The total degrees of freedom, nT - 1, are partitioned such that (a – 1) d.f go to Factor A, (b – 1) d.f go to Factor B, (a – 1)(b – 1) d.f. go to Interaction, and ab(r – 1) go to Error.

  46. Source of Variation Sum of Squares Degrees of Freedom Mean Squares F SSA Factor A a - 1 Factor B SSB b - 1 Interaction SSAB (a – 1)(b – 1) Error SSE ab(r – 1) Total SST nT - 1 Two-Factor Factorial Experiment

  47. Two-Factor Factorial Experiment • Step 1 Compute the total sum of squares • Step 3 Compute the sum of squares for factor B • Step 2 Compute the sum of squares for factor A

  48. Two-Factor Factorial Experiment • Step 4Compute the sum of squares for interaction • Step 5 Compute the sum of squares due to error SSE = SST – SSA – SSB - SSAB

  49. Two-Factor Factorial Experiment • Example: State of Ohio Wage Survey • A survey was conducted of hourly wages for a sample of workers in two industries at three locations in Ohio. Part of the purpose of the survey was to determine if differences exist in both industry type and location. The sample data are shown on the next slide.

  50. Two-Factor Factorial Experiment • Example: State of Ohio Wage Survey

More Related