STAT 401 EXPERIMENTAL DESIGN AND ANALYSIS

STAT 401 EXPERIMENTAL DESIGN AND ANALYSIS. Assist. Prof. Dr. R. Serkan Albayrak, Department of Business Administration, Yaşar University. INTRODUCTION TO ANOVA. The easiest way to understand ANOVA is to generate a tiny data set using the GLM.


Presentation Transcript


  1. STAT 401 EXPERIMENTAL DESIGN AND ANALYSIS. Assist. Prof. Dr. R. Serkan Albayrak, Department of Business Administration, Yaşar University

  2. INTRODUCTION TO ANOVA • The easiest way to understand ANOVA is to generate a tiny data set using the GLM. As a first step, set the grand mean, $GM$, to 5 for the data set with 10 cases. In the table below all 10 cases have a score of 5 at this point.

  3. The next step is to add the effects of the IV. Suppose that the effect of the treatment at $a_1$ is to raise scores by 2 units and the effect of the treatment at $a_2$ is to lower scores by 2 units.

  4. The changes produced by treatment are the deviations of the scores from the grand mean. Over all 10 cases the sum of the squared deviations is $\sum (\pm 2)^2 = 10 \times 4 = 40$. This is the sum of the (squared) effects of treatment if all cases are influenced identically by the various levels of A and there is no error.

  5. The third step is to complete the GLM with the addition of error.
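
The three steps so far (grand mean, treatment effect, error) can be sketched in a few lines of Python. The grand mean of 5 and the effects of +2 and -2 come from the slides; the error terms and the split into 5 cases per group are assumptions made only for illustration.

```python
# Build a tiny data set from the GLM: score = grand mean + treatment effect + error.
GRAND_MEAN = 5
EFFECTS = {"a1": +2, "a2": -2}  # treatment effects stated on the slides

# Hypothetical error terms (assumption): 5 cases per group, summing to zero in each group.
ERRORS = {"a1": [2, 0, -1, 0, -1], "a2": [0, -2, 0, 1, 1]}


def build_scores():
    """Return the scores for each treatment level as a dict of lists."""
    scores = {}
    for level, effect in EFFECTS.items():
        scores[level] = [GRAND_MEAN + effect + e for e in ERRORS[level]]
    return scores


scores = build_scores()
for level, ys in scores.items():
    print(level, ys)
```

With these assumed errors every score is an integer, which keeps the later hand calculations easy to follow.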

  6. Then the variance for the $a_1$ group is $s^2_{a_1} = 1.5$, and the variance for the $a_2$ group is $s^2_{a_2} = 1.5$. The average of these variances is also 1.5. Check that these numbers represent error variance; that means they represent random variability in scores within each group, where all cases are treated the same and therefore are uncontaminated by effects of the IV. The variance for this group of 10 numbers, ignoring group membership, is $s^2_Y = \sum (Y - GM)^2 / (N - 1)$ with $N = 10$.
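
The within-group variances and the total variance can be checked numerically. The scores below are hypothetical (an assumption, since the data table is not part of the transcript); they were chosen so that each group has a variance of 1.5, as the slide states.

```python
from statistics import mean, variance

# Hypothetical scores (assumption): grand mean 5, effects +2 / -2, plus small errors.
a1 = [9, 7, 6, 7, 6]
a2 = [3, 1, 3, 4, 4]

var_a1 = variance(a1)  # sample variance, denominator n - 1
var_a2 = variance(a2)
avg_within = mean([var_a1, var_a2])

# Variance of all 10 scores, ignoring group membership.
var_total = variance(a1 + a2)

print("variance a1:", var_a1)
print("variance a2:", var_a2)
print("average within-group variance:", avg_within)
print("total variance:", round(var_total, 2))
```

For this data the total variance is much larger than the within-group variance because it also contains the treatment effect.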

  7. Standard Setup for ANOVA. The difference between each score and the grand mean is broken into two components: the difference between the score and its own group mean, and the difference between that group mean and the grand mean.

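  8. Each term is then squared and summed separately to produce the sum of squares for error and the sum of squares for treatment: $\sum_j \sum_i (Y_{ij} - GM)^2 = \sum_j n_j (\bar{Y}_j - GM)^2 + \sum_j \sum_i (Y_{ij} - \bar{Y}_j)^2$, where $Y_{ij}$ is the score of case $i$ in group $j$, $\bar{Y}_j$ is the mean of group $j$, $n_j$ is its size, and $GM$ is the grand mean. The first term on the right is the sum of squares for treatment (the effect of the IV!); the second is the sum of squares for error. The basic partition holds because the cross-product terms vanish.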
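
A short derivation shows why the cross-product terms vanish. The notation is an assumption of this sketch: $Y_{ij}$ is the score of case $i$ in group $j$, $\bar{Y}_j$ the mean of group $j$, $n_j$ its size, and $GM$ the grand mean.

```latex
\begin{align*}
\sum_j \sum_i (Y_{ij} - GM)^2
  &= \sum_j \sum_i \bigl[(Y_{ij} - \bar{Y}_j) + (\bar{Y}_j - GM)\bigr]^2 \\
  &= \sum_j \sum_i (Y_{ij} - \bar{Y}_j)^2
   + \sum_j n_j (\bar{Y}_j - GM)^2
   + 2 \sum_j (\bar{Y}_j - GM) \sum_i (Y_{ij} - \bar{Y}_j) \\
  &= \sum_j \sum_i (Y_{ij} - \bar{Y}_j)^2
   + \sum_j n_j (\bar{Y}_j - GM)^2 ,
\end{align*}
% The last step uses the fact that deviations around a group's own mean sum to zero:
% \sum_i (Y_{ij} - \bar{Y}_j) = 0 for every group j.
```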

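  9. This is the deviation form of basic ANOVA. Each of these terms is a sum of squares (SS). The average of the total sum of squares is the total variance in the set of scores, ignoring group membership. The term built from deviations of scores around their own group means is called the sum of squares within groups. The term built from deviations of group means around the grand mean is called the SS between groups. This partition is frequently symbolized as $SS_{total} = SS_{between} + SS_{within}$.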
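
The partition of the total sum of squares can be verified directly. This is a minimal sketch on hypothetical scores (an assumption, since the slides' data table is not in the transcript); the variable names are illustrative only.

```python
def mean(xs):
    return sum(xs) / len(xs)


# Hypothetical scores (assumption): two groups of 5 cases each.
groups = {"a1": [9, 7, 6, 7, 6], "a2": [3, 1, 3, 4, 4]}

all_scores = [y for ys in groups.values() for y in ys]
gm = mean(all_scores)  # grand mean

# Total: deviations of every score around the grand mean.
ss_total = sum((y - gm) ** 2 for y in all_scores)
# Within groups: deviations of scores around their own group mean.
ss_within = sum((y - mean(ys)) ** 2 for ys in groups.values() for y in ys)
# Between groups: deviations of group means around the grand mean, once per case.
ss_between = sum(len(ys) * (mean(ys) - gm) ** 2 for ys in groups.values())
# Cross-product term, which should be zero.
cross = sum((y - mean(ys)) * (mean(ys) - gm) for ys in groups.values() for y in ys)

print("SS total:  ", ss_total)
print("SS between:", ss_between)
print("SS within: ", ss_within)
print("cross term:", cross)
```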

  10. At this point it is important to realize that the total variance in the set of scores is partitioned into two sources. One is the effect of the IV and the other is all remaining effects (which we call error). Because the effects of the IV are assessed by changes in the central tendencies of the groups, the inferences that come from ANOVA are about differences in central tendency. Before going further it may be a nice exercise to think about alternative measures of central tendency. Can we alter the formula of variance by replacing the mean with something else? Why is the mean preferred? However, sums of squares are not yet variances. To become variances, they must be 'averaged'. The denominators for averaging SS must be degrees of freedom so that the statistics will have a proper distribution (remember previous slides).
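
As one possible starting point for the exercise, the sketch below checks numerically that the sum of squared deviations is smallest when deviations are taken around the mean. The scores and the grid of candidate centres are assumptions chosen for illustration; this is not a full answer to the question.

```python
from statistics import mean

# Hypothetical scores (assumption).
scores = [9, 7, 6, 7, 6, 3, 1, 3, 4, 4]


def sum_sq(centre):
    """Sum of squared deviations of the scores around an arbitrary centre."""
    return sum((y - centre) ** 2 for y in scores)


# Try candidate centres 0.0, 0.1, ..., 10.0 and keep the one with the smallest sum.
candidates = [c / 10 for c in range(0, 101)]
best_centre = min(candidates, key=sum_sq)

print("mean of scores:", mean(scores))
print("centre minimising the sum of squares:", best_centre)
print("sum of squares at that centre:", sum_sq(best_centre))
```

Any other centre $c$ adds $N(c - \bar{Y})^2$ to the sum, which is why the minimum sits at the mean.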

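  11. So far we know that the degrees of freedom of $SS_{total}$ must be $N-1$. Furthermore, $df_{between} = k-1$, where $k$ is the number of groups. Also, $df_{within} = N-k$. Thus we have (as expected) $N-1 = (k-1) + (N-k)$.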

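  12. Variance is an 'averaged' sum of squares (for empirical data of course). Then, to obtain the mean sum of squares (MS), divide each SS by its degrees of freedom: $MS = SS/df$. The F distribution is the sampling distribution of the ratio of two independent $\chi^2$ variables, each divided by its degrees of freedom. The statistic $F = MS_{between}/MS_{within}$ is used to test the null hypothesis that all group means are equal, $\mu_1 = \mu_2 = \dots = \mu_k$.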

  13. Source table for basic ANOVA
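
A source table for a one-way design can be assembled from the sums of squares and their degrees of freedom. The function name `source_table` and the scores are assumptions of this sketch, not part of the original slides.

```python
def mean(xs):
    return sum(xs) / len(xs)


def source_table(groups):
    """Return SS, df, MS and F for each source of a one-way ANOVA."""
    all_scores = [y for ys in groups.values() for y in ys]
    gm = mean(all_scores)
    k, n_total = len(groups), len(all_scores)
    ss_between = sum(len(ys) * (mean(ys) - gm) ** 2 for ys in groups.values())
    ss_within = sum((y - mean(ys)) ** 2 for ys in groups.values() for y in ys)
    df_between, df_within = k - 1, n_total - k
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return {
        "between": (ss_between, df_between, ms_between, ms_between / ms_within),
        "within": (ss_within, df_within, ms_within, None),
        "total": (ss_between + ss_within, n_total - 1, None, None),
    }


# Hypothetical scores (assumption): two groups of 5 cases each.
table = source_table({"a1": [9, 7, 6, 7, 6], "a2": [3, 1, 3, 4, 4]})

print(f"{'Source':<10}{'SS':>8}{'df':>5}{'MS':>8}{'F':>8}")
for source, (ss, df, ms, f) in table.items():
    ms_text = "" if ms is None else f"{ms:.2f}"
    f_text = "" if f is None else f"{f:.2f}"
    print(f"{source:<10}{ss:>8.2f}{df:>5}{ms_text:>8}{f_text:>8}")
```

For this data the within-groups mean square equals the average of the two group variances, which is what one would expect with equal group sizes.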

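  14. Distribution of SSE, SST. With $k$ groups, $N$ cases in total, and independent normal errors with common variance $\sigma^2$: the sum of squares for error is $SSE = \sum_j \sum_i (Y_{ij} - \bar{Y}_j)^2$, with $SSE/\sigma^2 \sim \chi^2_{N-k}$; the sum of squares for treatment is $SST = \sum_j n_j (\bar{Y}_j - GM)^2$, with $SST/\sigma^2 \sim \chi^2_{k-1}$ under the null hypothesis.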

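  15. What is the expected value of F under the null? Suppose $H_0$ is true. Then $\mu_1 = \mu_2 = \dots = \mu_k = \mu$. So under $H_0$ the scores $Y_{ij}$ are i.i.d. Then the variance of all $N$ scores, $SS_{total}/(N-1)$, is an unbiased estimator of $\sigma^2$. Observe that $E[SS_{total}] = (N-1)\sigma^2$.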

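  16. Under the null, $E[SS_{total}] = (N-1)\sigma^2$. Remember that $SS_{total} = SST + SSE$ and that $E[SSE] = (N-k)\sigma^2$, where SST is the sum of squares for treatment, SSE is the sum of squares for error, $k$ is the number of groups, and $N$ is the number of cases. Then under the null, $E[SST] = (N-1)\sigma^2 - (N-k)\sigma^2 = (k-1)\sigma^2$, so $E[MST] = E[MSE] = \sigma^2$.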
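
The behaviour of the two mean squares under the null can be checked by simulation. This is a minimal sketch assuming normal errors, 2 groups of 5 cases, a common mean of 5 and $\sigma = 1$; all function names and parameter values are illustrative choices, not taken from the slides.

```python
import random


def mean(xs):
    return sum(xs) / len(xs)


def mean_squares(groups):
    """Return (MS treatment, MS error) for a list of groups of scores."""
    all_scores = [y for ys in groups for y in ys]
    gm = mean(all_scores)
    k, n_total = len(groups), len(all_scores)
    ss_treat = sum(len(ys) * (mean(ys) - gm) ** 2 for ys in groups)
    ss_error = sum((y - mean(ys)) ** 2 for ys in groups for y in ys)
    return ss_treat / (k - 1), ss_error / (n_total - k)


def simulate_null(reps=20000, k=2, n=5, mu=5.0, sigma=1.0, seed=0):
    """Average MS treatment and MS error over many data sets drawn under the null."""
    rng = random.Random(seed)
    mst_sum = mse_sum = 0.0
    for _ in range(reps):
        # Every group shares the same mean, so the null hypothesis holds.
        groups = [[rng.gauss(mu, sigma) for _ in range(n)] for _ in range(k)]
        mst, mse = mean_squares(groups)
        mst_sum += mst
        mse_sum += mse
    return mst_sum / reps, mse_sum / reps


avg_mst, avg_mse = simulate_null()
print("average MS treatment:", round(avg_mst, 3))
print("average MS error:    ", round(avg_mse, 3))
```

Both averages should land close to $\sigma^2 = 1$. The average of the ratio itself is slightly above 1 for small samples, since the mean of an F distribution is $df_{error}/(df_{error} - 2)$.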

  17. Therefore, under $H_0$, F must be around 1. It is sensitive to deviations from the null and can measure evidence against $H_0$. One more note before we proceed. Why did we not try out the t-test initially? What was wrong with it in this setup?
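
One fact that may help with the closing question: with exactly two groups, the F statistic equals the square of the pooled-variance t statistic, so the two tests agree in that case. The sketch below checks this on hypothetical scores (an assumption); it does not address designs with three or more groups, where a single t-test no longer covers all group differences.

```python
from math import sqrt
from statistics import mean, variance

# Hypothetical scores (assumption): two groups of 5 cases each.
a1 = [9, 7, 6, 7, 6]
a2 = [3, 1, 3, 4, 4]
n1, n2 = len(a1), len(a2)

# Pooled-variance two-sample t statistic.
pooled_var = ((n1 - 1) * variance(a1) + (n2 - 1) * variance(a2)) / (n1 + n2 - 2)
t_stat = (mean(a1) - mean(a2)) / sqrt(pooled_var * (1 / n1 + 1 / n2))

# One-way ANOVA F statistic for the same two groups.
gm = mean(a1 + a2)
ms_between = (n1 * (mean(a1) - gm) ** 2 + n2 * (mean(a2) - gm) ** 2) / (2 - 1)
ms_within = pooled_var  # with two groups, MS within is the pooled variance
f_stat = ms_between / ms_within

print("t squared:", round(t_stat ** 2, 4))
print("F:        ", round(f_stat, 4))
```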
