Experimentos Fatoriais do tipo 2 k

Experimentos Fatoriais do tipo 2k Capítulo 6

6.5 Unreplicated 2kFactorial Designs • These are 2k factorial designs with oneobservation at each corner of the “cube” • An unreplicated 2k factorial design is also sometimes called a “singlereplicate” of the 2k • These designs are very widely used • Risks…if there is only one observation at each corner, is there a chance of unusual response observations spoiling the results? • Modeling “noise”?

Spacing of Factor Levels in the Unreplicated 2kFactorial Designs If the factors are spaced too closely, it increases the chances that the noise will overwhelm the signal in the data More aggressive spacing is usually best

Unreplicated 2kFactorial Designs • Lack of replication causes potential problems in statistical testing • Replication admits an estimate of “pure error” (a better phrase is an internal estimate of error) • With no replication, fitting the full model results in zero degrees of freedom for error • Potential solutions to this problem • Pooling high-order interactions to estimate error • Normal probability plotting of effects (Daniels, 1959) • Other methods…see text

Example of an Unreplicated 2k Design • A 24 factorial was used to investigate the effects of four factors on the filtration rate of a resin • The factors are A = temperature, B = pressure, C = mole ratio, D= stirring rate • Experiment was performed in a pilot plant

The Resin Plant Experiment

dados=read.table("e:\\dox\\pfat2a4sr.txt",header=T) A=as.factor(dados$A) B=as.factor(dados$B) C=as.factor(dados$C) D=as.factor(dados$D) modeloC=dados$y~A+B+C+D+A:B+A:C+A:D+B:C+B:D+C:D+A:B:C+A:B:D+A:C:D+B:C:D+A:B:C:D fitC=aov(modeloC) summary(fitC) Df Sum Sq Mean Sq A 1 1870.56 1870.56 B 1 39.06 39.06 C 1 390.06 390.06 D 1 855.56 855.56 A:B 1 0.06 0.06 A:C 1 1314.06 1314.06 A:D 1 1105.56 1105.56 B:C 1 22.56 22.56 B:D 1 0.56 0.56 C:D 1 5.06 5.06 A:B:C 1 14.06 14.06 A:B:D 1 68.06 68.06 A:C:D 1 10.56 10.56 B:C:D 1 27.56 27.56 A:B:C:D 1 7.56 7.56

Estimates of the Effects

The Half-Normal Probability Plot of Effects

Design Projection: ANOVA Summary for the Model as a 23 in Factors A, C, and D

modeloP=dados$y~A+C+D+A:C+A:D+C:D+A:C:D fitP=aov(modeloP) summary(fitP) Df Sum Sq Mean Sq F value Pr(>F) A 1 1870.56 1870.56 83.3677 1.667e-05 *** C 1 390.06 390.06 17.3844 0.0031244 ** D 1 855.56 855.56 38.1309 0.0002666 *** A:C 1 1314.06 1314.06 58.5655 6.001e-05 *** A:D 1 1105.56 1105.56 49.2730 0.0001105 *** C:D 1 5.06 5.06 0.2256 0.6474830 A:C:D 1 10.56 10.56 0.4708 0.5120321 Residuals 8 179.50 22.44 --- Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

The Regression Model

regP=lm(dados$y~dados$A+dados$C+dados$D+dados$A*dados$C+dados$A*dados$D)regP=lm(dados$y~dados$A+dados$C+dados$D+dados$A*dados$C+dados$A*dados$D) summary(regP) Residuals: Min 1Q Median 3Q Max -6.3750 -1.5000 0.0625 2.9063 5.7500 Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) 70.062 1.104 63.444 2.30e-14 *** dados$A 10.812 1.104 9.791 1.93e-06 *** dados$C 4.938 1.104 4.471 0.00120 ** dados$D 7.313 1.104 6.622 5.92e-05 *** dados$A:dados$C -9.062 1.104 -8.206 9.41e-06 *** dados$A:dados$D 8.312 1.104 7.527 2.00e-05 *** --- Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 Residual standard error: 4.417 on 10 degrees of freedom Multiple R-squared: 0.966, Adjusted R-squared: 0.9489 F-statistic: 56.74 on 5 and 10 DF, p-value: 5.14e-07

Model Residuals are Satisfactory

Model Interpretation – Main Effects and Interactions

Outliers: suppose that cd = 375 (instead of 75)

Dealing with Outliers • Replace with an estimate • Make the highest-order interaction zero • In this case, estimate cd such that ABCD = 0 • Analyze only the data you have • Now the design isn’t orthogonal • Consequences?

The Drilling Experiment Example 6.3 A = drill load, B = flow, C = speed, D = type of mud, y = advance rate of the drill

Normal Probability Plot of Effects –The Drilling Experiment

Residual Plots

Residual Plots • The residual plots indicate that there are problems with the equality of variance assumption • The usual approach to this problem is to employ a transformation on the response • Power family transformations are widely used Transformations are typically performed to • Stabilize variance • Induce at least approximate normality • Simplify the model

Selecting a Transformation • Empirical selection of lambda • Prior (theoretical) knowledge or experience can often suggest the form of a transformation • Analytical selection of lambda…the Box-Cox (1964) method (simultaneously estimates the model parameters and the transformation parameter lambda) • Box-Cox method implemented in Design-Expert

(15.1)

The Box-Cox Method A log transformation is recommended The procedure provides a confidenceinterval on the transformation parameter lambda If unity is included in the confidence interval, no transformation would be needed

Effect Estimates Following the Log Transformation Three main effects are large No indication of large interaction effects What happened to the interactions?

ANOVA Following the Log Transformation

Following the Log Transformation

The Log Advance Rate Model • Is the log model “better”? • We would generally prefer a simpler model in a transformed scale to a more complicated model in the original metric • What happened to the interactions? • Sometimes transformations provide insight into the underlying mechanism

Other Examples of Unreplicated 2k Designs • The sidewall panel experiment (Example 6.4, pg. 245) • Two factors affect the mean number of defects • A third factor affects variability • Residual plots were useful in identifying the dispersion effect • The oxidation furnace experiment (Example 6.5, pg. 245) • Replicates versus repeat (or duplicate) observations? • Modeling within-run variability

Por que trabalha-se com as variáveis de planejamento codificadas? EXEMPLO

Na análise com as variáveis codificadas, as magnitudes dos coeficientes do modelo são diretamente comparáveis, isto é, elas são adimensionais, e medem os efeitos da variação de cada fator de planejamento sobre um intervalo unitário. • Além disso, são todas estimadas com a mesma precisão. • Variáveis codificadas são muito efetivas para determinar o tamanho relativo dos efeitos dos fatores.

Em geral, os coeficientes obtidos usando-se as unidades originais não são diretamente comparáveis, mas eles podem ter significado físico. • Em quase todas as situações, a análise codificada é preferível.

Experimentos Fatoriais do tipo 2 k

Experimentos Fatoriais do tipo 2 k

Presentation Transcript

Experimentos Fatoriais do tipo 2 k

GLICOGENOSI TIPO 2:

Clave Diabetes Tipo 2