Topic 30: Random Effects

Topic 30: Random Effects

Outline • One-way random effects model • Data • Model • Inference

Data for one-way random effects model • Y, the response variable • Factor with levels i = 1 to r • Yij is the jth observation in cell i, j = 1 to ni • Almost identical model structure to earlier one-way ANOVA. • Difference in level of inference

Level of Inference • In one-way ANOVA, interest was in comparing the factor level means • In random effects scenario, interest is in the pop of factor level means, not just the means of the r study levels • Need to make assumptions about population distribution • Will take “random” draw from pop of factor levels for use in study

KNNL Example • KNNL p 1036 • Y is the rating of a job applicant • Factor A represents five different personnel interviewers (officers), r=5 • n=4 differentapplicants were interviewed by each officer • The interviewers were selected at random and the applicants were randomly assigned to interviewers

Read and check the data data a1; infile 'c:\...\CH25TA01.DAT'; input rating officer; proc print data=a1; run;

The data Obs rating officer 1 76 1 2 65 1 3 85 1 4 74 1 5 59 2 6 75 2 7 81 2 8 67 2 9 49 3 10 63 3

The data Obs rating officer 11 61 3 12 46 3 13 74 4 14 71 4 15 85 4 16 89 4 17 66 5 18 84 5 19 80 5 20 79 5

Plot the data title1 'Plot of the data'; symbol1 v=circle i=none c=black; proc gplot data=a1; plot rating*officer/frame; run;

Find and plot the means proc means data=a1; output out=a2 mean=avrate; var rating; by officer; title1 'Plot of the means'; symbol1 v=circle i=join c=black; proc gplot data=a2; plot avrate*officer/frame; run;

Random effects model Key difference • Yij = μi + εij • the μi are iid N(μ, σμ2) • the εij are iid N(0, σ2) • μi and εij are independent • Yij ~ N(μ, σμ2 + σ2) • Two sources of variation • Observations with the same i are not independent, covariance is σμ2

Random effects model • This model is also called • Model II ANOVA • A variance components model • The components of variance are σμ2 and σ2 • The models that we previously studied are called fixed effects models

Random factor effects model • Yij = μ + i + εij • i ~ N(0, σμ2) ***** • εij ~ N(0, σ2) • Yij ~ N(μ, σμ2 + σ2)

Parameters • There are three parameters in these models • μ • σμ2 • σ2 • The cell means (or factor levels) are random variables, not parameters • Inference focuses on these variances

Primary Hypothesis • Want to know if H0: σμ2 = 0 • This implies all mi in model are equal but also all mi not selected for analysis are also equal. • Thus scope is broader than fixed effects case • Need the factor levels of the study to be “representative” of the population

Alternative Hypothesis • We are sometimes interested in estimating σμ2 /(σμ2 +σ2) • This is the same as σμ2 /σY2 • In some applications it is called the intraclass correlation coefficient • It is the correlation between two observations with the same I • Also percent of total variation of Y

ANOVA table • The terms and layout of the anova table are the same as what we used for the fixed effects model • The expected mean squares (EMS) are different because of the additional random effects but F test statistics are the same • Be wary that hypotheses being tested are different

EMS and parameter estimates • E(MSA) = σ2 + nσμ2 • E(MSE) = σ2 • We use MSE to estimate σ2 • Can use (MSA – MSE)/n to estimate σμ2 • Question: Why might it we want an alternative estimate for σμ2?

Main Hypotheses • H0: σμ2 = 0 • H1: σμ2 ≠ 0 • Test statistic is F = MSA/MSE with r-1 and r(n-1) degrees of freedom, reject when F is large, report the P-value

Run proc glm proc glm data=a1; class officer; model rating=officer; random officer/test; run;

Model and error output Source DF MS F P Model 4 394 5.39 0.0068 Error 15 73 Total 19

Random statement output Source Type III Expected MS officer Var(Error) + 4 Var(officer)

Proc varcomp proc varcomp data=a1; class officer; model rating=officer; run;

Output MIVQUE(0) Estimates Variance Component rating Var(officer) 80.41042 Var(Error) 73.28333 Other methods are available for estimation, minque is the default

Proc mixed proc mixed data=a1 cl; class officer; model rating=; random officer/vcorr; run;

Output Covariance Parameter Estimates Cov Parm Est Lower Upper officer 80.4 24.4 1498 Residual 73.2 39.9 175 80.4104/(80.4104+73.2833)=.5232

Output from vcorr Row Col1 Col2 Col3 Col4 1 1.0000 0.5232 0.5232 0.5232 2 0.5232 1.0000 0.5232 0.5232 3 0.5232 0.5232 1.0000 0.5232 4 0.5232 0.5232 0.5232 1.0000

Other topics • Estimate and CI for μ, p1038 • Standard error involves a combination of two variances • Use MSA instead of MSE → r-1 df • Estimate and CI for σμ2 /(σμ2 +σ2), p1040 • CIs for σμ2 andσ2, p1041-1047 • Available using Proc Mixed

Applications • In the KNNL example we would like σμ2 /(σμ2 +σ2) to be small, indicating that the variance due to interviewer is small relative to the variance due to applicants • In many other examples we would like this quantity to be large, • e.g., Are partners more likely to be similar in sociability?

Last slide • Start reading KNNL Chapter 25 • We used program topic30.sas to generate the output for today

Topic 30: Random Effects

Topic 30: Random Effects

Presentation Transcript

Random Variables

Environmental Effects on Autumn Tree Color

Guitar Effects Processor Using DSP

While Loop in Alice

Jointly distributed Random variables

Chapter 5:

Continuous Random Variables

Continuous Random Variables and Probability Distributions

Topic 1

Discrete Random Variables and Probability Distributions

Multi-Q Introduction

Multi-Q Introduction

Quiz Show Review

Chapter 5. Joint Probability Distributions and Random Sample

AP STATISTICS EXAM REVIEW

The Overall Agenda

Chapter 4-1 Continuous Random Variables

Topic 12: Electromagnetic induction

SIDE EFFECTS AND TOXICITY

Markov Random Fields