1 / 21

Biostatistics Case Studies 2005

Biostatistics Case Studies 2005. Session 4: Taking Risks and Playing the Odds: OR vs. RR. Peter D. Christenson Biostatistician http://gcrc.humc.edu/Biostat. Case Study. What's the Relative Risk? A Method of Correcting the Odds Ratio in Cohort Studies of Common Outcomes

Download Presentation

Biostatistics Case Studies 2005

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Biostatistics Case Studies 2005 Session 4: Taking Risks and Playing the Odds: OR vs. RR Peter D. Christenson Biostatistician http://gcrc.humc.edu/Biostat

  2. Case Study What's the Relative Risk? A Method of Correcting the Odds Ratio in Cohort Studies of Common Outcomes Jun Zhang, MB, PhD; Kai F. Yu, PhD JAMA. 1998;280:1690-1691. ABSTRACT Logistic regression is used frequently in cohort studies andclinical trials. When the incidence of an outcome of interestis common in the study population (>10%), the adjusted oddsratio derived from the logistic regression can no longer approximatethe risk ratio. The more frequent the outcome, the more theodds ratio overestimates the risk ratio when it is more than1 or underestimates it when it is less than 1. We propose asimple method to approximate a risk ratio from the adjustedodds ratio and derive an estimate of an association or treatmenteffect that better represents the true relative risk

  3. Further OR to RR References • McNutt LA, Wu C, Xue X, et. al. Estimating the relative risk in cohort studies and clinical trials of common outcomes. American Journal of Epidemiology 2003 ; 157: 940-3. • Greenland S. Model-based estimation of relative risks and other epidemiologic measures in studies of common outcomes and in case-control studies. American Journal of Epidemiology 2004 ; 160: 301-5.

  4. Goals • Why is RR vs. OR an issue? • Examine the solution given in our case study. • Point out difficulties with this solution. • Suggest other solutions.

  5. Why Use Odds Ratio at All? • In case-control studies, cannot measure RR. • This is due to selection of controls. Example: Risk Factor Cases Controls1 Controls2 + 90 60 600 - 1040400 100 100 1000 Ratio of (90/150) (90/690) Percents /(10/50) /(10/410) = 3.0 = 5.3 Odds [(90/150)/(60/150)] [(90/690)/(600/690)] Ratio /[(10/50)/(40/50)] /[(10/410)/(400/410)] = 6.0 = 6.0

  6. Cohort Study Example Risk Factor Diseased + 50/100 = 50% • 30/100 = 30% RR = 50%/30% = 1.67. OR = (50%/50%)/(30%/70%) = 2.33

  7. Cohort Study Example – Adjust for Gender Strata Risk Factor Diseased Male + 50/100 = 50% • 30/100 = 30% Female + 80/100 = 80% • 40/120 = 33% RR = (Male Weight)(50%/30%) + (Female Weight)(80%/33%) = (200/420)1.67 + (220/420)2.40 = 2.06 Other weights may be used.

  8. Need for Additional Adjustment Limitations of stratification: • Only a few covariates can be adjusted. • Covariates need to be categorical, or made so. Regression adjustment: • Allows more covariates (up to ~ 10 subjects per parameter). • Allows continuous covariates. • Logistic, log-linear, poisson regression.

  9. Logistic Regression Model: Log[Odds of disease] = Log[ Prob(disease) / Prob(non-disease) ] = function( exposure, covariates, interactions) Thus, only functions of the odds can be estimated, e.g., antilog of Log[Odds of disease] for exposed minus Log[Odds of disease] for unexposed, i.e., odds ratio (OR). E.g., if log(odds) = 10.3 + 0.8(exposed) + 0.2(covariate), then OR = exp(0.8) = 2.23, if exposed =1 or 0 for Yes or No

  10. RR is Preferred To obtain RR rather than OR, we can: • Convert OR from logistic regression to RR, or • Use a model other than logistic that also fits the data. We consider (2) later (down 6 slides). The solution proposed in our case study is to apply (1). Their solution is displayed on the next slide.

  11. Case Study, Page 1691, 1st column In a cohort study, P0 indicates the incidence of the outcomeof interest in the nonexposed group and P1 in the exposed group; OR, odds ratio; and RR, risk ratio: OR=(P1/(1-P1))/(P0/(1-P0)); thus, (P1/P0)=OR/[(1-P0)+(P0xOR)]. Since RR=P1/P0, the corrected

  12. Table from Case Study The authors perform a simulation, setting the RR to be constant over all strata of the covariates, and show that their conversion does correct the logistic OR to a RR close to the stratified (M-H) RR which is correct here, with categorical strata:

  13. Difficulties with the OR to RR Conversion • The P0 is assumed to be fixed and known, so confidence intervals for the RR are too narrow. • The formula is only valid if RR is constant over all covariate patterns, or is used for one particular set of covariates. Thus, it does not do the intended job of adjustment, i.e., account for confounding.

  14. Breast Cancer Example • See table on next slide. • Outcome: 5-year mortality. • Predictor: receptor level, low vs. high; low is suspected mortality risk factor. • Covariate: CA staging I, II, III. • Want: RR of death (low over high receptor), adjusted for stage. • For this study, do not need to model; can use observed death rates. Thus, we can compare the logistic OR to RR results with actual RRs. • Since the RRs increase with stage, the OR to RR conversion does not perform well.

  15. Greenland (2004) Comparison of RRs RR by Strata: 1.84 1.78 1.43 2/12 Adjusted RR 1.65 1.89* 1.56 1.63 * Using Zhang & Yu: OR=2.51 and p0=(5+17+9)/ (55+74+15)=0.215: Converted RR= 2.51/[(1-.215) + (0.215*2.51)]=1.89

  16. Alternative #1 to Zhang & Yu Conversion • The OR to RR conversion over-estimates RR. • An alternative is to find Prob[death] for a low receptor and for a high receptor population, using the distribution of staging (“Standardized RRs”), and take the ratio of these probabilities. • These probabilities can be found from the logistic equation: E.g., if log(p/(1-p)) = 10.3 + 0.8(exposed) + 0.2(covariate) = u, then p=eu/(1+eu), from algebra, where p = prob (death). • In fact, these probabilities are given in the row for logistic in the previous table. We now find the standardized risk ratio using these probabilities:

  17. Alternative #1 continued • If all women were at the low receptor level, the standardized (to the staging distribution) risk of death is: 0.190(0.349) + 0.422(0.500) + 0.816(0.151) = 0.401 • Similarly, at the high receptor level, 0.086(0.349) + 0.226(0.500) + 0.639(0.151) = 0.239 • Thus, the standardized RR is 0.401/0.239 = 1.68, which is close to the adjusted RR from the observed death rates. • Note that the weights (0.349, 0.500, 0.151) are the relative proportions of women at each stage, e.g., 0.349=(12+55)/(12+55+22+74+14+15).

  18. Alternative #2 to Zhang & Yu Conversion • Use a model other than the logistic, provided that it fits as well. • The Greenland(2004) table used two other models, log-linear (binomial) and Poisson regression. • The log-linear binomial fits log(prob(death), compared to the logistic that fits log(odds): Logistic: log(p/(1-p)) = 10.3 + 0.8(exposed) + 0.2(covariate) Log-linear: log(p) = 7.1 + 0.6(exposed) + 0.1(covariate) (made-up numbers) • From the log-linear model, adjusted RR is found directly as antilog(0.6) = 1.82. This avoids the unwanted OR entirely.

  19. SAS Code for Log-Linear Model of Breast Cancer data ca; input receptor stage death survive total; datalines; 0 1 2 10 12 1 1 5 50 55 0 2 9 13 22 1 2 17 57 74 0 3 12 2 14 1 3 9 6 15 ; proc genmod; class receptor stage; model death/total = receptor stage/dist=binomial link=log; estimate 'r1' intercept 1 receptor 1 0 stage 1 0 0/exp; estimate 'r2' intercept 1 receptor 0 1 stage 1 0 0/exp; estimate 'r3' intercept 1 receptor 1 0 stage 0 1 0/exp; estimate 'r4' intercept 1 receptor 0 1 stage 0 1 0/exp; estimate 'r5' intercept 1 receptor 1 0 stage 0 0 1/exp; estimate 'r6' intercept 1 receptor 0 1 stage 0 0 1/exp; estimate 'rr' receptor 1 -1 /exp; run; *r1-r6 are in the Greenland(2004) table;

  20. SAS Code for Logistic Model of Breast Cancer data ca; input receptor stage death survive total; datalines; 0 1 2 10 12 1 1 5 50 55 0 2 9 13 22 1 2 17 57 74 0 3 12 2 14 1 3 9 6 15 ; proc logistic data=ca; class receptor stage; model death/total = receptor stage/lackfit; output out=out1 pred=predicted l=lower u=upper; run; proc print data=out1; run; *values for predicted are in the Greenland(2004) table;

  21. Conclusions • The JAMA OR to RR conversion is faulty. • Can use logistic regression, and find standardized risks, then take ratio to get RR. • Can use log-linear models that model risk directly, rather than odds. • Should check whether any model adequately fits the data. Often, several do. • The major advantage of the log-linear model is that confidence intervals for the adjusted RR are much easier. I know of no software that gives CIs for the standardized RRs.

More Related