Sensitivity Analysis of Randomized Trials with Missing Data

Sensitivity Analysis of Randomized Trials with Missing Data Daniel Scharfstein Department of Biostatistics Johns Hopkins University dscharf@jhsph.edu

ACTG 175 • ACTG 175 was a randomized, double bind trial designed to evaluate nucleoside monotherapy vs. combination therapy in HIV+ individuals with CD4 counts between 200 and 500. • Participants were randomized to one of four treatments: AZT, AZT+ddI, AZT+ddC, ddI • CD4 counts were scheduled at baseline, week 8, and then every 12 weeks thereafter. • Additional baseline characteristics were also collected.

ACTG 175 • One goal of the investigators was to compare the treatment-specific means of CD4 count at week 56 had all subjects remained on their assigned treatment through that week. • The interest is efficacy rather than effectiveness. • We define a completer to be a subject who stays on therapy and is measured at week 56. Otherwise, the subject is called a drop-out. • 33.6% and 26.5% of subjects dropped out in the AZT+ddI and ddI arms, respectively.

ACTG 175 • Completers-only analysis

ACTG 175 • The completers-only means will be valid estimates if, within treatment groups, the completers and drop-outs are similar on measured and unmeasured characteristics. • Missing at random (MAR), with respect to treatment group. • Without incorporating additional information, the MAR assumption is untestable. • It is well known from other studies that, within treatment groups, drop-outs tend to be very different than completers.

Goal • Present a coherent paradigm for the presentation of results of clinical trials in which it is plausible that MAR fails (i.e., NMAR). • Sensitivity Analysis • Bayesian Analysis

Sensitivity Analysis Step 1: Models • For each treatment group, specify a set of models for the relationship between the distributions of the outcome for drop-outs and completers. • Index the treatment-specific models by an untestable parameter (alpha), where zero denotes MAR. • alpha is called a selection bias parameter and it indexes deviations from MAR. • Pattern-mixture model

Treatment-specific imputed distributions of CD4 count at week 56 for drop-outs

Sensitivity Analysis Step 1: Models • Selection model • The parameter alpha is interpreted as the log odds ratio of dropping out when comparing subjects whose log CD4 count at week 56 differs by 1 unit. • alpha>0 (<0) indicates that subjects with higher (lower) CD4 counts are more likely to drop-out. • alpha=0.5 (-0.5) implies that a 2-fold increase in CD4 count yields a 1.4 increase (0.7 decrease) in the odds of dropping out.

Sensitivity Analysis Step 2: Estimation • For a plausible range of alpha’s, estimate the treatment-specific means by taking a weighted average of the mean outcomes from the completers and drop-outs.

Treatment-specific estimated mean CD4 at week 56 as function of alpha

Sensitivity Analysis Step 3: Testing • Test the null hypothesis of no treatment effect as a function of treatment-specific selection bias parameters. • For each combination of the treatment-specific selection bias parameters, form a Z-statistic by taking the difference in the estimated means divided by the estimated standard error of the difference.

Sensitivity Analysis Step 3: Testing • If the selection bias parameters are correctly specified, this statistic is normal(0,1) under the null hypothesis. • Reject the null hypothesis at the 0.05 level if the absolute value of the Z-statistic is greater than 1.96.

Contour Plot of Z-statistic

Bayesian Analysis • Think of all model parameters as random. • Place prior distributions on these parameters. • Informative prior on alpha (e.g., normal with mean -0.5 and standard deviation 0.25). • Non-informative priors on all other parameters (e.g., the distribution of the outcome). • Results are summarized through posterior distributions.

Posterior Distributions 368 (342,391) 348 (330,365)

Posterior Distribution of Mean Difference 20 (-11,49); 91%

Likelihood-based Inference • A parametric model for the outcome and a parametric for the probability of being a completer given the outcome. • For example, the outcome is log normal. • Inference proceeds by maximum likelihood (ML). • ML inference can be well approximated using Bayesian machinery.

Maximum Likelihood Distributions -2.6 (-3.0,-2.1) 303 (278,331) 368 (342,391) -2.8 (-3.3,-2.2) 297 (271,324) 348 (330,365)

Maximum Likelihood Distribution of Mean Difference 7 (-31,44) 20 (-11,49)

Incorporating Auxiliary Information • MAR (with respect to all observable data) • Sensitivity analysis with respect alpha. • Bayesian methods under development. Longitudinal/Time-to-Event Data • Same underlying principles.

LOCF Bad idea • Imputing an unreasonable value. • Results may be conservative or anti-conservative. • Uncertainty is under-estimated.

Conjecture • There is information from previously conducted clinical studies to help in the analysis of the current trials. • Data from previous trials may be able to restrict the range of or estimate alpha.

Summary • We have presented a paradigm for reporting the results of clinical trials where missingness is plausibly related to outcomes. • We believe this approach provides a more honest characterization of the overall uncertainty, which stems from both sampling variability and lack of knowledge of the missingness mechanism.

dscharf@jhsph.edu • Scharfstein, Rotnitzky A, Robins JM, and Scharfstein DO: “Semiparametric Regression for Repeated Outcomes with Non-ignorable Non-response,” Journal of the American Statistical Association, 93, 1321-1339, 1998. • Scharfstein DO, Rotnitzky A, and Robins, JM.: “Adjusting for Non-ignorable Drop-out Using Semiparametric Non-response Models (with discussion),” Journal of the American Statistical Association, 94, 1096-1146, 1999. • Rotnitzky A, Scharfstein DO, Su TL, and Robins JM: “A Sensitivity Analysis Methodology for Randomized Trials with Potentially Non-ignorable Cause-Specific Censoring,” Biometrics, 57:30-113, 2001 • Scharfstein DO, Robin JM, Eddings W and Rotnitzky A: “Inference in Randomized Studies with Informative Censoring and Discrete Time-to-Event Endpoints,” Biometrics, 57: 404-413, 2001. • Scharfstein DO and Robins JM: “Estimation of the Failure Time Distribution in the Presence of Informative Right Censoring,” Biometrika 89:617-635, 2002. • Scharfstein DO, Daniels M, and Robins JM: “Incorporating Prior Beliefs About Selection Bias in the Analysis of Randomized Trials with Missing Data,” Biostatistics, 4: 495-512, 2003.

Sensitivity Analysis of Randomized Trials with Missing Data

Sensitivity Analysis of Randomized Trials with Missing Data

Presentation Transcript

Randomized controlled trials

Randomized Controlled Trials

Missing Data in Randomized Control Trials

Meta-analysis with missing data: metamiss

Sensitivity Analysis of Randomized Trials with Missing Data

Randomized Controlled Trials

Missing Data in Randomized Control Trials

Group-Randomized Trials

RANDOMIZED TRIALS

Randomized Control Trials

Survival Analysis for Randomized Clinical Trials

Missing Data in Clinical Trials

Comparing two strategies for primary analysis of longitudinal trials with missing data

Survival Analysis for Randomized Clinical Trials

Implementation of Randomized Trials

RANDOMIZED TRIALS

Survival Analysis for Randomized Clinical Trials

Treatment of Missing Data in Randomized Clinical Trials

Randomized Trials

Monitoring Randomized Trials

Missing Data in Randomized Control Trials

Randomized Control Trials