A n Analytic Road Map for Incomplete Longitudinal Clinical Trial Data

Craig Mallinckrodt ICSA June 4, 2007Raleigh, NC An Analytic Road Map for Incomplete Longitudinal Clinical Trial Data

Context • Road map ≠ driving directions • provides info about alternatives • framework to plan route, doesn’t choose it • Route to valid analyses of incomplete longitudinal data can be problematic • Many advances in missing data theory and implementation • We can do better than LOCF • Need framework to plan use of newer methods

Outline • Missing data mechanisms • Modeling philosophies • Example of developing an analytic road map • Example of implementing an analytic road map • Today’s focus is on missing data. Concepts easily extended to include modeling of time and correlation

Starting Point • No universally best method for analyzing longitudinal data • This implies analysis must be tailored to the specific situation at hand • Consider the desired attributes of the analysis and the characteristics of the data

Missing Data Mechanisms • MCAR - missing completely at random • Conditional on the model, neither observed or unobserved outcomes are related to dropout • MAR - missing at random • Conditional on the model, observed outcomes are related to dropout, but unobserved outcomes are not • MNAR - missing not at random • Unobserved outcomes are related to dropout

Missing Data Mechanisms • MCAR - missing completely at random • Conditional on independent variables in the model, neither observed or unobserved outcomes of the dependent variable are related to dropout • MAR - missing at random • Conditional on the independent variables in the model, observed outcomes of the dependent variable are related to dropout, but unobserved outcomes are not

Consequences • Missing data mechanism is a characteristic of the data AND the model • Differential dropout by treatment indicates covariate dependence, not mechanism • Mechanism can vary from one outcome to another • Terms like ignorable missingness and informative censoring must also consider the method • What is ignorable with likelihood-based analysis is non-ignorable with GEE

Analytic Approaches for Continuous Outcomes • MCAR • ANOVA, GEE • MAR • Likelihood-based mixed-effects models (LB-MEM), Multiple imputation (MI), Propensity scoring, Weighted analyses • MNAR • Selection models, Pattern mixture models, Shared parameter models, sensitivity analyses • Note that if missing values are imputed other assumptions also need to be considered

Modeling Philosophies • Restrictive modeling • Inclusive modeling

Restrictive Modeling • Simple models with few independentvariables • Often include only the design factors of theexperiment

Inclusive Modeling • In addition to the design factors of the experiment, models include auxiliary variables • Auxiliary variables included to improve performance of the missing data procedure – expand the scope of MAR • baseline covariates • time varying post-baseline covariates

Rationale For Inclusive Modeling • MAR: conditional on the dependent and independent variables in the analysis, unobserved values of the dependent variable are independent of dropout • Hence adding more variables that explain dropout can make missingness MAR that would otherwise be MNAR

Inclusive Modeling with LB-MEM • Add auxiliary variables to analysis model • Adding baseline covariates is straightforward • Adding post-baseline time varying covariates that are also influenced by treatment often dilute the treatment effect • Could conduct a multivariate analysis where the auxiliary variable is a second dependent variable

Inclusive Modeling with MI • Include auxiliary variables in the imputation model • Typically analyze complete data sets using a restrictive model, but could use the same inclusive model to analyze the data • Similar approaches can be used in propensity scoring and weighted analyses where a dropout model is used to develop propensity score bins or inverse probability of dropout weights, followed by a second analysis step

Developing An Analytic Road Map:Example From A Depression Trial • Confirmatory clinical trial of an antidepressant. Primary analysis should be simple and dependable • MAR with restrictive modeling as primary • Use MAR with inclusive modeling and MNAR methods as sensitivity analyses • Use local influence to investigate impact of influential patients

Why MAR? • Data in clinical trials are seldom MCAR because the observed outcomes typically influence dropout (lack of efficacy) • Trials are designed to observe all the relevant information, which minimizes MNAR data • Hence in the highly controlled scenario of clinical trials missing data may be mostly MAR

Why not MNAR? • Rubin (1994): “…, even inferences for the data parameters generally depend on the posited missingness mechanism, a fact that typically implies greatly increased sensitivity of inference…” • Laird (1994): “estimating the unestimable can be accomplished only by making modeling assumptions, The consequences of model misspecification will be more severe in the non-random case.” • Molenberghs, Kenward & Lesaffre (1997): “conclusions are conditional on the appropriateness of the assumed model, which in a fundamental sense is not testable.”

Why Restrictive Modeling? • Historically favor simple models, with impact of other factors addressed via secondary analyses • No strong a priori evidence of important auxiliary variables • Cost of including unnecessary variables? • Avoids any potential confounding of auxiliary variables with design factors (e.g., treatment)

Implementing The Road Map: Example From A Depression Trial • 259 patients, randomized 1:1 ratio to drug and placebo • Response: Change of HAMD17 score from baseline • 6 post-baseline visits (Weeks 1,2,3,5,7,9) • Primary objective: test the difference of mean change in HAMD17 total score between drug and placebo at the endpoint • Primary analysis: LB-MEM

Primary Analysis: LB-MEM proc mixed; class subject treatment time site; model Y = baseline treatment time site treatment*time ; repeated time / sub = subject type = un; lsmeans treatment*time / cl diff; run; This is a full multivariate model, with unstructured modeling of time and correlation. More parsimonious approaches may be useful in other scenarios Treatment contrast 2.17, p = .024

Patient Disposition • Drug Placebo • Protocol complete 60.9% 64.7% • Adverse event 12.5% 4.3% • Lack of efficacy 5.5% 13.7% • Differential rates, timing, and/or reasons for dropout do not necessarily distinguish between MCAR, MAR, MNAR

Inclusive Modeling in MI:Including Auxiliary AE Data • Imputation Models • *Yih = µ +1 Yi1 +…+ h-1 Yi(h-1) + ih • Yih = µ + 1 Yi1 +…+ h-1 Yi(h-1) + 1 AEi1 +…+ h-1 AEi(h-1) + ih • Yih= µ + 1 Yi1 +…+ h-1 Yi(h-1) + 1 AEi1 +…+ h-1 AEi(h-1) +11 (Yi1 *AEi1 ) + …+i(h-1) (Yi(h-1) * AEi(h-1) ) + ih • Analysis Model MMRM as previously described

Result • MI results were not sensitive to the different imputation models Endpoint contrastMMRM 2.2MI Y+AE 2.3MI Y+AE+Y*AE 2.1 • Including AE data might be important in other scenarios. Many ways to define AE

MNAR Modeling • Implement a selection model • Had to simplify model: modeled time as linear + quadratic, and used ar(1) correlation • Compare results from assuming MAR, MNAR • Also obtain local influence to assess impact of influential patients on treatment contrasts and non-random dropout

Selection Model Results

Local Influence: Influential Patients

Individual Profiles with Influential Patients Highlighted

Investigating The Influential Patients • The most influential patient was #30, a drug-treated patient that had the unusual profile of a big improvement but dropped out at week 1 • This patient was in his/her first MDD episode when s/he was enrolled • This patient dropped out based on his/her own decision claiming that the MDD was caused by high carbon monoxide level in his/her house • This patient was of dubious value for assessing the efficacy of the drug

Selection Model: Influential Patients Removed

Implications • Comforting that no subjects had a huge influence on results. Impact bigger if it were a smaller trial • Similar to other depression trials we have investigated, results not influenced by MNAR data • We can be confident in the primary result

Discussion • MAR with restrictive modeling was a reasonable choice for the primary analysis • MAR with inclusive modeling and MNAR was useful in assessing sensitivity • Sensitivity analyses promote the appropriate level of confidence in the primary result and lead us to an alternative analysis in which we can have the greatest possible confidence

Opinions • Inclusive modeling has been under utilized • More research to understand dropout would be useful • Did not discuss pros and cons of various ways to implement inclusive modeling. Use the one you know? Be careful to not dilute treatment • The road map for analyses used in the example data is specific to that scenario and is not intended to be a general prescription

Conclusions • No universally best method for analyzing longitudinal data • Analysis must be tailored to the specific situation at hand • We can do better than LOCF etc. • Considering the missingness mechanism and the modeling philosophy provides the framework in which to choose an appropriate primary analysis and appropriate sensitivity analyses

A n Analytic Road Map for Incomplete Longitudinal Clinical Trial Data

A n Analytic Road Map for Incomplete Longitudinal Clinical Trial Data

Presentation Transcript

Road Map

A n Analytic Road Map for Incomplete Longitudinal Clinical Trial Data

CLINICAL TRIAL

Road Map for Today

Clinical Trial

An Ontology for Clinical Trial Data Integration

Road Map for Today

A Road Map to

Data Management for Longitudinal Data

DATA FLOW FOR A CLINICAL TRIAL

Road Map for Today

Rescuing Clinical Trial Data For Economic Evaluation

Road Map for Transition

Robust Analysis of Incomplete Longitudinal Data in Clinical Trials

Road Map for Today

Road Map for Today

Road Map for Today

Road Map for Today

Road Map for Today

Road Map for Today

Road Map for Today