Statistical issues in the validation of surrogate endpoints

Statistical issues in the validation of surrogate endpoints Stuart G. Baker, Sc.D. sb16i@nih.gov

Surrogate endpoint: definition • used to make conclusions about the effect of intervention on true endpoint • obtained sooner, at less cost, or less invasively than the true endpoint

Outline • Asking the right questions • Hypothesis testing • General framework for validation and application • Graphical view • Estimation (meta-analytic) • General framework for validation and application • Trial-level statistics –graphical view • Predicted effect of intervention for binary surrogate and true endpoints (NEW approach) – graphical view • Caveats

Asking the right questions

Asking the right questions • Validation trial: Both surrogate and true endpoints are observed • QUESTION: Are the conclusions about the effect of intervention on true endpoint the same when based on (i) only surrogate endpoint (ii) only the true endpoint ? • Application trial: Surrogate but not true endpoint is observed • QUESTION: What is the effect of intervention on true endpoint?

Hypothesis testing

General Framework for Hypothesis Testing If Prentice Criteria hold, valid hypothesis testing using surrogate endpoint Surrogate endpoints True endpoints Validation trial Validation Test hypothesis using surrogate endpoint New trial Surrogate endpoint Application Valid hypothesis test: H0(T): no effect of intervention on true implies H0(S): no effect of intervention on surrogate so that reject H0(S) implies reject H0(T) Prentice Criteria; pr (true | surrogate) not depend on group + extra requirement (if binary: surrogate predicts true) (Buyse,Molenbergs, 1998)): easy to reject;hard to show they hold

Understanding hypothesis testing • Graphical illustration using • Binary surrogate endpoint • Binary true endpoint

Prentice Criterion “holds” Fraction with true endpoint Treatment A Validation Trial Treatment B Fraction with surrogate endpoint 0 0 Extrapolation: Prentice Criterion “holds” (but how close is close enough?) Fraction with true endpoint B A Application trial 0 Fraction with a surrogate endpoint A B 1 0 A=B for true implies A=B for surrogate

Prentice Criterion does not hold Fraction with true endpoint Treatment A Validation Trial Treatment B Fraction with surrogate endpoint 0 0 Fraction with true endpoint Extrapolation: same lines Hypothesis testing gives incorrect conclusion A B Application trial Fraction with a surrogate endpoint 0 A B 1 0 A=B for true does not imply A=B for surrogate

Estimation Meta-analytic (based on multiple previous trials)

General Framework for Estimation Surrogate endpoints True endpoints Previous trials Predicted effect of intervention on true endpoint model Validation (similar confidence intervals) Surrogate endpoint Validation trial Observed effect of intervention on true endpoint True endpoint Application trial Predicted effect of intervention on true endpoint Surrogate endpoint Application

Meta-analytic methods of estimation • Trial-level statistics • Buyse et al (2000); Gail et al (2000) • Estimated predicted effect of intervention on true endpoint • proposal for binary surrogate and true endpoints • simple computations

Focus • Binary surrogate endpoint • Binary true endpoint

DATA SCHEME Application trial Validation trial Previous trial 1 Previous trial 2 Previous trial 3

Meta-analysis of trial-level statistics Graphical overview of approach of Buyse et al (2000) and Gail et al (2000)

Trial-level meta-analysis (simplified overview) Regression: using random effects and within trial data Previous study 1 Fraction with true endpoint Previous study 2 Previous study 3 d 0 Fraction with surrogate endpoint A B 1 0

Meta-analysis of estimated predicted effects of intervention A new approach for binary surrogate and true endpoints

Predicted effect of intervention on true endpoint based on surrogatesA and B in new study and data from previous study 1 Fraction with true endpoint d1 0 Fraction with surrogate endpoint A B 1 0 Note: Lines for each group need not be identical—Prentice Criterion not needed

Predicted effect of intervention on true endpoint based on surrogatesA and B in new study and data from previous study 2 Fraction with true endpoint d2 d1 0 Fraction with surrogate endpoint A B 1 0 Note: Lines for each group need not be identical—Prentice Criterion not needed

Predicted effect of intervention on true endpoint based on surrogatesA and B in new study and data from previous study 3 Fraction with true endpoint d2 d1 d3 0 Fraction with surrogate endpoint A B 1 0 d=(d1w1+ d2w2+ d3w3)/(w1+w2+w3 )

Meta-analysis of estimated predicted treatment effects: d1, d2, d3 d= (d1 w1+d2 w2 + d3 w3)/(w1+w2+w3), • Weights wi are based on a random-effects model for di (with variance s2) • simpler than a random-effects for slopes • wi = 1 / (sampling variance of di+ s2 ) • Weights minimize variance of d if di are not correlated • simplification since di‘s are correlated

Meta-analysis computation • d= (d1 w1+d2 w2 + d3 w3)/(w1+w2+w3), where • di={fi0ApA+fi1A(1-pA)} - {fi0BpB+fi1B(1-pB)} • Application or validation trial: fraction with surrogate endpoint= pAandpB • Previous trials: fraction with true given surrogate endpoint: fi0A, fi1A, fi0B, fi1B • wi=1/(Vi + s2 ), Vi = sampling variance of (di) • To estimate s2 • adapt method of DerSimonian and Laird for usual meta-analysis accounting for covariance among di’s due to share parameters pAandpB • To compute variance of d • Bootstrap trials and data within trials

Meta-analysis simulation d= (d1 w1+d2 w2 + d3 w3)/(w1+w2+w3), where • di={fi0ApA+fi1A(1-pA)} - {fi0BpB+fi1B(1-pB)} • Simulation • Generate random fi0A, fi1A, fi0B, fi1B • Generate random data for each trial • Mean squared error • Slightly smaller for meta-analysis of predicted effect of intervention than for meta-analysis of trial-level statistics (computed via method-of-moments)

Hypothetical data: Example 1

Hypothetical data: Example 2

Real Data (x 10) from multicenter trial in Gail et al (2000): surrogate is cholesterol level, true endpoint is artery diameter

Caveats

Caveats • Needed even if surrogate is validated with data from many previous studies • Extrapolation to a new trial • Hypothesis testing • Estimation using data from previous trials • Surrogate endpoint does not predict harms that might arise after surrogate is observed

When caveats are less critical • Preliminary drug development when the surrogate endpoint is used to decide on further development or definitive testing with a true endpoint • Establishing dose or timing of an intervention previously shown effective using true endpoint at a different (suboptimal?) dose or timing

Summary

Types of trials • Validation trial: • Both surrogate and true endpoint • Do you obtain the same conclusion about effect of intervention on true endpoint using (i) surrogate endpoint and (ii) true endpoint? • Application trial: • Only surrogate endpoint • What is the effect of intervention on true endpoint?

Hypothesis testing • Not validated if reject Prentice’s criteria • Not clear what to conclude about surrogate if cannot reject Prentice’s criteria

Estimation (meta-analysis) • Not need Prentice’s criteria • Meta-analysis of trial-level statistic • Applicable to all types of endpoints • Meta-analysis of estimated predicted effect of intervention on true endpoint • Binary surrogate and true endpoints • Computationally simple • Slightly smaller MSE than with meta-analysis of trial-level statistics (in simulation)

Statistical issues in the validation of surrogate endpoints

Statistical issues in the validation of surrogate endpoints

Presentation Transcript

Statistical Issues in the Evaluation of Predictive Biomarkers

Transition of Biomarkers to Surrogate Endpoints: A Critical Path Initiative….. …..Introduction

Surrogate Endpoints

Statistical challenges in the validation of surrogate endpoints

Surrogate Outcomes and Handling Multiple Endpoints in Clinical Trials

Statistical Issues in Specification of D

Surrogate Endpoints in Infectious Diseases Trials: FDA Perspective

Statistical Issues in Clinical Studies Using Laboratory Endpoints

On Surrogate Endpoints in HIV Vaccine Efficacy Trials

Surrogate Endpoints and Non-randomized Trials

Cloud Validation: The issues

Surrogate Endpoints for Hepatitis B Trials

Economic evaluation of outcomes: long term primary and surrogate endpoints

The gap between biomarkers and surrogate endpoints Oncology

Statistical Issues in the Design of Microarray Experiment

Issues in the Validation of Battle Models

Statistical Validation of The Unified Cycle Theory

Cloud Validation: The issues

Surrogate Endpoints: A Regulatory View

Statistical Issues in Clinical Studies Using Laboratory Endpoints

Economic evaluation of outcomes: long term primary and surrogate endpoints

Statistical Issues