Hypothesis Testing and Dynamic Treatment Regimes

Hypothesis Testing and Dynamic Treatment Regimes S.A. Murphy, L. Gunter & B. Chakraborty ENAR March 2007

Outline • Dynamic treatment regimes • Constructing and addressing questions regarding an optimal dynamic treatment regime • Why and when non-regular? • A Solution • Simulation Results.

Dynamic treatment regimes are individually tailored treatments, with treatment type and dosage changing according to patient outcomes. Operationalize clinical practice. k Stages for one individual Observation available at jth stage Action at jth stage

k Stages History available at jth stage “Reward” following jth stage (rj is a known function) Primary Outcome:

Goal: Construct decision rules that input information in the history at each stage and output a recommended decision; these decision rules should lead to a maximal mean Y. The dynamic treatment regime is the sequence of decision rules:

In the future we employ the actions determined by the decision rules: An example of a simple decision rule is: alter treatment at time j if otherwise maintain on current treatment; Sj is a summary of the history, Hj.

Data for Constructing the Dynamic Treatment Regime: Subject data from sequential, multiple assignment, randomized trials. At each stage subjects are randomized among alternative options. Aj is a randomized action with known randomization probability. binary actions with P[Aj=1]=P[Aj=-1]=.5

Sequential, Multiple Assignment Randomized Studies • CATIE (2001) Treatment of Psychosis in Schizophrenia • STAR*D (2003) Treatment of Depression • Tummarello (1997) Treatment of Small Cell Lung Cancer (many, for many years, in this field) • Oslin (on-going) Treatment of Alcohol Dependence • Pellman (on-going) Treatment of ADHD

Constructing and Addressing Questions Regarding an Optimal Dynamic Treatment Regime

Regression-based methods for constructing decision rules • Q-Learning (Watkins, 1989) (a popular method from computer science) • A-Learning or optimal nested structural mean model (Murphy, 2003; Robins, 2004) • The first method is an inefficient version of the second method when each stages’ covariates include the prior stages’ covariates and the actions are centered to have conditional mean zero.

Dynamic Programming (k=2)

A Simple Version of Q-Learning –binary actions Approximate for S', S vector summaries of the history and • Stage 2 regression: Use least squares with outcome, Y, and covariates to obtain • Set • Stage 1 regression: Use least squares with outcome, and covariates to obtain

Decision Rules:

Why non-regular?

Non-regularity

When do we have non-regularity?

A Soft-Max Solution

Distributions for Soft-Max

Regularized Q-Learning (binary actions) • Set • Stage 1 regression: Use least squares with outcome, • and covariates to obtain

Interpretation of λ Estimator of Stage 1 Treatment Effect when

Interpretation of λ

Proposal

Simulation

P[β2TS2=0]=1 β1(∞)=β1(0)=0 Test Statistic Nominal Type 1 based on Error=.05 • Nonregularity results in low Type 1 error • Additional smoothing due to use of is useful.

P[β2TS2=0]=1 β1(∞)=β1(0)=.1 Test Statistic Power based on • The low Type 1 error rate translates into low power

P[β2TS2=0]=0 β1(∞)=.125, β1(0)=0 Test Statistic Power based on • Averaging over the future is not a panacea

P[β2TS2=0]=.25 β1(∞)=0, β1(0)=-.25 Test Statistic Type 1 Error=.05 based on • The price is that the null hypothesis is altered.

Discussion • We replace the hypothesis test concerning a non-regular parameter, β1(∞) by a hypothesis test concerning a near-by regular parameter β1(λ*). • This is work in progress—limited theoretical results are available. • If you let increase with the sample size you again end up with a non-regular problem (convergence to limiting distribution is locally non-uniform).

Discussion • Robins (2004) proposes several conservative confidence intervals for β1. • Ideally to decide if the two stage 1 treatments are equivalent, we would evaluate whether the choice of stage 1 treatment influences the mean outcome resulting from the use of the dynamic treatment regime. We did not do this here. • Constructing “evidence-based” regimes is of great interest in clinical research and there is much to be done by statisticians.

This seminar can be found at: http://www.stat.lsa.umich.edu/~samurphy/ seminars/ENAR0307.ppt Email me with questions or if you would like a copy! samurphy@umich.edu

Hypothesis Testing and Dynamic Treatment Regimes

Hypothesis Testing and Dynamic Treatment Regimes

Presentation Transcript

Hypothesis Testing

Testing Hypothesis

Hypothesis Testing

Developing Dynamic Treatment Regimes for Chronic Disorders

Hypothesis Testing

Hypothesis Testing

Developing Dynamic Treatment Regimes for Chronic Disorders

Hypothesis Testing:

Hypothesis testing

Hypothesis Testing

Hypothesis Testing

Q-Learning and Dynamic Treatment Regimes

Hypothesis Testing

Hypothesis Testing

SMART Designs for Developing Dynamic Treatment Regimes

Dynamic Treatment Regimes

Hypothesis testing

SMART Designs for Developing Dynamic Treatment Regimes

Hypothesis Testing

Dynamic Treatment Regimes: Challenges in Data Analysis

Hypothesis and Testing of Hypothesis

Hypothesis Testing and Adaptive Treatment Strategies