Methods for Estimating the Decision Rules in Dynamic Treatment Regimes

S.A. Murphy, Univ. of Michigan. IBC/ASC: July, 2004.

Presentation Transcript


  1. Methods for Estimating the Decision Rules in Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan IBC/ASC: July, 2004

  2. Dynamic Treatment Regimes

  3. Dynamic Treatment Regimes are individually tailored treatments, with treatment type and dosage changing with ongoing subject information. Mimic Clinical Practice. • Brooner et al. (2002) Treatment of Opioid Addiction • Breslin et al. (1999) Treatment of Alcohol Addiction • Prochaska et al. (2001) Treatment of Tobacco Addiction • Rush et al. (2003) Treatment of Depression

  4. EXAMPLE: Treatment of alcohol dependency. Primary outcome is a summary of heavy drinking scores over time.

  5. Examples of sequential multiple assignment randomized trials: • CATIE (2001) Treatment of Psychosis in Alzheimer’s Patients • CATIE (2001) Treatment of Psychosis in Schizophrenia • STAR*D (2003) Treatment of Depression • Thall et al. (2000) Treatment of Prostate Cancer

  6. k Decisions • Observations made prior to jth decision • Action at jth decision • Primary Outcome: a known function f of the observations and actions

  7. A dynamic treatment regime is a vector of decision rules, one per decision. If the regime is implemented then

  8. Methods for Estimating Decision Rules

  9. Three Methods for Estimating Decision Rules • Q-Learning (Watkins, 1989) • ---regression • A-Learning (Murphy, Robins, 2003) • ---regression on a mean zero space. • Weighting (Murphy, van der Laan & Robins, 2002) • ---weighted mean

  10. One decision only! Data: is randomized with probability

  11. Goal Choose to maximize:

  12. Q-Learning Minimize
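As an illustration of the regression idea behind Q-Learning in the one-decision case, the sketch below fits a working model for the mean outcome given observation and action by least squares, then picks the action with the larger fitted value. The notation (`x` for the observation, `a` for a binary action, `y` for the outcome), the linear working model, and the simulated data are all assumptions made for this example; they are not taken from the slides.

```python
import numpy as np

def q_features(x, a):
    """Design matrix for the assumed working model Q(x, a) = b0 + b1*x + a*(b2 + b3*x)."""
    return np.column_stack([np.ones_like(x), x, a, a * x])

def fit_q(x, a, y):
    """Least-squares fit of the working Q-function (the regression step)."""
    beta, *_ = np.linalg.lstsq(q_features(x, a), y, rcond=None)
    return beta

def q_rule(beta, x):
    """Estimated decision rule: choose the action with the larger fitted Q-value."""
    x = np.asarray(x, dtype=float)
    q1 = q_features(x, np.ones_like(x)) @ beta
    q0 = q_features(x, np.zeros_like(x)) @ beta
    return (q1 > q0).astype(int)

# Hypothetical simulated trial: the action is randomized with probability 1/2,
# and treatment helps only when x > 0.
rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=n)
a = rng.integers(0, 2, size=n).astype(float)
y = 1.0 + 0.5 * x + a * (2.0 * x) + rng.normal(scale=0.5, size=n)

beta = fit_q(x, a, y)
print("fitted coefficients:", beta)
print("recommended actions at x = -1 and x = 1:", q_rule(beta, [-1.0, 1.0]))
```

Here the working model happens to contain the true mean function, so the fitted rule recovers "treat when x > 0". When the working model is wrong, the regression still minimizes squared prediction error, which need not produce the best rule.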

  13. A-Learning Minimize
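One reading of "regression on a mean zero space" for a single randomized decision is sketched below: only the treatment contrast is modeled, and it enters the regression multiplied by the centered action `a - p`, which has mean zero given the observation. The notation (`x`, `a`, `y`, randomization probability `p`), the linear contrast `psi0 + psi1*x`, and the simulated data are assumptions for illustration and may differ from the formulation on the slide.

```python
import numpy as np

def fit_contrast(x, a, y, p):
    """Least squares of y on a crude baseline (1, x) plus the centered-action
    terms (a - p) and (a - p) * x; returns only the contrast parameters."""
    design = np.column_stack([np.ones_like(x), x, a - p, (a - p) * x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef[2:]  # (psi0, psi1)

def a_rule(psi, x):
    """Treat exactly when the estimated contrast psi0 + psi1 * x is positive."""
    x = np.asarray(x, dtype=float)
    return (psi[0] + psi[1] * x > 0).astype(int)

# Hypothetical simulated trial with a nonlinear baseline that the working
# model (1, x) does not capture; the true contrast is 0.5 - 1.0 * x.
rng = np.random.default_rng(1)
n = 5000
p = 0.5
x = rng.normal(size=n)
a = (rng.random(n) < p).astype(float)
y = np.cos(2 * x) + a * (0.5 - 1.0 * x) + rng.normal(scale=0.5, size=n)

psi = fit_contrast(x, a, y, p)
print("estimated contrast parameters:", psi)
print("recommended actions at x = 0 and x = 2:", a_rule(psi, [0.0, 2.0]))
```

Because `a - p` is uncorrelated with any function of `x` under randomization, the contrast estimate in this sketch does not rely on the baseline part of the model being correct, only on the contrast model itself.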

  14. Weighting
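The weighted-mean idea can be sketched for one randomized decision as follows: the mean outcome under a candidate rule is estimated by an inverse-probability-weighted average over subjects whose randomized action agrees with the rule, and the rule with the largest estimate is chosen from a restricted class. The notation (`x`, `a`, `y`, `p`), the class of threshold rules, and the simulated data are assumptions for this example rather than details from the slides.

```python
import numpy as np

def weighted_value(x, a, y, p, rule):
    """Normalized inverse-probability-weighted mean of y over subjects whose
    randomized action matches the action the rule would assign."""
    d = rule(x)
    prob = np.where(a == 1, p, 1 - p)   # probability of the action actually received
    w = (a == d) / prob
    return np.sum(w * y) / np.sum(w)

def threshold_rule(c):
    """Hypothetical rule class: treat (action 1) exactly when x > c."""
    return lambda x: (x > c).astype(int)

# Hypothetical simulated trial: treatment helps only when x > 0.
rng = np.random.default_rng(2)
n = 5000
p = 0.5
x = rng.normal(size=n)
a = (rng.random(n) < p).astype(int)
y = 1.0 + a * (2.0 * x) + rng.normal(scale=0.5, size=n)

thresholds = np.linspace(-2, 2, 9)
values = [weighted_value(x, a, y, p, threshold_rule(c)) for c in thresholds]
best = thresholds[int(np.argmax(values))]
print("estimated value at each threshold:", np.round(values, 3))
print("selected threshold:", best)
```

Unlike the two regression sketches, this approach evaluates each candidate rule directly, so it targets the best rule within the chosen class without modeling the outcome.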

  15. Discussion

  16. Discussion • Consistency of Parameterization • ---problems for Q-Learning • Model Space • ---bias • ---variance

  17. Q-Learning Minimize

  18. Minimize

  19. Discussion • Consistency of Parameterization • ---problems for Q-Learning • Model Space • ---bias • ---variance

  20. Points to keep in mind • The sequential multiple assignment randomized trial is a trial for developing powerful dynamic treatment regimes; it is not a confirmatory trial. • Focus on MSE, recognizing that, due to the high dimensionality of X, the model parameterization is likely incorrect.

  21. Goal Given a restricted set of functional forms for the decision rules, say , find

  22. Discussion • Mismatch in Goals • ---problems for Q-Learning & A-Learning

  23. Suppose our sample is infinite. Then in general neither or is close to

  24. Open Problems • How might we “guide” Q-Learning or A-Learning so as to more closely achieve our goal? • Dealing with high-dimensional X: feature extraction, feature selection.

  25. This seminar can be found at: http://www.stat.lsa.umich.edu/~samurphy/seminars/ibc_asc_0704.ppt My email address: samurphy@umich.edu
