Course
Outline
&
(Ying Ding) Evaluating multiple treatment courses in clinical trials by P. Thall, R. Millikan and H. Sung, 2000 in Statistics in Medicine, vol. 19, pg. 1011-1028 (you have the password). Ying presentation
01/22/07 (Danny
Almirall) Selecting
Therapeutic Strategies Based on Efficacy and Death in Multi-Course Clinical
Trials by P. Thall, H. Sung and E. Estey, 2002 in Journal of the American Statistical Association, vol 97, pg 29-39. (you have the password).
Reinforcement Learning
(Murphy) Intro to Markov Decision Processes and Q-learning (Ch. 6 in Sutton and Barto, Reinforcement Learning)
02/12/07 (Murphy) Review
02/19/07 (Min Qian) Benjamin van Roy’s
chapter on Neuro-dynamic Programming, Min
presentation
03/05/07 (Mark Kliger) Least-Squares Policy Iteration by Michail G. Lagoudakis, Ronald Parr, JMLR, 4(Dec):1107-1149, 2003. http://jmlr.csail.mit.edu/papers/volume4/lagoudakis03a/lagoudakis03a.pdf
03/19/07 Review of homework.
03/26/07 (Ou Zhao) Kernel-based Reinforcement Learning by Dirk Ormoneit and Saunak Sen, Machine Learning, 49, pg. 161-178, 2002. (you have the password). Ou Presentation
Connections to Causal Inference
04/04/07 in MLB B131 (Murphy) Causal Inference Introduction; (Bibhas Chakraborty) Bias Correction in Non-Differentiable Estimating Equations for Optimal Dynamic Treatment Regimes by Erica Moodie
Brief return to Reinfocement Learning:
04/09/07 (Ali Shojaie) Tree-Based
Batch Mode Reinforcement Learning by Damien Ernst, Pierre Guerts and Louie
Wehenkel, JMLR, 6(2005) 503-556.
04/11/07 in MLB B131 (Bodhi Sen) Optimal Structural Nested Models for Optimal Sequential Decisions by James Robins with corrections.
04/16/07 Overflow and ending discussion.