共 33 条
- [1] Q-learning for estimating optimal dynamic treatment rules from observational data CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2012, 40 (04): : 629 - 645
- [10] Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32