共 34 条
[1]
Beck A(2003)Mirror descent and nonlinear projected subgradient methods for convex optimization Operations Research Letters 31 167-175
[2]
Teboulle M(2016)Personalizing mechanical ventilation for acute respiratory distress syndrome Journal of thoracic disease 8 E172-334
[3]
Berngard SC(1997)Nonlinear programming Journal of the Operational Research Society 48 334-464
[4]
Beitler JR(2014)Dynamic treatment regimes Annual Review of Statistics and its Application 1 447-1528
[5]
Malhotra A(2016)A linearly convergent variant of the conditional gradient algorithm under strong convexity, with applications to online and stochastic optimization SIAM Journal on Optimization 26 1493-232
[6]
Bertsekas DP(2016)Mimic-iii, a freely accessible critical care database Scientific Data 3 160035-566
[7]
Chakraborty B(2002)Near-optimal reinforcement learning in polynomial time Machine Learning 49 209-407
[8]
Murphy SA(2018)The artificial intelligence clinician learns optimal treatment strategies for sepsis in intensive care Nature Medicine 24 1716-721
[9]
Garber D(2017)Random gradient-free minimization of convex functions Foundations of Computational Mathematics 17 527-undefined
[10]
Hazan E(2000)Algorithms for inverse reinforcement learning ICML 1 2-undefined