共 219 条
[1]
Agrawal S(2013)Thompson sampling for contextual bandits with linear payoffs ICML 3 127-135
[2]
Goyal N(2010)Structural nested mean models for assessing time-varying effect moderation Biometrics 66 131-139
[3]
Almirall D(1994)Feed-forward neural networks IEEE Potentials 13 27-31
[4]
Ten Have T(2007)The gurobi optimizer Transp Res Part B 41 159-178
[5]
Murphy SA(2019)Infectious disease threats in the twenty-first century: strengthening the global response Front Immunol 10 549-1637
[6]
Bebis G(2018)A comprehensive survey of graph embedding: Problems, techniques, and applications IEEE Trans Knowl Data Eng 30 1616-436
[7]
Georgiopoulos M(2005)Generalized bootstrap for estimating equations Ann Stat 33 414-556
[8]
Bixby B(2005)Tree-based batch mode reinforcement learning J Mach Learn Res 6 503-381
[9]
Bloom DE(2021)Robust q-learning J Am Stat Assoc 116 368-977
[10]
Cadarette D(2018)Constructing dynamic treatment regimes over indefinite time horizons Biometrika 105 963-862