共 81 条
[2]
Alban A, 2018, WINT SIMUL C PROC, P2459, DOI 10.1109/WSC.2018.8632500
[3]
[Anonymous], 1989, Learning from delayed rewards: A foundation of reinforcement learning
[5]
Atkinson AC., 2014, RANDOMISED RESPONSE
[6]
Bai ZD, 2002, ANN STAT, V30, P122
[7]
Bandyopadhyay U., 1996, Calcutta Statistical Association Bulletin, V46, P69, DOI [10.1177/0008068319960107, DOI 10.1177/0008068319960107]
[8]
Bellman R, 1956, SANKHYA, V16, P221
[9]
Berry D.A., 1996, BAYESIAN BIOSTATISTI, P3