共 47 条
[2]
Berkenkamp F, 2017, ADV NEUR IN, V30
[5]
Brockett R.W, 1983, PROG MATH, P181
[8]
Q-Learning: Theory and Applications
[J].
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 7, 2020,
2020, 7
:279-301