39 entries in total
[22]
Ma XC, 2017, IEEE CONF COMPUT, P874, DOI 10.1109/INFCOMW.2017.8116491
[25]
Puterman Martin L, 1994, Markov Decision Processes: Discrete Stochastic Dynamic Programming
[29]
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V2