22 entries in total
[2]
Bertsekas D.P., 2019, REINFORCEMENT LEARNI, V1st
[5]
Bertsekas Dimitri P., 2017, DYNAMIC PROGRAMMING, V1
[6]
Bhat N., 2023, Stochastic Systems, V25, P1
[7]
Bishop C., 1995, NEURAL NETWORKS PATT
[9]
Dietterich TG, 2002, ADV NEUR IN, V14, P1491