共 50 条
[31]
Qu G, 2020, PR MACH LEARN RES, V125
[32]
Salgia S., 2024, 38 ANN C NEUR INF PR
[33]
Shen H, 2022, Arxiv, DOI arXiv:2012.15511
[34]
Shi Laixi, 2022, P MACHINE LEARNING R
[35]
Sun J, 2020, PR MACH LEARN RES, V108
[36]
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[37]
Szepesvari C, 1998, ADV NEUR IN, V10, P1064
[38]
ASYNCHRONOUS STOCHASTIC-APPROXIMATION AND Q-LEARNING
[J].
MACHINE LEARNING,
1994, 16 (03)
:185-202
[39]
Wai HT, 2020, IEEE DECIS CONTR P, P4897, DOI [10.1109/CDC42340.2020.9304466, 10.1109/cdc42340.2020.9304466]
[40]
Wainwright M. J., 2019, arXiv