39 entries in total
[1]
Rusu AA, 2016, arXiv, DOI 10.48550/arXiv.1606.04671
[2]
Agarwal Alekh, 2020, P MACHINE LEARNING R, V125
[3]
Ahansazan B., 2014, INT J ENV SCI DEV, V5, P81, DOI 10.7763/IJESD.2014.V5.455
[4]
Ammar Haitham Bou, 2012, Adaptive and Learning Agents. International Workshop, ALA 2011 Held at AAMAS 2011. Revised Selected Papers, P21, DOI 10.1007/978-3-642-28499-1_2
[5]
Bertsekas D. P., 2019, algorithm for optimal control with integral reinforcement learn
[8]
Czarnecki WM, 2019, PR MACH LEARN RES, V89
[9]
Devlin S., 2012, P INT C AUT AG MULT, P433