共 27 条
- [1] Abbeel Pieter, 2006, INT C MACH LEARN ICM, P1
- [2] Argerich M. F., 2020, AAAI SPRING S COMB M
- [3] Berkenkamp F, 2017, ADV NEUR IN, V30
- [4] Bertsekas D., 1996, NEURO DYNAMIC PROGRA
- [5] Brockman G, 2016, Arxiv, DOI [arXiv:1606.01540, DOI 10.48550/ARXIV.1606.01540]
- [6] De Lellis F, 2021, 2021 EUROPEAN CONTROL CONFERENCE (ECC), P580, DOI 10.23919/ECC54610.2021.9654881
- [7] Deisenroth M., 2011, P 28 INT C MACH LEAR, P465, DOI [10.5555/3104482.3104541, DOI 10.5555/3104482.3104541]
- [8] Duan Y, 2016, PR MACH LEARN RES, V48
- [9] Even-Dar E, 2003, J MACH LEARN RES, V5, P1
- [10] Nguyen HT, 2019, 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), P102, DOI 10.1109/SSCI44817.2019.9002756