共 22 条
[1]
Achiam J, 2017, PR MACH LEARN RES, V70
[2]
Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ECC.2019.8796030, 10.23919/ecc.2019.8796030]
[3]
[Anonymous], 2009, Convex optimization
[4]
Berkenkamp F, 2017, ADV NEUR IN, V30
[5]
Efficient and Safe Exploration in Deterministic Markov Decision Processes with Unknown Transition Models
[J].
2019 AMERICAN CONTROL CONFERENCE (ACC),
2019,
:1792-1799
[6]
Brockman G, 2016, Arxiv, DOI arXiv:1606.01540
[7]
Cheng R, 2019, AAAI CONF ARTIF INTE, P3387
[8]
Chow Y, 2019, Arxiv, DOI arXiv:1901.10031
[9]
Fan DD, 2020, IEEE INT CONF ROBOT, P4093, DOI [10.1109/ICRA40945.2020.9196709, 10.1109/icra40945.2020.9196709]
[10]
García J, 2015, J MACH LEARN RES, V16, P1437