共 19 条
[1]
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[2]
[Anonymous], 2016, CoRR abs/1606.01540
[3]
[Anonymous], PROXIMAL POLICY OPTI
[5]
Furrer F, 2016, STUD COMPUT INTELL, V625, P595, DOI 10.1007/978-3-319-26054-9_23
[6]
Hill A., 2018, Stable baselines
[9]
Mnih V Badia, 2016, ASYNCHRONOUS METHODS
[10]
Narayanamoorthy A, 2015, PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), P142, DOI 10.1109/ICCIS.2015.7274563