共 40 条
- [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
- [2] [Anonymous], 1998, INTRO REINFORCEMENT
- [3] Real-Time Collision Avoidance Algorithm for Robotic Manipulators [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGIES FOR PRACTICAL ROBOT APPLICATIONS (TEPRA 2009), 2009, : 113 - 122
- [4] BRADTKE SJ, 1994, PROCEEDINGS OF THE 1994 AMERICAN CONTROL CONFERENCE, VOLS 1-3, P3475
- [7] Dang T, 2017, VALIDATION IND CYBER, P57
- [8] Dutton AGBRichard D, 1998, REINFORCEMENT LEARNI
- [9] Eryilmaz MS, 2012, SE EUR J SOFT COMPUT, V1
- [10] Gosavi A., 2015, SIMULATION BASED OPT, V2nd