12 entries in total
[1]
Hwang Y.K., Ahuja N., A potential field approach to path planning, IEEE Transactions on Robotics and Automation, 8, 1, (1992)
[2]
Han S.C., Bang H.C., Proportional navigation-based optimal collision avoidance for UAVs, Journal of Institute of Control, Robotics and Systems (In Korean), 10, 11, pp. 1065-1070, (2004)
[3]
Chen Y.F., Liu M., Everett M., How J.P., Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning
[4]
Park S.G., Kim D.H., Autonomous flying of drone based on PPO reinforcement learning algorithm, Journal of Institute of Control, Robotics and Systems (In Korean), 26, 11, pp. 955-963
[5]
Kim M., Kim J., Jung M., Oh H., Collision avoidance for a small drone with monocular camera using deep reinforcement learning in an indoor environment, Journal of Institute of Control, Robotics and Systems (In Korean), 26, 6, pp. 399-411
[6]
Tesauro G., Practical issues in temporal difference learning, Machine Learning, 8, pp. 257-277, (1992)
[7]
Mnih V., Badia A.P., Mirza M., Graves A., Lillicrap T.P., Harley T., Silver D., Kavukcuoglu K., Asynchronous methods for deep reinforcement learning, ICML, (2016)
[8]
Schulman J., Wolski F., Dhariwal P., Radford A., Klimov O., Proximal policy optimization algorithms
[9]
Oh J., Guo Y., Singh S., Lee H., Self-imitation learning
[10]
Kostrikov I., Nachum O., Tompson J., Imitation learning via off-policy distribution matching, ICLR, (2020)