18 entries in total
[1]
PETERS J, SCHAAL S., Reinforcement learning of motor skills with policy gradients, Neural Networks, 21, 4, pp. 682-697, (2008)
[2]
CHEBOTAR Y, KALAKRISHNAN M, YAHYA A, et al., Path integral guided policy search, Proceedings of IEEE International Conference on Robotics and Automation, pp. 3381-3388, (2017)
[3]
ANDRYCHOWICZ M, BAKER B, CHOCIEJ M, et al., Learning dexterous in-hand manipulation, The International Journal of Robotics Research, 39, 1, pp. 3-20, (2020)
[4]
LI H, KUMAR N, CHEN R, et al., A deep reinforcement learning framework for identifying funny scenes in movies, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 3116-3120, (2018)
[5]
YANG Rui, YAN Jiangpeng, LI Xiu, Research on reinforcement learning sparse reward algorithm, CAAI Transactions on Intelligent Systems, 15, 5, pp. 888-899, (2020)
[6]
GULLAPALLI V, BARTO A G., Shaping as a method for accelerating reinforcement learning, Proceedings of IEEE International Symposium on Intelligent Control, pp. 554-559, (1992)
[7]
HUSSEIN A, GABER M M, ELYAN E, et al., Imitation learning: A survey of learning methods, ACM Computing Surveys, 50, 2, pp. 1-35, (2017)
[8]
BENGIO Y, LOURADOUR J, COLLOBERT R, et al., Curriculum learning, Proceedings of International Conference on Machine Learning, pp. 41-48, (2009)
[9]
ANDRYCHOWICZ M, WOLSKI F, RAY A, et al., Hindsight experience replay, Advances in Neural Information Processing Systems, 12, 3, pp. 5048-5058, (2017)
[10]
REIZINGER P, SZEMENYEI M., Attention-based curiosity-driven exploration in deep reinforcement learning, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 3542-3546, (2020)