Learning to Box: Reinforcement Learning using Heuristic Three-step Curriculum Learning

被引:0
作者
Rho, Heeseon [1 ]
Yu, Yeonguk [1 ]
Lee, Kyoobin [1 ]
机构
[1] Gwangju Institute of Science and Technology (GIST), School of Integrated Technology (SIT), Gwangju,61005, Korea, Republic of
来源
International Conference on Control, Automation and Systems | 2022年 / 2022-November卷
关键词
Compendex;
D O I
暂无
中图分类号
学科分类号
摘要
Reinforcement learning
引用
收藏
页码:227 / 231
相关论文
共 50 条
[1]   DETERRENT: Detecting Trojans using Reinforcement Learning [J].
Gohil, Vasudev ;
Patnaik, Satwik ;
Guo, Hao ;
Kalathil, Dileep ;
Rajendran, Jeyavijayan J.V. .
Proceedings - Design Automation Conference, 2022, :697-702
[2]   DETERRENT: Detecting Trojans using Reinforcement Learning [J].
Gohil, Vasudev ;
Patnaik, Satwik ;
Guo, Hao ;
Kalathil, Dileep ;
Rajendran, Jeyavijayan .
arXiv, 2022,
[3]   An N-step Look Ahead Algorithm Using Mixed (On and Off) Policy Reinforcement Learning [J].
Kuchibhotla, Vivek ;
Harshitha, P. ;
Goyal, Shobhit .
Proceedings of the 3rd International Conference on Intelligent Sustainable Systems, ICISS 2020, 2020, :677-681
[4]   Redundant Robot Control Using Multi Agent Reinforcement Learning [J].
Perrusquia, Adolfo ;
Yu, Wen ;
Li, Xiaoou .
IEEE International Conference on Automation Science and Engineering, 2020, 2020-August :1650-1655
[5]   Podracer architectures for scalable reinforcement learning [J].
DeepMind, United Kingdom .
arXiv,
[6]   Optimization of welding route by automatic machine using reinforcement learning method [J].
Okumoto, Yasuhisa .
Journal of Ship Production, 2008, 24 (03) :135-138
[7]   Training and Evaluation of Deep Policies Using Reinforcement Learning and Generative Models [J].
Ghadirzadeh, Ali ;
Poklukar, Petra ;
Arndt, Karol ;
Finn, Chelsea ;
Kyrki, Ville ;
Kragic, Danica ;
Björkman, Mårten .
Journal of Machine Learning Research, 2022, 23
[8]   Adaptive beamforming based on the deep reinforcement learning [J].
Hao, Chuanhui ;
Sun, Xubao ;
Liu, Yidong .
ICNSC 2022 - Proceedings of 2022 IEEE International Conference on Networking, Sensing and Control: Autonomous Intelligent Systems, 2022,
[9]   Reinforcement learning explains various conditional cooperation [J].
Geng, Yini ;
Liu, Yifan ;
Lu, Yikang ;
Shen, Chen ;
Shi, Lei .
Applied Mathematics and Computation, 2022, 427
[10]   Computational-Statistical Gap in Reinforcement Learning [J].
Kane, Daniel ;
Liu, Sihan ;
Lovett, Shachar ;
Mahajan, Gaurav .
Proceedings of Machine Learning Research, 2022, 178 :1282-1302