Learning to Box: Reinforcement Learning using Heuristic Three-step Curriculum Learning

被引：0

作者：

Rho, Heeseon ^{[1
]}

Yu, Yeonguk ^{[1
]}

Lee, Kyoobin ^{[1
]}

机构：

[1] Gwangju Institute of Science and Technology (GIST), School of Integrated Technology (SIT), Gwangju,61005, Korea, Republic of

来源：

International Conference on Control, Automation and Systems | 2022年 / 2022-November卷

关键词：

Compendex;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Reinforcement learning

引用

页码：227 / 231

共 50 条

[1] DETERRENT: Detecting Trojans using Reinforcement Learning [J].

Gohil, Vasudev ;

Patnaik, Satwik ;

Guo, Hao ;

Kalathil, Dileep ;

Rajendran, Jeyavijayan J.V. .

Proceedings - Design Automation Conference, 2022, :697-702

[2] DETERRENT: Detecting Trojans using Reinforcement Learning [J].

Gohil, Vasudev ;

Patnaik, Satwik ;

Guo, Hao ;

Kalathil, Dileep ;

Rajendran, Jeyavijayan .

arXiv, 2022,

[3] An N-step Look Ahead Algorithm Using Mixed (On and Off) Policy Reinforcement Learning [J].

Kuchibhotla, Vivek ;

Harshitha, P. ;

Goyal, Shobhit .

Proceedings of the 3rd International Conference on Intelligent Sustainable Systems, ICISS 2020, 2020, :677-681

[4] Redundant Robot Control Using Multi Agent Reinforcement Learning [J].

Perrusquia, Adolfo ;

Yu, Wen ;

Li, Xiaoou .

IEEE International Conference on Automation Science and Engineering, 2020, 2020-August :1650-1655

[5] Podracer architectures for scalable reinforcement learning [J].

DeepMind, United Kingdom .

arXiv,

[6] Optimization of welding route by automatic machine using reinforcement learning method [J].

Okumoto, Yasuhisa .

Journal of Ship Production, 2008, 24 (03) :135-138

[7] Training and Evaluation of Deep Policies Using Reinforcement Learning and Generative Models [J].

Ghadirzadeh, Ali ;

Poklukar, Petra ;

Arndt, Karol ;

Finn, Chelsea ;

Kyrki, Ville ;

Kragic, Danica ;

Björkman, Mårten .

Journal of Machine Learning Research, 2022, 23

[8] Adaptive beamforming based on the deep reinforcement learning [J].

Hao, Chuanhui ;

Sun, Xubao ;

Liu, Yidong .

ICNSC 2022 - Proceedings of 2022 IEEE International Conference on Networking, Sensing and Control: Autonomous Intelligent Systems, 2022,

[9] Reinforcement learning explains various conditional cooperation [J].

Geng, Yini ;

Liu, Yifan ;

Lu, Yikang ;

Shen, Chen ;

Shi, Lei .

Applied Mathematics and Computation, 2022, 427

[10] Computational-Statistical Gap in Reinforcement Learning [J].

Kane, Daniel ;

Liu, Sihan ;

Lovett, Shachar ;

Mahajan, Gaurav .

Proceedings of Machine Learning Research, 2022, 178 :1282-1302

← 1 2 3 4 5 →