Guided Reinforcement Learning via Sequence Learning

Cited by: 0

Authors:

Ramamurthy, Rajkumar [1]

Sifa, Rafet [1]

Luebbering, Max [1]

Bauckhage, Christian [1]

Affiliations:

[1] Fraunhofer IAIS, St Augustin, Germany

Source:

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II | 2020 / Vol. 12397

Keywords:

Reinforcement Learning; Exploration; Novelty Search; Representation learning; Sequence learning;

DOI:

10.1007/978-3-030-61616-8_27

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Applications of Reinforcement Learning (RL) suffer from high sample complexity due to sparse reward signals and inadequate exploration. Novelty Search (NS) serves as an auxiliary task in this regard, encouraging exploration towards unseen behaviors. However, NS suffers from critical drawbacks concerning scalability and generalizability since it is based on instance learning. Addressing these challenges, we previously proposed a generic approach that uses unsupervised learning to learn representations of agent behaviors and uses reconstruction losses as novelty scores. However, it considered only fixed-length sequences and did not utilize sequential information of behaviors. Therefore, we here extend this approach by using sequential auto-encoders to incorporate sequential dependencies. Experimental results on benchmark tasks show that this sequence learning aids exploration, outperforming previous novelty search methods.
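
The idea of scoring novelty by reconstruction loss can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a linear auto-encoder (equivalent to PCA) on fixed-length behavior vectors as a stand-in for the sequential auto-encoders named in the abstract, and the data, dimensions, and function names are hypothetical.

```python
import numpy as np

def fit_linear_autoencoder(behaviors, latent_dim):
    """Fit a linear auto-encoder (equivalent to PCA) on seen behaviors."""
    mean = behaviors.mean(axis=0)
    # Rows of vt are principal directions; the top latent_dim rows form the code.
    _, _, vt = np.linalg.svd(behaviors - mean, full_matrices=False)
    return mean, vt[:latent_dim]

def novelty_score(behavior, mean, components):
    """Mean squared reconstruction error of one behavior vector."""
    code = (behavior - mean) @ components.T      # encode
    reconstruction = code @ components + mean    # decode
    return float(np.mean((behavior - reconstruction) ** 2))

rng = np.random.default_rng(0)
# Hypothetical "seen" behaviors: 200 vectors of length 6 near a 2-D subspace.
basis = rng.normal(size=(2, 6))
seen = rng.normal(size=(200, 2)) @ basis + 0.01 * rng.normal(size=(200, 6))

mean, components = fit_linear_autoencoder(seen, latent_dim=2)

familiar = rng.normal(size=2) @ basis   # lies in the subspace already seen
unseen = 3.0 * rng.normal(size=6)       # arbitrary direction, not seen before

familiar_score = novelty_score(familiar, mean, components)
unseen_score = novelty_score(unseen, mean, components)
print(familiar_score, unseen_score)
```

A behavior resembling those the model was fitted on reconstructs well and scores low, while an unfamiliar one reconstructs poorly and scores high. A sequential auto-encoder would replace the linear encoder and decoder so that variable-length behavior sequences can be scored in the same way.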


Pages: 335-345

Number of pages: 11

50 records in total

[31] Coordinated crawling via reinforcement learning [J].

Mishra, Shruti ;

van Rees, Wim M. ;

Mahadevan, L. .

JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2020, 17 (169)

[32] Methodologies for Imitation Learning via Inverse Reinforcement Learning: A Review [J].

Zhang K. ;

Yu Y. .

Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (02) :254-261

[33] Learning Personalized Health Recommendations via Offline Reinforcement Learning [J].

Preuett, Larry .

PROCEEDINGS OF THE EIGHTEENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2024, 2024, :1355-1357

[34] Bayesian Deep Reinforcement Learning via Deep Kernel Learning [J].

Xuan, Junyu ;

Lu, Jie ;

Yan, Zheng ;

Zhang, Guangquan .

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (01) :164-171

[35] State-novelty guided action persistence in deep reinforcement learning [J].

Hu, Jianshu ;

Weng, Paul ;

Ban, Yutong .

MACHINE LEARNING, 2025, 114 (02)

[36] Vulnerability Mining of Deep Learning Framework for Model Generation Guided by Reinforcement Learning [J].

Pan L. ;

Liu L. ;

Luo S. ;

Zhang Z. .

Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2024, 44 (05) :521-529

[37] Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning [J].

Kim, Man-Je ;

Park, Hyunsoo ;

Ahn, Chang Wook .

ELECTRONICS, 2022, 11 (07)

[38] Learning to calibrate: Reinforcement learning for guided calibration of visual-inertial rigs [J].

Nobre, Fernando ;

Heckman, Christoffer .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2019, 38 (12-13) :1388-1402

[39] Online Reinforcement Learning for Designing Automotive Hybrid Assembly Sequence: A Task Clustering-Guided Approach [J].

Elhoud, Anass ;

Piranda, Benoit ;

De Matos, Raphael ;

Bourgeois, Julien .

ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT II, AIAI 2024, 2024, 712 :115-128

[40] Analytically Guided Reinforcement Learning for Green It and Fluent Traffic [J].

Korecki, Marcin ;

Helbing, Dirk .

IEEE ACCESS, 2022, 10 :96348-96358
