Exploring Multi-action Relationship in Reinforcement Learning

Cited by: 8
Authors
Wang, Han [1]
Yu, Yang [1]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
DOI
10.1007/978-3-319-42911-3_48
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In many real-world reinforcement learning problems, an agent needs to control multiple actions simultaneously. Previous approaches to learning in this setting commonly treated each action as independent of the others. However, these actions are rarely independent in applications, and exploiting the underlying relationship among them could accelerate learning. This paper explores the multi-action relationship in reinforcement learning. We propose to learn this relationship by adding a regularization term that captures it. We incorporate the regularization term into the least-squares policy iteration and temporal-difference methods, which results in efficiently solvable convex learning objectives. The proposed methods are validated empirically in several domains. Experimental results show that incorporating the multi-action relationship can effectively improve learning performance.
Pages: 574-587
Number of pages: 14