Exploring Multi-action Relationship in Reinforcement Learning

被引:8
|
作者
Wang, Han [1 ]
Yu, Yang [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
关键词
D O I
10.1007/978-3-319-42911-3_48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many real-world reinforcement learning problems, an agent needs to control multiple actions simultaneously. To learn under this circumstance, previously, each action was commonly treated independently with other. However, these multiple actions are rarely independent in applications, and it could be helpful to accelerate the learning if the underlying relationship among the actions is utilized. This paper explores multi-action relationship in reinforcement learning. We propose to learn the multi-action relationship by enforcing a regularization term capturing the relationship. We incorporate the regularization term into the least-square policy-iteration and the temporal-difference methods, which result efficiently solvable convex learning objectives. The proposed methods are validated empirically in several domains. Experiment results show that incorporating multi-action relationship can effectively improve the learning performance.
引用
收藏
页码:574 / 587
页数:14
相关论文
共 50 条
  • [1] Benchmarking Genetic Programming in a Multi-action Reinforcement Learning Locomotion Task
    Amaral, Ryan
    Ianta, Alexandru
    Bayer, Caleidgh
    Smith, Robert J.
    Heywood, Malcolm I.
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 522 - 525
  • [2] Robot multi-action cooperative grasping strategy based on deep reinforcement learning
    He, Huiteng
    Zhou, Yong
    Hu, Kaixiong
    Li, Weidong
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2024, 30 (05): : 1789 - 1797
  • [3] Miracle: Multi-Action Reinforcement Learning-Based Chip Floorplanning Reasoner
    Yang, Bo
    Xu, Qi
    Geng, Hao
    Chen, Song
    Kang, Yi
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [4] An offline-to-online reinforcement learning approach based on multi-action evaluation with policy extension
    Cheng, Xuebo
    Huang, Xiaohui
    Huang, Zhichao
    Jiang, Nan
    APPLIED INTELLIGENCE, 2024, 54 (23) : 12246 - 12271
  • [5] A multi-action deep reinforcement learning framework for flexible Job-shop scheduling problem
    Lei, Kun
    Guo, Peng
    Zhao, Wenchao
    Wang, Yi
    Qian, Linmao
    Meng, Xiangyin
    Tang, Liansheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
  • [6] Offline Multi-Action Policy Learning: Generalization and Optimization
    Zhou, Zhengyuan
    Athey, Susan
    Wager, Stefan
    OPERATIONS RESEARCH, 2023, 71 (01) : 148 - 183
  • [7] Multi-Action Dialog Policy Learning with Interactive Human Teaching
    Jhunjhunwala, Megha
    Bryant, Caleb
    Shah, Pararth
    SIGDIAL 2020: 21ST ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2020), 2020, : 290 - 296
  • [8] Retrieve & Memorize: Dialog Policy Learning with Multi-Action Memory
    Li, Yunhao
    Yang, Yunyi
    Quan, Xiaojun
    Yu, Jianxing
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 447 - 459
  • [9] Q-Learning Lagrange Policies for Multi-Action Restless Bandits
    Killian, Jackson A.
    Biswas, Arpita
    Shah, Sanket
    Tambe, Milind
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 871 - 881
  • [10] A Hierarchical-Based Learning Approach for Multi-Action Intent Recognition
    Hollinger, David
    Pollard, Ryan S.
    Schall Jr, Mark C.
    Chen, Howard
    Zabala, Michael
    SENSORS, 2024, 24 (23)