Task-Oriented Reinforcement Learning with Interest State Representation

Authors
Li, Ziyi [1]
Hu, Xiangtao [2]
Zhang, Yongle [1]
Zhou, Fujie [1]
Affiliations
[1] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
[2] Anhui Univ, Dept Elect Engn & Automat, Hefei 230601, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
OBJECTS
DOI
10.1109/ICARM62033.2024.10715850
Chinese Library Classification
TH [Machinery and Instrument Industry]
Discipline classification code
0802
Abstract
Vision-based reinforcement learning for robotic manipulation currently suffers from poor goal transferability and weak anti-interference performance. These problems stem mainly from pixel-dependent state representations, which make the trained policy highly correlated with the raw observation data. To address these problems, a novel state representation framework named Task-Oriented Reinforcement Learning (TORL) is proposed, which integrates Mask R-CNN with the original PPO-Clipped algorithm. In TORL, four cameras capture multi-view observations of the environment. Mask R-CNN detects the objects in the environment, and the bounding boxes of the objects of interest in each view are extracted as task features to define the interest state representation and the interest reward prediction. Four experiments are designed and carried out; the results demonstrate that TORL can improve goal transferability and anti-interference performance while maintaining learning efficiency and stability.
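The abstract describes building the state from bounding boxes of the objects of interest in each camera view. The following is a minimal sketch of that idea, not the authors' implementation. The detection format (a dict with `label`, `box`, `score`), the `interest_state` and `normalize_box` names, the score threshold, the normalization to [0, 1], and the zero padding for missing objects are all assumptions made for illustration; the abstract does not specify any of them.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical box format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def normalize_box(box: Box, width: int, height: int) -> List[float]:
    """Scale pixel coordinates to [0, 1] (an assumed design choice)."""
    x0, y0, x1, y1 = box
    return [x0 / width, y0 / height, x1 / width, y1 / height]


def interest_state(
    detections_per_view: Sequence[Sequence[Dict]],
    interest_labels: Sequence[str],
    image_size: Tuple[int, int],
    min_score: float = 0.5,  # assumed confidence threshold
) -> List[float]:
    """Concatenate one box per (view, interest label) into a flat state vector.

    Detections whose label is not in interest_labels are ignored, so
    distractor objects do not change the state. A missing object is
    encoded as four zeros (an assumption, not taken from the paper).
    """
    width, height = image_size
    state: List[float] = []
    for detections in detections_per_view:
        for label in interest_labels:
            candidates = [
                d for d in detections
                if d["label"] == label and d["score"] >= min_score
            ]
            if candidates:
                # Keep the most confident detection for this label.
                best = max(candidates, key=lambda d: d["score"])
                state.extend(normalize_box(best["box"], width, height))
            else:
                state.extend([0.0, 0.0, 0.0, 0.0])
    return state


# Four views, as in the abstract; the detections themselves are made up.
views = [
    [
        {"label": "target", "box": (64, 32, 128, 96), "score": 0.9},
        {"label": "distractor", "box": (0, 0, 10, 10), "score": 0.8},
    ],
    [{"label": "target", "box": (128, 128, 192, 192), "score": 0.7}],
    [],  # target not visible from this camera
    [{"label": "target", "box": (0, 0, 256, 256), "score": 0.3}],  # below threshold
]
state = interest_state(views, ["target"], (256, 256))
print(len(state), state[:4])
```

Because the state depends only on the boxes of the selected labels, adding or removing other objects in the scene leaves the vector unchanged in this sketch, which is one plausible reading of the anti-interference claim.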
Pages: 721-728
Number of pages: 8