Task-Oriented Reinforcement Learning with Interest State Representation

被引:0
|
作者
Li, Ziyi [1 ]
Hu, Xiangtao [2 ]
Zhang, Yongle [1 ]
Zhou, Fujie [1 ]
机构
[1] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
[2] Anhui Univ, Dept Elect Engn & Automat, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
OBJECTS;
D O I
10.1109/ICARM62033.2024.10715850
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Current visual-based reinforcement learning for robotic manipulation is plagued by the problems related to goal transferability and anti-interference performance. The problems are mainly due to the pixel-dependence of state representations, which directly leads to the trained policy being highly correlated with the original observation data. To address above problems, a novel state representation framework named Task-Oriented Reinforcement Learning (TORL) is proposed by integrating Mask R-CNN with the original PPO-Clipped. In TORL, four cameras are used to capture multi-view observations from environment. Mask R-CNN is used to detect the objects in the environment, and the bounding boxes of the interest objects in each view are extracted as task features to define interest state representation and interest reward prediction. Four experiments are designed and carried out, the results demonstrate that TORL can improve goal transferability and anti-interference performance while ensuring the learning efficiency and stability.
引用
收藏
页码:721 / 728
页数:8
相关论文
共 50 条
  • [31] TASK-ORIENTED ARCHITECTURES
    BISIANI, R
    MAUERSBERG, H
    REDDY, R
    PROCEEDINGS OF THE IEEE, 1983, 71 (07) : 885 - 898
  • [32] Unifying Task-Oriented Knowledge Graph Learning and Recommendation
    Li, Qianyu
    Tang, Xiaoli
    Wang, Tengyun
    Yang, Haizhi
    Song, Hengjie
    IEEE ACCESS, 2019, 7 : 115816 - 115828
  • [33] Adversarial Learning of Task-Oriented Neural Dialog Models
    Liu, Bing
    Lane, Ian
    19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), 2018, : 350 - 359
  • [34] Event-Driven Interest Detection for Task-Oriented Mobile Apps
    Ota, Fernando Kaway Carvalho
    Damoun, Farouk
    Lagraa, Sofiane
    Becerra-Sanchez, Patricia
    Atten, Christophe
    Hilger, Jean
    State, Radu
    MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES, 2022, 419 : 582 - 598
  • [35] Discovery of Action Patterns in Task-Oriented Learning Processes
    Zhou, Xiaokang
    Chen, Jian
    Jin, Qun
    ADVANCES IN WEB-BASED LEARNING - ICWL 2013, 2013, 8167 : 121 - 130
  • [36] Budgeted Policy Learning for Task-Oriented Dialogue Systems
    Zhang, Zhirui
    Li, Xiujun
    Gao, Jianfeng
    Chen, Enhong
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3742 - 3751
  • [37] Unsupervised learning of kb queries in task-oriented dialogs
    Raghu D.
    Gupta N.
    Mausam
    Transactions of the Association for Computational Linguistics, 2021, 9 : 374 - 390
  • [38] One-Shot Learning for Task-Oriented Grasping
    Holomjova, Valerija
    Starkey, Andrew J.
    Yun, Bruno
    Meisner, Pascal
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (12) : 8232 - 8238
  • [39] Task-oriented contrastive learning for unsupervised domain adaptation
    Wei, Xing
    Wen, Bin
    Yang, Fan
    Liu, Yujie
    Zhao, Chong
    Hu, Di
    Luo, Hui
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [40] Structural Learning: Attraction and Conformity in Task-Oriented Groups
    James A. Kitts
    Michael W. Macy
    Andreas Flache
    Computational & Mathematical Organization Theory, 1999, 5 (2): : 129 - 145