Task-Oriented Reinforcement Learning with Interest State Representation

Cited by: 0
Authors
Li, Ziyi [1]
Hu, Xiangtao [2]
Zhang, Yongle [1]
Zhou, Fujie [1]
Affiliations
[1] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
[2] Anhui Univ, Dept Elect Engn & Automat, Hefei 230601, Peoples R China
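Funding
National Natural Science Foundation of China
Keywords
OBJECTS
DOI
10.1109/ICARM62033.2024.10715850
Chinese Library Classification number
TH [Machinery and Instrument Industry]
Discipline classification code
0802
Abstract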
Current vision-based reinforcement learning for robotic manipulation suffers from poor goal transferability and weak anti-interference performance. Both problems stem mainly from the pixel dependence of state representations, which leaves the trained policy highly correlated with the original observation data. To address these problems, a novel state representation framework named Task-Oriented Reinforcement Learning (TORL) is proposed, which integrates Mask R-CNN with the original PPO-Clipped. In TORL, four cameras capture multi-view observations of the environment. Mask R-CNN detects the objects in the environment, and the bounding boxes of the objects of interest in each view are extracted as task features to define the interest state representation and interest reward prediction. Four experiments are designed and carried out; the results demonstrate that TORL can improve goal transferability and anti-interference performance while preserving learning efficiency and stability.
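The abstract does not specify how the bounding boxes are turned into a state vector, so the following is only a minimal sketch of one plausible encoding, not the authors' implementation. It assumes each view yields a mapping from object label to a pixel-space box `(x1, y1, x2, y2)`, that boxes are normalized by image size, and that a missing detection is encoded as four zeros. The names `interest_state`, `normalize_box`, and `interest_labels` are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical types and names for illustration; none of them come from the paper.
Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels

NUM_VIEWS = 4  # the abstract states that four cameras are used


def normalize_box(box: Box, width: int, height: int) -> List[float]:
    """Scale pixel corner coordinates to [0, 1] (an assumption of this sketch)."""
    x1, y1, x2, y2 = box
    return [x1 / width, y1 / height, x2 / width, y2 / height]


def interest_state(
    detections: List[Dict[str, Box]],
    interest_labels: List[str],
    width: int,
    height: int,
) -> List[float]:
    """Concatenate the boxes of the interest objects over all views.

    detections[v] maps an object label to its box in view v, e.g. as
    post-processed detector output. Objects whose label is not in
    interest_labels are ignored, so they never reach the policy input.
    A missing detection is encoded as four zeros (assumed convention).
    """
    if len(detections) != NUM_VIEWS:
        raise ValueError(f"expected {NUM_VIEWS} views, got {len(detections)}")
    state: List[float] = []
    for view in detections:
        for label in interest_labels:
            box = view.get(label)
            if box is None:
                state.extend([0.0, 0.0, 0.0, 0.0])
            else:
                state.extend(normalize_box(box, width, height))
    return state


# Example: one interest object ("cube") seen in three of the four views.
views = [
    {"cube": (64, 32, 128, 96), "distractor": (0, 0, 10, 10)},
    {"cube": (32, 32, 64, 64)},
    {},  # the cube is occluded in this view
    {"cube": (128, 128, 192, 192)},
]
state = interest_state(views, ["cube"], width=256, height=256)
```

Under these assumptions the state has a fixed length of 4 views × 4 coordinates per interest object, regardless of image content. Because only the labels in `interest_labels` are looked up, a distractor object does not change the vector, and pointing the same policy at a different goal object amounts to changing the label list; this is one way to read the abstract's claims about anti-interference performance and goal transferability.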
Pages: 721-728
Number of pages: 8