Human-robot cooperative tasks have gained importance with the emergence of robotics and artificial intelligence technology. In interactive reinforcement learning techniques, robots learn target tasks by receiving feedback from an experienced human trainer. However, most interactive reinforcement learning studies require a separate process to integrate the trainer's feedback into the training dataset, making it challenging for robots to learn new tasks from humans in real-time. Furthermore, the types of feedback sentences that trainers can use are limited in previous research. To address these limitations, this paper proposes a robot teaching strategy that uses deep RL via human-robot interaction to learn table balancing tasks interactively. The proposed system employs Deep Q-Network with real-time sentiment feedback delivered through the trainer's speech to learn cooperative tasks. We designed a novel reward function that incorporates sentiment feedback from human speech in real-time during the learning process. The paper presents an improved reward shaping technique based on subdivided feedback levels and shrinking feedback. This function serves as a guide for the robot to engage in natural interactions with humans and enables it to learn the tasks effectively. Experimental results demonstrate that the proposed interactive deep reinforcement learning model achieved a high success rate of up to 99.06%, outperforming the model without sentiment feedback.
机构:
East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Wu, Xingjiao
Xiao, Luwei
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Xiao, Luwei
Sun, Yixuan
论文数: 0引用数: 0
h-index: 0
机构:
Fudan Univ, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Sun, Yixuan
Zhang, Junhang
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Zhang, Junhang
Ma, Tianlong
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Ma, Tianlong
He, Liang
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
He, Liang
[J].
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE,
2022,
135
: 364
-
381
机构:
East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Wu, Xingjiao
Xiao, Luwei
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Xiao, Luwei
Sun, Yixuan
论文数: 0引用数: 0
h-index: 0
机构:
Fudan Univ, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Sun, Yixuan
Zhang, Junhang
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Zhang, Junhang
Ma, Tianlong
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
Ma, Tianlong
He, Liang
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
East China Normal Univ, Sch Comp Sci & Technol, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai, Peoples R China
He, Liang
[J].
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE,
2022,
135
: 364
-
381