A Critical Period for Robust Curriculum-Based Deep Reinforcement Learning of Sequential Action in a Robot Arm

被引:4
|
作者
de Kleijn, Roy [1 ]
Sen, Deniz [2 ]
Kachergis, George [3 ]
机构
[1] Leiden Univ, Leiden Inst Brain & Cognit, Leiden, Netherlands
[2] Leiden Univ, Math Inst, Leiden, Netherlands
[3] Stanford Univ, Language & Cognit Lab, Stanford, CA 94305 USA
关键词
Curriculum learning; Movement optimization; Reinforcement learning; Robotic arm control; Sequential action; GENERATION;
D O I
10.1111/tops.12595
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Many everyday activities are sequential in nature. That is, they can be seen as a sequence of subactions and sometimes subgoals. In the motor execution of sequential action, context effects are observed in which later subactions modulate the execution of earlier subactions (e.g., reaching for an overturned mug, people will optimize their grasp to achieve a comfortable end state). A trajectory (movement) adaptation of an often-used paradigm in the study of sequential action, the serial response time task, showed several context effects of which centering behavior is of special interest. Centering behavior refers to the tendency (or strategy) of subjects to move their arm or mouse cursor to a position equidistant to all stimuli in the absence of predictive information, thereby reducing movement time to all possible targets. In the current study, we investigated sequential action in a virtual robotic agent trained using proximal policy optimization, a state-of-the-art deep reinforcement learning algorithm. The agent was trained to reach for appearing targets, similar to a serial response time task given to humans. We found that agents were more likely to develop centering behavior similar to human subjects after curricularized learning. In our curriculum, we first rewarded agents for reaching targets before introducing a penalty for energy expenditure. When the penalty was applied with no curriculum, many agents failed to learn the task due to a lack of action space exploration, resulting in high variability of agents' performance. Our findings suggest that in virtual agents, similar to infants, early energetic exploration can promote robust later learning. This may have the same effect as infants' curiosity-based learning by which they shape their own curriculum. However, introducing new goals cannot wait too long, as there may be critical periods in development after which agents (as humans) cannot flexibly learn to incorporate new objectives. These lessons are making their way into machine learning and offer exciting new avenues for studying both human and machine learning of sequential action.
引用
收藏
页码:311 / 326
页数:16
相关论文
共 50 条
  • [1] Curriculum-Based Deep Reinforcement Learning for Quantum Control
    Ma, Hailan
    Dong, Daoyi
    Ding, Steven X.
    Chen, Chunlin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8852 - 8865
  • [2] Curriculum-Based Reinforcement Learning for Distribution System Critical Load Restoration
    Zhang, Xiangyu
    Eseye, Abinet Tesfaye
    Knueven, Bernard
    Liu, Weijia
    Reynolds, Matthew
    Jones, Wesley
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (05) : 4418 - 4431
  • [3] Cooperative Object Transportation Using Curriculum-Based Deep Reinforcement Learning
    Eoh, Gyuho
    Park, Tae-Hyoung
    SENSORS, 2021, 21 (14)
  • [4] Robust Contact-Rich Task Learning With Reinforcement Learning and Curriculum-Based Domain Randomization
    Aflakian, Ali
    Hathaway, Jamie
    Stolkin, Rustam
    Rastegarpanah, Alireza
    IEEE ACCESS, 2024, 12 : 103461 - 103472
  • [5] Acquiring musculoskeletal skills with curriculum-based reinforcement learning
    Chiappa, Alberto Silvio
    Tano, Pablo
    Patel, Nisheet
    Ingster, Abigail
    Pouget, Alexandre
    Mathis, Alexander
    NEURON, 2024, 112 (23)
  • [6] Pedestrian Simulation with Reinforcement Learning: A Curriculum-Based Approach
    Vizzari, Giuseppe
    Cecconello, Thomas
    FUTURE INTERNET, 2023, 15 (01):
  • [7] UAV Navigation in 3D Urban Environments with Curriculum-based Deep Reinforcement Learning
    de Carvalho, Kevin Braathen
    de Oliveira, Iure Rosa L.
    Brandao, Alexandre S.
    2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 1249 - 1255
  • [8] Curriculum-Based Asymmetric Multi-Task Reinforcement Learning
    Huang, Hanchi
    Ye, Deheng
    Shen, Li
    Liu, Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7258 - 7269
  • [9] Coordinated Transportation of Dual-arm Robot Based on Deep Reinforcement Learning
    Hao, Zixuan
    Chen, Gang
    Huang, Zeyuan
    Jia, Qingxuan
    Liu, Yu
    Yao, Zhipeng
    2024 IEEE 19TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ICIEA 2024, 2024,
  • [10] Curriculum-based reinforcement learning for path tracking in an underactuated nonholonomic system
    Chivkula, Prashanth
    Rodwell, Colin
    Tallapragada, Phanindra
    IFAC PAPERSONLINE, 2022, 55 (37): : 339 - 344