Motivated developmental learning for flexible behavioral decision-making of mobile robots in unknown environment

被引:1
作者
Wang D.-S. [1 ]
Gao X.-L. [1 ]
机构
[1] School of Electrical Engineering, Zhengzhou University, Henan, Zhengzhou
来源
Kongzhi Lilun Yu Yingyong/Control Theory and Applications | 2023年 / 40卷 / 04期
基金
中国国家自然科学基金;
关键词
behavioral decision-making; curiosity; exploration-exploitation trade-offs; locus coeruleus; mobile robot;
D O I
10.7641/CTA.2022.10963
中图分类号
学科分类号
摘要
The flexible behavioral decision-making of mobile robots in unknown environment is the premise of completing various tasks. The current robot behavior decision methods are not flexible in facing the dynamic environments, and it is difficult for the robots to obtain continuous and stable learning ability. In our previous work, the supervised learning of cerebellum and reinforcement learning of basal ganglia is integrated to achieve flexible behavior decision of mobile robot in dynamic environment. However, the proposed algorithm has limited adaptability to the dynamic environment. Based on the previous work, a more biologically meaningful curiosity index is designed to replace the original vigilance index, by simulating the dynamic switch of locus coeruleus activity between tonic mode and phasic mode, the dynamic adaptive adjustment of environment exploration and exploitation of the mobile robot is realized. At the same time, an adaptive adjustment factor changing with the external environment is designed to realize the flexible behavioral decision of mobile robot in dynamic environment based on the supervised learning of the cerebellum, and reinforcement learning of basal ganglia, make the robot obtain stable learning ability. Experimental results in dynamic environment and real environment verify the effectiveness of the proposed algorithm. © 2023 South China University of Technology. All rights reserved.
引用
收藏
页码:641 / 652
页数:11
相关论文
共 35 条
[1]  
HU J Y, PARIJAT B, ALEXANDER L., Group coordinated control of networked mobile robots with applications to object transportation, IEEE Transactions on Vehicular Technology, 70, 8, pp. 8269-8274, (2021)
[2]  
RAHMAN M M, ISHII K, NOGUCHI N., Optimum harvesting area of convex and concave polygon field for path planning of robot combine harvester, Intelligent Service Robotics, 12, 2, pp. 167-179, (2019)
[3]  
JI M, NILANJAN S., Supervisory fault adaptive control of a mobile robot and its application in sensor-fault accommodation, IEEE Transactions on Robotics, 23, 1, pp. 174-178, (2020)
[4]  
DING L, HUANG L, LI S, Et al., Definition and application of variable resistance coefficient for wheeled mobile robots on deformable terrain, IEEE Transactions on Robotics, 36, 3, pp. 894-909, (2020)
[5]  
YAO Xuliang, WANG Feng, WANG Jingfang, Et al., Time-optimal path planning for autonomous underwater vehicles with uncertain ocean currents, Control Theory & Applications, 37, 6, pp. 1302-1310, (2020)
[6]  
ZHU An, CHEN Li, Buffer and compliance control of dual-arm space robot capture satellite operation with spring-damper device, Control Theory & Applications, 39, 1, pp. 117-129, (2022)
[7]  
SUN Z, XIE H, ZHENG J C, Et al., Path-following control of mecanum-wheels omnidirectional mobile robots using nonsingular terminal sliding mode, Mechanical Systems and Signal Processing, 147, (2021)
[8]  
KHALAJI A K, JALALNEZHAD M., Robust forward backward control of wheeled mobile robots, ISA Transactions, 115, pp. 32-45, (2021)
[9]  
MARTINEZ E A, RIOS H, MERA M., Robust tracking control design for unicycle mobile robots with input saturation, Control Engineering Practice, 107, (2021)
[10]  
LI Gaoming, NIE Zhuoyun, LI Zhaoyang, Et al., Disturbance rejection PID control of wheeled mobile robot under non-equlibrium load, Control Theory & Applications, 38, 3, pp. 398-406, (2021)