Multiagent-Reinforcement-Learning-Based Stable Path Tracking Control for a Bionic Robotic Fish With Reaction Wheel

被引:16
作者
Qiu, Changlin [1 ,2 ]
Wu, Zhengxing [1 ,2 ]
Wang, Jian [1 ,2 ]
Tan, Min [1 ,2 ]
Yu, Junzhi [1 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Peking Univ, Coll Engn, Dept Adv Mfg & Robot, State Key Lab Turbulence & Complex Syst, Beijing 100871, Peoples R China
[4] Guangdong Ocean Univ, Coll Elect & Informat Engn, Zhanjiang 524088, Peoples R China
基金
中国国家自然科学基金;
关键词
Multiagent reinforcement learning (MARL); path tracking control; reaction wheel; robotic fish; underwater robot; ATTITUDE;
D O I
10.1109/TIE.2023.3239937
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The path tracking of the robotic fish is a hotspot with its high maneuverability and environmental friendliness. However, the periodic oscillation generated by bionic fish-like propulsion mode may lead to unstable control. To this end, this article proposes a novel framework involving a newly designed platform and multiagent reinforcement learning (MARL) method. First, a bionic robotic fish equipped with a reaction wheel is developed to enhance the stability. Second, an MARL-based control framework is proposed for the cooperative control of tail-beating and reaction wheel. Correspondingly, a hierarchical training method including initial training and iterative training is designed to deal with the control coupling and frequency difference between two agents. Finally, extensive simulations and experiments indicate that the developed robotic fish and the proposed MARL-based control framework can effectively improve the accuracy and stability of path tracking. Remarkably, headshaking is reduced about 40%. It provides a promising reference for the stability optimization and cooperative control of bionic swimming robots featuring oscillatory motions.
引用
收藏
页码:12670 / 12679
页数:10
相关论文
共 34 条
[1]  
Castaño ML, 2019, IEEE ASME INT C ADV, P839, DOI [10.1109/AIM.2019.8868586, 10.1109/aim.2019.8868586]
[2]   Model Predictive Control-Based Path-Following for Tail-Actuated Robotic Fish [J].
Castano, Maria L. ;
Tan, Xiaobo .
JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2019, 141 (07)
[3]   Exploration of swimming performance for a biomimetic multi-joint robotic fish with a compliant passive joint [J].
Chen, Di ;
Wu, Zhengxing ;
Dong, Huijie ;
Tan, Min ;
Yu, Junzhi .
BIOINSPIRATION & BIOMIMETICS, 2021, 16 (02)
[4]   Robust control of reaction wheel bicycle robot via adaptive integral terminal sliding mode [J].
Chen, Long ;
Liu, Jun ;
Wang, Hai ;
Hu, Youhao ;
Zheng, Xuefeng ;
Ye, Mao ;
Zhang, Jie .
NONLINEAR DYNAMICS, 2021, 104 (03) :2291-2302
[5]  
Clapham RJ, 2014, IEEE INT C INT ROBOT, P287, DOI 10.1109/IROS.2014.6942574
[6]   Barrier-Based Adaptive Line-of-Sight 3-D Path-Following System for a Multijoint Robotic Fish With Sideslip Compensation [J].
Dai, Shijie ;
Wu, Zhengxing ;
Wang, Jian ;
Tan, Min ;
Yu, Junzhi .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (07) :4204-4217
[7]   Experimental Study on the Improvement of Yaw Stability by Coordination Control between the Caudal Fin and Anal Fin [J].
Ding, Jiang ;
Zheng, Changzhen ;
Song, Chaocheng ;
Zuo, Qiyang ;
Xu, Yaohui ;
Dong, Bingbing ;
Cui, Jiaxu ;
He, Kai ;
Xie, Fengran .
JOURNAL OF BIONIC ENGINEERING, 2022, 19 (05) :1261-1271
[8]   Exploration of underwater life with an acoustically controlled soft robotic fish [J].
Katzschmann, Robert K. ;
DelPreto, Joseph ;
MacCurdy, Robert ;
Rus, Daniela .
SCIENCE ROBOTICS, 2018, 3 (16)
[9]   Reinforcement learning in robotics: A survey [J].
Kober, Jens ;
Bagnell, J. Andrew ;
Peters, Jan .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1238-1274
[10]   Dynamic Torso Compliance Control for Standing and Walking Balance of Position-Controlled Humanoid Robots [J].
Li, Qingqing ;
Meng, Fei ;
Yu, Zhangguo ;
Chen, Xuechao ;
Huang, Qiang .
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2021, 26 (02) :679-688