Adaptive Sliding Mode Disturbance Observer and Deep Reinforcement Learning Based Motion Control for Micropositioners

被引:7
作者
Liang, Shiyun [1 ,2 ]
Xi, Ruidong [1 ,2 ]
Xiao, Xiao [3 ]
Yang, Zhixin [1 ,2 ]
机构
[1] Univ Macau, State Key Lab Internet Things Smart City, Macau 999078, Peoples R China
[2] Univ Macau, Dept Electromech Engn, Macau 999078, Peoples R China
[3] Southern Univ Sci & Technol, Dept Elect & Elect Engn, Shenzhen 518055, Peoples R China
关键词
micropositioners; reinforcement learning; disturbance observer; deep deterministic policy gradient; SYSTEMS;
D O I
10.3390/mi13030458
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The motion control of high-precision electromechanitcal systems, such as micropositioners, is challenging in terms of the inherent high nonlinearity, the sensitivity to external interference, and the complexity of accurate identification of the model parameters. To cope with these problems, this work investigates a disturbance observer-based deep reinforcement learning control strategy to realize high robustness and precise tracking performance. Reinforcement learning has shown great potential as optimal control scheme, however, its application in micropositioning systems is still rare. Therefore, embedded with the integral differential compensator (ID), deep deterministic policy gradient (DDPG) is utilized in this work with the ability to not only decrease the state error but also improve the transient response speed. In addition, an adaptive sliding mode disturbance observer (ASMDO) is proposed to further eliminate the collective effect caused by the lumped disturbances. The micropositioner controlled by the proposed algorithm can track the target path precisely with less than 1 mu m error in simulations and actual experiments, which shows the sterling performance and the accuracy improvement of the controller.
引用
收藏
页数:21
相关论文
共 47 条
[1]  
Adda U, 2005, IEEE INT CONF ROBOT, P4569
[2]   Adaptive High-Order Terminal Sliding Mode Control Based on Time Delay Estimation for the Robotic Manipulators With Backlash Hysteresis [J].
Ahmed, Saim ;
Wang, Haoping ;
Tian, Yang .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (02) :1128-1137
[3]   Policy derivation methods for critic -only reinforcement learning in continuous spaces [J].
Alibekov, Eduard ;
Kubalik, Jiri ;
Babuska, Robert .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 69 :178-187
[4]   Force-Position Photo-Robotic Approach for the High-Accurate Micro-Assembly of Photonic Devices [J].
Bettahar, Houari ;
Clevy, Cedric ;
Courjal, Nadege ;
Lutz, Philippe .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) :6396-6402
[5]   Positioning Accuracy in Holographic Optical Traps [J].
Catala-Castro, Frederic ;
Martin-Badosa, Estela .
MICROMACHINES, 2021, 12 (05)
[6]   Tracking Flight Control of Quadrotor Based on Disturbance Observer [J].
Chen, Mou ;
Xiong, Shixun ;
Wu, Qingxian .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (03) :1414-1423
[7]   Nanoimprint lithography: Emergent materials and methods of actuation [J].
Cox, Lewis M. ;
Martinez, Alina M. ;
Blevins, Adrienne K. ;
Sowan, Nancy ;
Ding, Yifu ;
Bowman, Christopher N. .
NANO TODAY, 2020, 31
[8]   A New Methodology for a Retrofitted Self-tuned Controller with Open-Source FPGA [J].
Cruz-Miguel, Edson E. ;
Garcia-Martinez, Jose R. ;
Rodriguez-Resendiz, Juvenal ;
Carrillo-Serrano, Roberto, V .
SENSORS, 2020, 20 (21) :1-21
[9]   Robotic Manipulation of Deformable Cells for Orientation Control [J].
Dai, Changsheng ;
Zhang, Zhuoran ;
Lu, Yuchen ;
Shan, Guanqiao ;
Wang, Xian ;
Zhao, Qili ;
Ru, Changhai ;
Sun, Yu .
IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (01) :271-283
[10]   Optimization of a Spin-Orbit Torque Switching Scheme Based on Micromagnetic Simulations and Reinforcement Learning [J].
de Orio, Roberto L. ;
Ender, Johannes ;
Fiorentini, Simone ;
Goes, Wolfgang ;
Selberherr, Siegfried ;
Sverdlov, Viktor .
MICROMACHINES, 2021, 12 (04)