Learning Lyapunov-Stable Polynomial Dynamical Systems Through Imitation

被引:0
作者
Abyaneh, Amin [1 ]
Lin, Hsiu-Chin [2 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
来源
CONFERENCE ON ROBOT LEARNING, VOL 229 | 2023年 / 229卷
基金
加拿大自然科学与工程研究理事会;
关键词
Imitation learning; Safe learning; Motion planning; Dynamical system; Semidefinite programming; Robotic manipulation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imitation learning is a paradigm to address complex motion planning problems by learning a policy to imitate an expert's behavior. However, relying solely on the expert's data might lead to unsafe actions when the robot deviates from the demonstrated trajectories. Stability guarantees have previously been provided utilizing nonlinear dynamical systems, acting as high-level motion planners, in conjunction with the Lyapunov stability theorem. Yet, these methods are prone to inaccurate policies, high computational cost, sample inefficiency, or quasi stability when replicating complex and highly nonlinear trajectories. To mitigate this problem, we present an approach for learning a globally stable nonlinear dynamical system as a motion planning policy. We model the nonlinear dynamical system as a parametric polynomial and learn the polynomial's coefficients jointly with a Lyapunov candidate. To showcase its success, we compare our method against the state of the art in simulation and conduct real-world experiments with the Kinova Gen3 Lite manipulator arm. Our experiments demonstrate the sample efficiency and reproduction accuracy of our method for various expert trajectories, while remaining stable in the face of perturbations.
引用
收藏
页数:21
相关论文
共 50 条
[1]   Understanding the Impact of Technical Debt in Coding and Testing: An Exploratory Case Study [J].
Abad, Zahra Shakeri Hossein ;
Karimpour, Reza ;
Ho, Jason ;
Didar-Al-Alam, S. M. ;
Ruhe, Guenther ;
Tse, Edward ;
Barabash, Kevin ;
Hargreaves, Ian .
2016 IEEE/ACM 3RD INTERNATIONAL WORKSHOP ON SOFTWARE ENGINEERING RESEARCH AND INDUSTRIAL PRACTICE (SER&IP), 2016, :25-31
[2]  
Abbeel P., 2004, Proceedings of 21st International conference on machine learning
[3]   Robust trajectory learning and approximation for robot programming by demonstration [J].
Aleotti, J. ;
Caselli, S. .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (05) :409-413
[4]  
Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ECC.2019.8796030, 10.23919/ecc.2019.8796030]
[5]  
[Anonymous], IEEE DECIS CONTR P
[6]  
[Anonymous], 1989, ALVINN: An Autonomous Land Vehicle in a Neural Network, page
[7]  
ApS M., 2021, MOSEK OPT API PYTH 1
[8]  
Bazaraa Mokhtar S, 2013, Nonlinear Programming: Theory and Algorithms
[9]  
Billard Aude, 2008, Technical report
[10]  
Blekherman G, 2013, MOS-SIAM SER OPTIMIZ, V13, P1