Active Learning of Dynamics for Data-Driven Control Using Koopman Operators

被引:132
作者
Abraham, Ian [1 ,2 ]
Murphey, Todd D. [1 ,2 ]
机构
[1] Northwestern Univ, Neurosci & Robot Lab NxR, Evanston, IL 60208 USA
[2] Northwestern Univ, Dept Mech Engn, Evanston, IL 60208 USA
基金
美国国家科学基金会;
关键词
Active learning; information theoretic control; Koopman operators; single execution learning; GRADIENT DESCENT; LEAST-SQUARES; PARAMETER-IDENTIFICATION; ADAPTIVE-CONTROL; SYSTEMS; EXPLORATION; ROBOTICS;
D O I
10.1109/TRO.2019.2923880
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
This paper presents an active learning strategy for robotic systems that takes into account task information, enables fast learning, and allows control to be readily synthesized by taking advantage of the Koopman operator representation. We first motivate the use of representing nonlinear systems as linear Koopman operator systems by illustrating the improved model-based control performance with an actuated Van der Pol system. Information-theoretic methods are then applied to the Koopman operator formulation of dynamical systems where we derive a controller for active learning of robot dynamics. The active learning controller is shown to increase the rate of information about the Koopman operator. In addition, our active learning controller can readily incorporate policies built on the Koopman dynamics, enabling the benefits of fast active learning and improved control. Results using a quadcopter illustrate single-execution active learning and stabilization capabilities during free fall. The results for active learning are extended for automating Koopman observables and we implement our method on real robotic systems.
引用
收藏
页码:1071 / 1083
页数:13
相关论文
共 64 条
[1]  
[Anonymous], 2017, ARXIV170802596
[2]  
[Anonymous], 2017, ARXIV170806850
[3]  
[Anonymous], 2017, P ROBOTICS SCI SYSTE, DOI DOI 10.15607/RSS.2017.XIII.048
[4]  
[Anonymous], 2016, PROC INT C MACH LEAR
[5]  
[Anonymous], 2008 IEEE INT C ROB
[6]  
[Anonymous], 2016, P 29 C NEUR INF PROC
[7]  
[Anonymous], 2017, ROBOTICS SCI SYSTEMS, DOI [DOI 10.15607/RSS.2017.XIII.052, 10.15607/RSS.2017.XIII.052]
[8]  
[Anonymous], 2012, ELEMENTS INFORM THEO
[9]  
[Anonymous], 2016, ARXIV161103537
[10]  
[Anonymous], 2007, OPTIMAL CONTROL LINE