Data-Driven Anytime Algorithms for Motion Planning with Safety Guarantees

被引：0

作者：

Jha, Devesh K. ^{[1
]}

Zhu, Minghui ^{[2
]}

Wang, Yebin ^{[3
]}

Ray, Asok ^{[1
]}

机构：

[1] Penn State Univ, Mech & Nucl Engn Dept, University Pk, PA 16802 USA

[2] Penn State Univ, Dept Elect Engn, University Pk, PA 16802 USA

[3] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA

来源：

2016 AMERICAN CONTROL CONFERENCE (ACC) | 2016年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a learning-based (i.e., data-driven) approach to motion planning of robotic systems. This is motivated by controller synthesis problems for safety critical systems where an accurate estimate of the uncertainties (e.g., unmodeled dynamics, disturbance) can improve the performance of the system. The state-space of the system is built by sampling from the state-set as well as the input set of the underlying system. The robust adaptive motion planning problem is modeled as a learning-based approach evasion differential game, where a machine-learning algorithm is used to update the statistical estimates of the uncertainties from system observations. The system begins with a conservative estimate of the uncertainty set to ensure safety of the underlying system and we relax the robustness constraints as we get better estimates of the unmodeled uncertainty. The estimates from the machine learning algorithm are used to refine the estimates of the controller in an anytime fashion. We show that the values for the game converges to the optimal values with known disturbance given the statistical estimates on the uncertainty converges. Using confidence intervals for the unmodeled disturbance estimated by the machine learning estimator during the transient learning phase, we are able to guarantee safety of the robotic system with the proposed algorithms during transience.

引用

页码：5716 / 5721

页数：6

共 22 条

[1] Akametalu AK, 2014, IEEE DECIS CONTR P, P1424, DOI 10.1109/CDC.2014.7039601
[2] [Anonymous], 2013, ALGORITHMIC FDN ROBO
[3] [Anonymous], 2009, SET VALUED ANAL
[4] Provably safe and robust learning-based model predictive control
Aswani, Anil
Gonzalez, Humberto
Sastry, S. Shankar
Tomlin, Claire
[J]. AUTOMATICA, 2013, 49 (05) : 1216 - 1226
[5] Bertsekas DP, 1995, PROCEEDINGS OF THE 34TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P560, DOI 10.1109/CDC.1995.478953
[6] A Probabilistic Particle-Control Approximation of Chance-Constrained Stochastic Predictive Control
Blackmore, Lars
Ono, Masahiro
Bektassov, Askar
Williams, Brian C.
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2010, 26 (03) : 502 - 517
[7] Bry Adam, 2011, IEEE International Conference on Robotics and Automation, P723
[8] Approximate Confidence and Prediction Intervals for Least Squares Support Vector Regression
De Brabanter, Kris
De Brabanter, Jos
Suykens, Johan A. K.
De Moor, Bart
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (01): : 110 - 120
[9] De Brabanter Kris., 2011, Least squares support vector regression with applications to large-scale data: a statistical approach
[10] Gillula J. H., 2012, ROBOTICS SCI SYSTEMS

← 1 2 3 →