Data-Driven Anytime Algorithms for Motion Planning with Safety Guarantees

被引:0
作者
Jha, Devesh K. [1 ]
Zhu, Minghui [2 ]
Wang, Yebin [3 ]
Ray, Asok [1 ]
机构
[1] Penn State Univ, Mech & Nucl Engn Dept, University Pk, PA 16802 USA
[2] Penn State Univ, Dept Elect Engn, University Pk, PA 16802 USA
[3] Mitsubishi Elect Res Labs, Cambridge, MA 02139 USA
来源
2016 AMERICAN CONTROL CONFERENCE (ACC) | 2016年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a learning-based (i.e., data-driven) approach to motion planning of robotic systems. This is motivated by controller synthesis problems for safety critical systems where an accurate estimate of the uncertainties (e.g., unmodeled dynamics, disturbance) can improve the performance of the system. The state-space of the system is built by sampling from the state-set as well as the input set of the underlying system. The robust adaptive motion planning problem is modeled as a learning-based approach evasion differential game, where a machine-learning algorithm is used to update the statistical estimates of the uncertainties from system observations. The system begins with a conservative estimate of the uncertainty set to ensure safety of the underlying system and we relax the robustness constraints as we get better estimates of the unmodeled uncertainty. The estimates from the machine learning algorithm are used to refine the estimates of the controller in an anytime fashion. We show that the values for the game converges to the optimal values with known disturbance given the statistical estimates on the uncertainty converges. Using confidence intervals for the unmodeled disturbance estimated by the machine learning estimator during the transient learning phase, we are able to guarantee safety of the robotic system with the proposed algorithms during transience.
引用
收藏
页码:5716 / 5721
页数:6
相关论文
共 22 条
  • [1] Akametalu AK, 2014, IEEE DECIS CONTR P, P1424, DOI 10.1109/CDC.2014.7039601
  • [2] [Anonymous], 2013, ALGORITHMIC FDN ROBO
  • [3] [Anonymous], 2009, SET VALUED ANAL
  • [4] Provably safe and robust learning-based model predictive control
    Aswani, Anil
    Gonzalez, Humberto
    Sastry, S. Shankar
    Tomlin, Claire
    [J]. AUTOMATICA, 2013, 49 (05) : 1216 - 1226
  • [5] Bertsekas DP, 1995, PROCEEDINGS OF THE 34TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P560, DOI 10.1109/CDC.1995.478953
  • [6] A Probabilistic Particle-Control Approximation of Chance-Constrained Stochastic Predictive Control
    Blackmore, Lars
    Ono, Masahiro
    Bektassov, Askar
    Williams, Brian C.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2010, 26 (03) : 502 - 517
  • [7] Bry Adam, 2011, IEEE International Conference on Robotics and Automation, P723
  • [8] Approximate Confidence and Prediction Intervals for Least Squares Support Vector Regression
    De Brabanter, Kris
    De Brabanter, Jos
    Suykens, Johan A. K.
    De Moor, Bart
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (01): : 110 - 120
  • [9] De Brabanter Kris., 2011, Least squares support vector regression with applications to large-scale data: a statistical approach
  • [10] Gillula J. H., 2012, ROBOTICS SCI SYSTEMS