Safe Reinforcement Learning Using Wasserstein Distributionally Robust MPC and Chance Constraint

Cited by: 5
Authors
Kordabad, Arash Bahari [1]
Wisniewski, Rafael [2]
Gros, Sebastien [1]
Affiliations
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, N-7034 Trondheim, Norway
[2] Aalborg Univ, Dept Elect Syst, DK-9220 Aalborg, Denmark
Keywords
Safe reinforcement learning; model predictive control; distributionally robust optimization; chance constraint; conditional value at risk; Q-learning; optimization
DOI
10.1109/ACCESS.2022.3228922
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
In this paper, we address the chance-constrained safe Reinforcement Learning (RL) problem using function approximators based on Stochastic Model Predictive Control (SMPC) and Distributionally Robust Model Predictive Control (DRMPC). We use Conditional Value at Risk (CVaR) to measure the probability of constraint violation and safety. To provide a policy that is safe by construction, we first propose using a parameterized nonlinear DRMPC at each time step. DRMPC optimizes a finite-horizon cost function subject to the worst-case constraint violation in an ambiguity set. As the ambiguity set, we use a statistical ball around the empirical distribution whose radius is measured by the Wasserstein metric. Unlike SMPC based on sample average approximation, DRMPC provides a probabilistic guarantee on the out-of-sample risk and requires fewer disturbance samples. Q-learning is then used to optimize the DRMPC parameters to achieve the best closed-loop performance. Wheeled Mobile Robot (WMR) path planning with obstacle avoidance is considered to illustrate the efficiency of the proposed method.
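As a rough illustration of how CVaR can stand in for a chance constraint, the sketch below computes the CVaR of an empirical distribution with the Rockafellar-Uryasev formula. It is not the paper's DRMPC formulation: it covers only the sample-based case (no Wasserstein ball), and the function name `empirical_cvar`, the sample values `g_values`, and the risk level `eps` are hypothetical choices made for this example. The underlying fact is standard: if the CVaR at level 1 - eps of a constraint function g is nonpositive, then g > 0 occurs with probability at most eps under the same distribution.

```python
def empirical_cvar(samples, eps):
    """CVaR at level 1 - eps of the empirical distribution of `samples`.

    Uses the Rockafellar-Uryasev formula
        CVaR = min_t { t + E[max(z - t, 0)] / eps }.
    """
    if not samples or not 0.0 < eps <= 1.0:
        raise ValueError("need at least one sample and 0 < eps <= 1")
    n = len(samples)

    def objective(t):
        return t + sum(max(z - t, 0.0) for z in samples) / (eps * n)

    # The objective is convex and piecewise linear in t, with kinks only at
    # the sample values, so its minimum is attained at one of the samples.
    return min(objective(t) for t in samples)


# Hypothetical constraint values g(x, w_i) for ten disturbance samples;
# g <= 0 means the constraint is satisfied for that sample.
g_values = [-1.0, -0.9, -0.8, -0.7, -0.6, -0.5, -0.4, -0.3, -0.2, 0.1]
eps = 0.2  # tolerated violation probability (assumed for illustration)

cvar = empirical_cvar(g_values, eps)
violation_rate = sum(g > 0.0 for g in g_values) / len(g_values)

print("empirical CVaR:", cvar)
print("empirical violation rate:", violation_rate)
```

With these made-up samples the CVaR is the average of the worst 20% of values (roughly -0.05), so the CVaR constraint holds and the empirical violation rate of 0.1 stays below eps. A distributionally robust variant would instead bound the worst case of this quantity over all distributions within a Wasserstein ball around the empirical one.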
Pages: 130058-130067
Number of pages: 10