Distributionally Robust Chance-Constrained Markov Decision Processes with Random Payoff

被引:0
|
作者
Nguyen, Hoang Nam [1 ]
Lisser, Abdel [1 ]
Singh, Vikas Vikram [2 ]
机构
[1] Univ Paris Saclay, Lab Signals & Syst, CNRS, CentraleSupelec, 3 Rue Joliot Curie, F-91190 Gif Sur Yvette, France
[2] Indian Inst Technol Delhi, Dept Math, New Delhi 110016, India
来源
APPLIED MATHEMATICS AND OPTIMIZATION | 2024年 / 90卷 / 01期
关键词
Markov decision process; Distributionally robust chance constraints; Second-order cone programming; Copositive optimization; Mix-integer second-order cone programming; Biconvex optimization; OPTIMIZATION PROBLEMS; RISK; UNCERTAINTY;
D O I
10.1007/s00245-024-10167-w
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
A Markov Decision Process (MDP) is a framework used for decision-making. In an MDP problem, the decision maker's goal is to maximize the expected discounted value of future rewards while navigating through different states controlled by a Markov chain. In this paper, we focus on the case where the transition probabilities vector is deterministic, while the reward vector is uncertain and follow a partially known distribution. We employ a distributionally robust chance constraints approach to model the MDP. This approach entails the construction of potential distributions of reward vector, characterized by moments or statistical metrics. We explore two situations for these ambiguity sets: one where the reward vector has a real support and another where it is constrained to be nonnegative. In the case of a real support, we demonstrate that solving the distributionally robust chance-constrained Markov decision process is mathematically equivalent to a second-order cone programming problem for moments and phi\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi $$\end{document}-divergence ambiguity sets. For Wasserstein distance ambiguity sets, it becomes a mixed-integer second-order cone programming problem. In contrast, when dealing with nonnegative reward vector, the equivalent optimization problems are different. Moments-based ambiguity sets lead to a copositive optimization problem, while Wasserstein distance-based ambiguity sets result in a biconvex optimization problem. To illustrate the practical application of these methods, we examine a machine replacement problem and present results conducted on randomly generated instances to showcase the effectiveness of our proposed methods.
引用
收藏
页数:39
相关论文
共 50 条
  • [1] A dynamical neural network approach for distributionally robust chance-constrained Markov decision process
    Xia, Tian
    Liu, Jia
    Chen, Zhiping
    SCIENCE CHINA-MATHEMATICS, 2024, 67 (06) : 1395 - 1418
  • [2] A dynamical neural network approach for distributionally robust chance-constrained Markov decision process
    Tian Xia
    Jia Liu
    Zhiping Chen
    Science China(Mathematics), 2024, 67 (06) : 1395 - 1418
  • [3] Joint chance-constrained Markov decision processes
    V Varagapriya
    Vikas Vikram Singh
    Abdel Lisser
    Annals of Operations Research, 2023, 322 : 1013 - 1035
  • [4] Joint chance-constrained Markov decision processes
    Varagapriya, V.
    Singh, Vikas Vikram
    Lisser, Abdel
    ANNALS OF OPERATIONS RESEARCH, 2023, 322 (02) : 1013 - 1035
  • [5] On Distributionally Robust Chance-Constrained Linear Programs
    G. C. Calafiore
    L. El Ghaoui
    Journal of Optimization Theory and Applications, 2006, 130 : 1 - 22
  • [6] On distributionally robust chance-constrained linear programs
    Calafiore, G. C.
    El Ghaoui, L.
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2006, 130 (01) : 1 - 22
  • [7] The Distributionally Robust Chance-Constrained Vehicle Routing Problem
    Ghosal, Shubhechyya
    Wiesemann, Wolfram
    OPERATIONS RESEARCH, 2020, 68 (03) : 716 - 732
  • [8] Bayesian Optimization for Distributionally Robust Chance-constrained Problem
    Inatsu, Yu
    Takeno, Shion
    Karasuyama, Masayuki
    Takeuchi, Ichiro
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [9] DISTRIBUTIONALLY ROBUST CHANCE-CONSTRAINED MINIMUM VARIANCE BEAMFORMING
    Zhang, Xiao
    Feng, Qiang
    Ge, Ning
    Lu, Jianhua
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2881 - 2885
  • [10] Distributionally Robust Chance-Constrained Generation Expansion Planning
    Pourahmadi, Farzaneh
    Kazempour, Jalal
    Ordoudis, Christos
    Pinson, Pierre
    Hosseini, Seyed Hamid
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (04) : 2888 - 2903