Distributionally Robust Chance-Constrained Markov Decision Processes with Random Payoff

被引：0

作者：

Nguyen, Hoang Nam ^{[1
]}

Lisser, Abdel ^{[1
]}

Singh, Vikas Vikram ^{[2
]}

机构：

[1] Univ Paris Saclay, Lab Signals & Syst, CNRS, CentraleSupelec, 3 Rue Joliot Curie, F-91190 Gif Sur Yvette, France

[2] Indian Inst Technol Delhi, Dept Math, New Delhi 110016, India

来源：

APPLIED MATHEMATICS AND OPTIMIZATION | 2024年 / 90卷 / 01期

关键词：

Markov decision process; Distributionally robust chance constraints; Second-order cone programming; Copositive optimization; Mix-integer second-order cone programming; Biconvex optimization; OPTIMIZATION PROBLEMS; RISK; UNCERTAINTY;

D O I：

10.1007/s00245-024-10167-w

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

A Markov Decision Process (MDP) is a framework used for decision-making. In an MDP problem, the decision maker's goal is to maximize the expected discounted value of future rewards while navigating through different states controlled by a Markov chain. In this paper, we focus on the case where the transition probabilities vector is deterministic, while the reward vector is uncertain and follow a partially known distribution. We employ a distributionally robust chance constraints approach to model the MDP. This approach entails the construction of potential distributions of reward vector, characterized by moments or statistical metrics. We explore two situations for these ambiguity sets: one where the reward vector has a real support and another where it is constrained to be nonnegative. In the case of a real support, we demonstrate that solving the distributionally robust chance-constrained Markov decision process is mathematically equivalent to a second-order cone programming problem for moments and phi\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi $$\end{document}-divergence ambiguity sets. For Wasserstein distance ambiguity sets, it becomes a mixed-integer second-order cone programming problem. In contrast, when dealing with nonnegative reward vector, the equivalent optimization problems are different. Moments-based ambiguity sets lead to a copositive optimization problem, while Wasserstein distance-based ambiguity sets result in a biconvex optimization problem. To illustrate the practical application of these methods, we examine a machine replacement problem and present results conducted on randomly generated instances to showcase the effectiveness of our proposed methods.

引用

页数：39

共 50 条

[1] A dynamical neural network approach for distributionally robust chance-constrained Markov decision process
Xia, Tian
Liu, Jia
Chen, Zhiping
SCIENCE CHINA-MATHEMATICS, 2024, 67 (06) : 1395 - 1418
[2] A dynamical neural network approach for distributionally robust chance-constrained Markov decision process
Tian Xia
Jia Liu
Zhiping Chen
Science China(Mathematics), 2024, 67 (06) : 1395 - 1418
[3] Joint chance-constrained Markov decision processes
V Varagapriya
Vikas Vikram Singh
Abdel Lisser
Annals of Operations Research, 2023, 322 : 1013 - 1035
[4] Joint chance-constrained Markov decision processes
Varagapriya, V.
Singh, Vikas Vikram
Lisser, Abdel
ANNALS OF OPERATIONS RESEARCH, 2023, 322 (02) : 1013 - 1035
[5] On Distributionally Robust Chance-Constrained Linear Programs
G. C. Calafiore
L. El Ghaoui
Journal of Optimization Theory and Applications, 2006, 130 : 1 - 22
[6] On distributionally robust chance-constrained linear programs
Calafiore, G. C.
El Ghaoui, L.
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2006, 130 (01) : 1 - 22
[7] The Distributionally Robust Chance-Constrained Vehicle Routing Problem
Ghosal, Shubhechyya
Wiesemann, Wolfram
OPERATIONS RESEARCH, 2020, 68 (03) : 716 - 732
[8] Bayesian Optimization for Distributionally Robust Chance-constrained Problem
Inatsu, Yu
Takeno, Shion
Karasuyama, Masayuki
Takeuchi, Ichiro
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[9] DISTRIBUTIONALLY ROBUST CHANCE-CONSTRAINED MINIMUM VARIANCE BEAMFORMING
Zhang, Xiao
Feng, Qiang
Ge, Ning
Lu, Jianhua
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2881 - 2885
[10] Distributionally Robust Chance-Constrained Generation Expansion Planning
Pourahmadi, Farzaneh
Kazempour, Jalal
Ordoudis, Christos
Pinson, Pierre
Hosseini, Seyed Hamid
IEEE TRANSACTIONS ON POWER SYSTEMS, 2020, 35 (04) : 2888 - 2903

← 1 2 3 4 5 →