Integro-differential optimality equations for the risk-sensitive control of piecewise deterministic Markov processes

被引:0
作者
O. L. V. Costa
F. Dufour
机构
[1] Escola Politécnica da Universidade de São Paulo,Departamento de Engenharia de Telecomunicações e Controle
[2] Université de Bordeaux,Institut Polytechnique de Bordeaux, INRIA Bordeaux Sud Ouest, Team: CQFD, IMB, Institut de Mathématiques de Bordeaux
来源
Mathematical Methods of Operations Research | 2021年 / 93卷
关键词
Risk-sensitive control problem; Continuous control; Piecewise deterministic Markov process; Continuous-time Markov decision process; Primary 90C40; Secondary 60J25;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper we study the minimization problem of the infinite-horizon expected exponential utility total cost for continuous-time piecewise deterministic Markov processes with the control acting continuously on the jump intensity λ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda $$\end{document} and on the transition measure Q of the process. The action space is supposed to depend on the state variable and the state space is considered to have a frontier such that the process jumps whenever it touches this boundary. We characterize the optimal value function as the minimal solution of an integro-differential optimality equation satisfying some boundary conditions, as well as the existence of a deterministic stationary optimal policy. These results are obtained by using the so-called policy iteration algorithm, under some continuity and compactness assumptions on the parameters of the problem, as well as some non-explosive conditions for the process.
引用
收藏
页码:327 / 357
页数:30
相关论文
共 22 条
[21]  
Pradhan S(undefined)undefined undefined undefined undefined-undefined
[22]  
Zhang Y(undefined)undefined undefined undefined undefined-undefined