Integro-differential optimality equations for the risk-sensitive control of piecewise deterministic Markov processes

被引：0

作者：

O. L. V. Costa

F. Dufour

机构：

[1] Escola Politécnica da Universidade de São Paulo,Departamento de Engenharia de Telecomunicações e Controle

[2] Université de Bordeaux,Institut Polytechnique de Bordeaux, INRIA Bordeaux Sud Ouest, Team: CQFD, IMB, Institut de Mathématiques de Bordeaux

来源：

Mathematical Methods of Operations Research | 2021年 / 93卷

关键词：

Risk-sensitive control problem; Continuous control; Piecewise deterministic Markov process; Continuous-time Markov decision process; Primary 90C40; Secondary 60J25;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper we study the minimization problem of the infinite-horizon expected exponential utility total cost for continuous-time piecewise deterministic Markov processes with the control acting continuously on the jump intensity λ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda $$\end{document} and on the transition measure Q of the process. The action space is supposed to depend on the state variable and the state space is considered to have a frontier such that the process jumps whenever it touches this boundary. We characterize the optimal value function as the minimal solution of an integro-differential optimality equation satisfying some boundary conditions, as well as the existence of a deterministic stationary optimal policy. These results are obtained by using the so-called policy iteration algorithm, under some continuity and compactness assumptions on the parameters of the problem, as well as some non-explosive conditions for the process.

引用

页码：327 / 357

页数：30