Simultaneous Impulse and Continuous Control of a Markov Chain in Continuous Time

被引:3
作者
Miller, A. B. [1 ,2 ]
Miller, B. M. [1 ,2 ,3 ]
Stepanyan, K. V. [1 ]
机构
[1] Russian Acad Sci, Kharkevich Inst Informat Transmiss Problems, Moscow, Russia
[2] Kazan Fed Univ, Kazan, Russia
[3] Monash Univ, Melbourne, Vic, Australia
关键词
Markov chain; impulse controls; quasi-variational inequality; STOCHASTIC-CONTROL; DECISION-PROCESSES; DAM; MANAGEMENT; FLOW;
D O I
10.1134/S0005117920030066
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider continuous and impulse control of a Markov chain (MC) with a finite set of states in continuous time. Continuous control determines the intensity of transitions between MC states, while transition times and their directions are random. Nevertheless, sometimes it is necessary to ensure a transition that leads to an instantaneous change in the state of the MC. Since such transitions require different influences and can produce different effects on the state of the MC, such controls can be interpreted as impulse controls. In this work, we use the martingale representation of a controllable MC and give an optimality condition, which, using the principle of dynamic programming, is reduced to a form of quasi-variational inequality. The solution to this inequality can be obtained in the form of a dynamic programming equation, which for an MC with a finite set of states reduces to a system of ordinary differential equations with one switching line. We prove a sufficient optimality condition and give examples of problems with deterministic and random impulse action.
引用
收藏
页码:469 / 482
页数:14
相关论文
共 36 条
[1]   Infinite horizon optimal impulsive control with applications to Internet congestion control [J].
Avrachenkov, Konstantin ;
Habachi, Oussama ;
Piunovskiy, Alexey ;
Zhang, Yi .
INTERNATIONAL JOURNAL OF CONTROL, 2015, 88 (04) :703-716
[2]   Average cost under the PMλ,τ policy in a finite dam with compound Poisson inputs [J].
Bae, J ;
Kim, S ;
Lee, EY .
JOURNAL OF APPLIED PROBABILITY, 2003, 40 (02) :519-526
[3]  
Bensoussan A, 1987, IMPULSNOE UPRAVLENIE
[4]  
Bensoussan A, 1982, CONTROLE IMPULSIONNE
[5]  
Bremaud P, 1981, Point Processes and Queues: Martingale Dynamics
[6]   OPTIMAL-CONTROL OF MARKOV-CHAINS ADMITTING STRONG AND WEAK-INTERACTIONS [J].
DELEBECQUE, F ;
QUADRAT, JP .
AUTOMATICA, 1981, 17 (02) :281-296
[7]   IMPULSE CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES [J].
Dempster, M. A. H. ;
Ye, J. J. .
ANNALS OF APPLIED PROBABILITY, 1995, 5 (02) :399-423
[8]   Optimal impulsive control of piecewise deterministic Markov processes [J].
Dufour, F. ;
Horiguchi, M. ;
Piunovskiy, A. B. .
STOCHASTICS-AN INTERNATIONAL JOURNAL OF PROBABILITY AND STOCHASTIC REPORTS, 2016, 88 (07) :1073-1098
[9]   Impulsive Control for Continuous-Time Markov Decision Processes: A Linear Programming Approach [J].
Dufour, F. ;
Piunovskiy, A. B. .
APPLIED MATHEMATICS AND OPTIMIZATION, 2016, 74 (01) :129-161
[10]   Generalized solutions in nonlinear stochastic control problems [J].
Dufour, F ;
Miller, BM .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2002, 40 (06) :1724-1745