ERGODIC THEORY FOR CONTROLLED MARKOV CHAINS WITH STATIONARY INPUTS

Cited by: 0

Authors:

Chen, Yue [1]

Busic, Ana [2,3]

Meyn, Sean [1]

Affiliations:

[1] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA

[2] INRIA, Paris, France

[3] Ecole Normale Super, Comp Sci Dept, Paris, France

Source:

ANNALS OF APPLIED PROBABILITY | 2018 / Volume 28 / Issue 01

Funding:

National Science Foundation (USA);

Keywords:

Controlled Markov chain; ergodic theory; information theory; MEAN-FIELD CONTROL; POPULATION; GAMES;

DOI:

10.1214/17-AAP1300

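Chinese Library Classification number:

O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];

Discipline classification code:

020208 ; 070103 ; 0714 ;

Abstract: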

Consider a stochastic process $X$ on a finite state space $\mathsf{X} = \{1, \dots, d\}$. It is conditionally Markov, given a real-valued "input process" $\sigma$. This input is assumed to be small, which is modeled through the scaling $\sigma(t) = \varepsilon \sigma^{1}(t)$, $0 \le \varepsilon \le 1$, where $\sigma^{1}$ is a bounded stationary process. The following conclusions are obtained, subject to smoothness assumptions on the controlled transition matrix and a mixing condition on $\sigma$: (i) A stationary version of the process is constructed that is coupled with a stationary version of the Markov chain $X^{\bullet}$ obtained with $\sigma \equiv 0$. The triple $(X, X^{\bullet}, \sigma)$ is a jointly stationary process satisfying $P\{X(t) \neq X^{\bullet}(t)\} = O(\varepsilon)$. Moreover, a second-order Taylor series approximation is obtained: $P\{X(t) = i\} = P\{X^{\bullet}(t) = i\} + \varepsilon^{2} \pi^{(2)}(i) + o(\varepsilon^{2})$, $1 \le i \le d$, with an explicit formula for the vector $\pi^{(2)} \in \mathbb{R}^{d}$. (ii) For any $m \ge 1$ and any function $f : \{1, \dots, d\} \times \mathbb{R} \to \mathbb{R}^{m}$, the stationary stochastic process $Y(t) = f(X(t), \sigma(t))$ has a power spectral density $S_f$ that admits a second-order Taylor series expansion: a function $S_f^{(2)} : [-\pi, \pi] \to \mathbb{C}^{m \times m}$ is constructed such that $S_f(\theta) = S_f^{\bullet}(\theta) + \varepsilon^{2} S_f^{(2)}(\theta) + o(\varepsilon^{2})$, $\theta \in [-\pi, \pi]$, in which the first term is the power spectral density obtained with $\varepsilon = 0$. An explicit formula for the function $S_f^{(2)}$ is obtained, based in part on the bounds in (i). The results are illustrated with two general examples: mean field games, and a version of the timing channel of Anantharam and Verdu.
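
The second-order scaling in (i) can be checked numerically in a very special case. The sketch below is not the paper's construction: it assumes a hypothetical two-state controlled transition matrix (`transition_matrix`, built from logistic functions chosen only for smoothness) and an i.i.d. input taking the values +epsilon and -epsilon with equal probability. Under that i.i.d. assumption, X is itself a Markov chain whose transition matrix is the average of the two controlled matrices, so the stationary marginal can be computed exactly and compared with the epsilon = 0 chain.

```python
import numpy as np

def transition_matrix(sigma):
    """Hypothetical smooth controlled transition matrix on two states."""
    a = 1.0 / (1.0 + np.exp(-(sigma - 1.0)))    # probability of moving 1 -> 2
    b = 1.0 / (1.0 + np.exp(-(-sigma - 0.5)))   # probability of moving 2 -> 1
    return np.array([[1.0 - a, a], [b, 1.0 - b]])

def stationary(P):
    """Solve pi P = pi with sum(pi) = 1 for an irreducible stochastic matrix."""
    d = P.shape[0]
    A = np.vstack([P.T - np.eye(d), np.ones(d)])
    rhs = np.zeros(d + 1)
    rhs[-1] = 1.0
    pi, *_ = np.linalg.lstsq(A, rhs, rcond=None)
    return pi

def marginal_shift(eps):
    """Stationary marginal with i.i.d. input +/- eps, minus the eps = 0 marginal."""
    # Assumption: the input is i.i.d. and symmetric, so X is Markov with the
    # averaged transition matrix below. General stationary inputs need more.
    P_avg = 0.5 * (transition_matrix(eps) + transition_matrix(-eps))
    return stationary(P_avg) - stationary(transition_matrix(0.0))

# If the shift is of order eps^2, these ratios should be nearly constant in eps.
ratios = {eps: marginal_shift(eps) / eps**2 for eps in (0.2, 0.1, 0.05)}
for eps, ratio in ratios.items():
    print(eps, ratio)
```

The printed ratios play the role of the vector $\pi^{(2)}$ for this toy model only. A correlated input process would require the explicit formula from the paper, which this sketch does not attempt to reproduce.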


Pages: 79-111

Number of pages: 33

References (21 in total):

[1] Oblivious Equilibrium: An Approximation to Large Population Dynamic Games with Concave Utility
Adlakha, Sachin
Johari, Ramesh
Weintraub, Gabriel
Goldsmith, Andrea
[J]. 2009 INTERNATIONAL CONFERENCE ON GAME THEORY FOR NETWORKS (GAMENETS 2009), 2009, : 68 - +
[2] Bits through queues
Anantharam, V
Verdu, S
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1996, 42 (01) : 4 - 18
[3] CAINES P. E., 1988, LINEAR STOCHASTIC SY
[4] Chen Y., 2016, THESIS
[5] Chen Y., 2017, IEEE T SMART GRID
[6] State Estimation for the Individual and the Population in Mean Field Control With Application to Demand Dispatch
Chen, Yue
Busic, Ana
Meyn, Sean P.
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (03) : 1138 - 1149
[7] Chen Y, 2014, IEEE DECIS CONTR P, P6425, DOI 10.1109/CDC.2014.7040397
[8] Cover T., 1991, Wiley Series in Telecommunications, DOI 10.1002/0471200611
[9] DEMBO A., 1998, Large Deviations Techniques and Applications, V38, DOI 10.1007/978-1-4612-5320-4
[10] An overview of some stochastic stability methods
Foss, S
Konstantopoulos, T
[J]. JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN, 2004, 47 (04) : 275 - 303
