ERGODIC THEORY FOR CONTROLLED MARKOV CHAINS WITH STATIONARY INPUTS

Times cited: 0
Authors
Chen, Yue [1]
Busic, Ana [2, 3]
Meyn, Sean [1]
Affiliations
[1] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
[2] INRIA, Paris, France
[3] Ecole Normale Super, Comp Sci Dept, Paris, France
Funding
National Science Foundation (USA);
Keywords
Controlled Markov chain; ergodic theory; information theory; mean-field control; population; games;
DOI
10.1214/17-AAP1300
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Subject classification codes
020208 ; 070103 ; 0714 ;
Abstract
Consider a stochastic process $X$ on a finite state space $\mathsf{X} = \{1,\dots,d\}$. It is conditionally Markov, given a real-valued "input process" $\sigma$. The input is assumed to be small, which is modeled through the scaling $\sigma(t) = \varepsilon\,\sigma^{1}(t)$, $0 \le \varepsilon \le 1$, where $\sigma^{1}$ is a bounded stationary process. The following conclusions are obtained, subject to smoothness assumptions on the controlled transition matrix and a mixing condition on $\sigma$: (i) A stationary version of the process is constructed that is coupled with a stationary version of the Markov chain $X^{\bullet}$ obtained with $\sigma \equiv 0$. The triple $(X, X^{\bullet}, \sigma)$ is a jointly stationary process satisfying $\mathsf{P}\{X(t) \neq X^{\bullet}(t)\} = O(\varepsilon)$. Moreover, a second-order Taylor series approximation is obtained: $\mathsf{P}\{X(t) = i\} = \mathsf{P}\{X^{\bullet}(t) = i\} + \varepsilon^{2}\pi^{(2)}(i) + o(\varepsilon^{2})$, $1 \le i \le d$, with an explicit formula for the vector $\pi^{(2)} \in \mathbb{R}^{d}$. (ii) For any $m \ge 1$ and any function $f : \{1,\dots,d\} \times \mathbb{R} \to \mathbb{R}^{m}$, the stationary stochastic process $Y(t) = f(X(t), \sigma(t))$ has a power spectral density $S_f$ that admits a second-order Taylor series expansion: a function $S_f^{(2)} : [-\pi, \pi] \to \mathbb{C}^{m \times m}$ is constructed such that $S_f(\theta) = S_f^{\bullet}(\theta) + \varepsilon^{2} S_f^{(2)}(\theta) + o(\varepsilon^{2})$, $\theta \in [-\pi, \pi]$, in which the first term is the power spectral density obtained with $\varepsilon = 0$. An explicit formula for the function $S_f^{(2)}$ is obtained, based in part on the bounds in (i). The results are illustrated with two general examples: mean field games, and a version of the timing channel of Anantharam and Verdu.
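The $\varepsilon^{2}$ scaling in (i) can be checked numerically in one very special case. The sketch below is not the paper's construction and does not use its formula for $\pi^{(2)}$. It assumes a hypothetical two-state controlled transition matrix `controlled_matrix`, an input $\sigma^{1}$ that is i.i.d. and uniform on $\{-1, +1\}$, and the convention that $X(t+1)$ is drawn from row $X(t)$ of $P_{\sigma(t)}$. Under these assumptions $X$ is itself a Markov chain whose transition matrix is the average of $P_{\varepsilon}$ and $P_{-\varepsilon}$, so its stationary marginal can be computed exactly and compared with the $\varepsilon = 0$ case.

```python
import numpy as np

def controlled_matrix(sigma):
    """Hypothetical 2-state controlled transition matrix P_sigma, smooth in sigma."""
    a = 0.3 * np.exp(sigma)  # jump probability 0 -> 1; stays below 1 for |sigma| <= 1
    b = 0.4                  # jump probability 1 -> 0; not affected by the input
    return np.array([[1.0 - a, a], [b, 1.0 - b]])

def stationary(P):
    """Solve pi P = pi with sum(pi) = 1 for an irreducible transition matrix P."""
    d = P.shape[0]
    A = P.T - np.eye(d)
    A[-1, :] = 1.0           # replace one redundant balance equation by normalization
    rhs = np.zeros(d)
    rhs[-1] = 1.0
    return np.linalg.solve(A, rhs)

def marginal(eps):
    """Stationary marginal of X for sigma(t) = eps * sigma1(t), sigma1 i.i.d. on {-1, +1}."""
    # With an i.i.d. input, sigma(t) is independent of X(t), so the marginal
    # of X evolves under the input-averaged transition matrix.
    P_avg = 0.5 * (controlled_matrix(eps) + controlled_matrix(-eps))
    return stationary(P_avg)

pi0 = marginal(0.0)
ratios = {eps: (marginal(eps) - pi0) / eps**2 for eps in (0.4, 0.2, 0.1, 0.05)}
for eps, ratio in ratios.items():
    print(f"eps={eps:<5} (pi_eps - pi_0) / eps^2 = {ratio}")
```

If the printed ratios settle to a fixed vector as `eps` shrinks, the deviation from the unperturbed marginal is of order $\varepsilon^{2}$ in this toy model, in line with the expansion stated in the abstract. A dependent (for example, Markovian) input would need the joint process $(X, \sigma)$ instead of a simple average.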
Pages: 79-111
Page count: 33
Related papers
21 in total
  • [1] Oblivious Equilibrium: An Approximation to Large Population Dynamic Games with Concave Utility
    Adlakha, Sachin
    Johari, Ramesh
    Weintraub, Gabriel
    Goldsmith, Andrea
    [J]. 2009 INTERNATIONAL CONFERENCE ON GAME THEORY FOR NETWORKS (GAMENETS 2009), 2009, : 68 - +
  • [2] Bits through queues
    Anantharam, V
    Verdu, S
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1996, 42 (01) : 4 - 18
  • [3] CAINES P. E., 1988, LINEAR STOCHASTIC SY
  • [4] Chen Y., 2016, THESIS
  • [5] Chen Y., 2017, IEEE T SMART GRID
  • [6] State Estimation for the Individual and the Population in Mean Field Control With Application to Demand Dispatch
    Chen, Yue
    Busic, Ana
    Meyn, Sean P.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (03) : 1138 - 1149
  • [7] Chen Y, 2014, IEEE DECIS CONTR P, P6425, DOI 10.1109/CDC.2014.7040397
  • [8] Cover T., 1991, Wiley Series in Telecommunications, DOI 10.1002/0471200611
  • [9] DEMBO A., 1998, Large Deviations Techniques and Applications, V38, DOI 10.1007/978-1-4612-5320-4
  • [10] An overview of some stochastic stability methods
    Foss, S
    Konstantopoulos, T
    [J]. JOURNAL OF THE OPERATIONS RESEARCH SOCIETY OF JAPAN, 2004, 47 (04) : 275 - 303