Markov Chains With Maximum Return Time Entropy for Robotic Surveillance

被引:23
作者
Duan, Xiaoming [1 ,2 ]
George, Mishel [1 ,2 ]
Bullo, Francesco [1 ,2 ]
机构
[1] Univ Calif Santa Barbara, Dept Mech Engn, Santa Barbara, CA 93106 USA
[2] Univ Calif Santa Barbara, Ctr Control Dynam Syst & Computat, Santa Barbara, CA 93106 USA
关键词
Entropy; Markov processes; Surveillance; Robots; Random variables; Topology; Markov chains; return time entropy; stochastic surveillance;
D O I
10.1109/TAC.2019.2906473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Motivated by robotic surveillance applications, this paper studies the novel problem of maximizing the return time entropy of a Markov chain, subject to a graph topology with travel times and stationary distribution. The return time entropy is the weighted average, over all graph nodes, of the entropy of the first return times of the Markov chain; this objective function is a function series that does not admit, in general, a closed form. This paper features theoretical and computational contributions. First, we obtain a discrete-time delayed linear system for the return time probability distribution and establish its convergence properties. We show that the objective function is continuous over a compact set and therefore admits a global maximum. We then establish upper and lower bounds between the return time entropy and the well-known entropy rate of the Markov chain. To compute the optimal Markov chain numerically, we establish the asymptotic equality between entropy, conditional entropy, and truncated entropy, and propose an iteration to compute the gradient of the truncated entropy. Finally, we apply these results to the robotic surveillance problem. Our numerical results show that for a model of rational intruder over prototypical graph topologies and test cases, the maximum return time entropy Markov chain outperforms several pre-existing Markov chains.
引用
收藏
页码:72 / 86
页数:15
相关论文
共 27 条
  • [1] Markov Chain Approach to Probabilistic Guidance for Swarms of Autonomous Agents
    Acikmese, Behcet
    Bayard, David S.
    [J]. ASIAN JOURNAL OF CONTROL, 2015, 17 (04) : 1105 - 1124
  • [2] Multi-robot perimeter patrol in adversarial settings
    Agmon, Noa
    Kraus, Sarit
    Kaminka, Gal A.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 2339 - 2345
  • [3] Persistent monitoring in discrete environments: Minimizing the maximum weighted latency between observations
    Alamdari, Soroush
    Fata, Elaheh
    Smith, Stephen L.
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2014, 33 (01) : 138 - 154
  • [4] [Anonymous], 1976, PRINCIPLES MATH ANAL
  • [5] Maximizing the set of recurrent states of an MDP subject to convex constraints
    Arvelo, Eduardo
    Martins, Nuno C.
    [J]. AUTOMATICA, 2014, 50 (03) : 994 - 998
  • [6] Asghar AB, 2016, P AMER CONTR CONF, P6435, DOI 10.1109/ACC.2016.7526682
  • [7] Probabilistic and Distributed Control of a Large-Scale Swarm of Autonomous Agents
    Bandyopadhyay, Saptarshi
    Chung, Soon-Jo
    Hadaegh, Fred Y.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2017, 33 (05) : 1103 - 1123
  • [8] Bertsekas D. P., 2016, THEORETICAL SOLUTION
  • [9] Boyd S., 2004, CONVEX OPTIMIZATION
  • [10] A Minimalist Algorithm for Multirobot Continuous Coverage
    Cannata, Giorgio
    Sgorbissa, Antonio
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2011, 27 (02) : 297 - 312