Markov Chains With Maximum Return Time Entropy for Robotic Surveillance

被引：23

作者：

Duan, Xiaoming ^{[1
,2
]}

George, Mishel ^{[1
,2
]}

Bullo, Francesco ^{[1
,2
]}

机构：

[1] Univ Calif Santa Barbara, Dept Mech Engn, Santa Barbara, CA 93106 USA

[2] Univ Calif Santa Barbara, Ctr Control Dynam Syst & Computat, Santa Barbara, CA 93106 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2020年 / 65卷 / 01期

关键词：

Entropy; Markov processes; Surveillance; Robots; Random variables; Topology; Markov chains; return time entropy; stochastic surveillance;

D O I：

10.1109/TAC.2019.2906473

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Motivated by robotic surveillance applications, this paper studies the novel problem of maximizing the return time entropy of a Markov chain, subject to a graph topology with travel times and stationary distribution. The return time entropy is the weighted average, over all graph nodes, of the entropy of the first return times of the Markov chain; this objective function is a function series that does not admit, in general, a closed form. This paper features theoretical and computational contributions. First, we obtain a discrete-time delayed linear system for the return time probability distribution and establish its convergence properties. We show that the objective function is continuous over a compact set and therefore admits a global maximum. We then establish upper and lower bounds between the return time entropy and the well-known entropy rate of the Markov chain. To compute the optimal Markov chain numerically, we establish the asymptotic equality between entropy, conditional entropy, and truncated entropy, and propose an iteration to compute the gradient of the truncated entropy. Finally, we apply these results to the robotic surveillance problem. Our numerical results show that for a model of rational intruder over prototypical graph topologies and test cases, the maximum return time entropy Markov chain outperforms several pre-existing Markov chains.

引用

页码：72 / 86

页数：15

共 27 条

[1] Markov Chain Approach to Probabilistic Guidance for Swarms of Autonomous Agents
Acikmese, Behcet
Bayard, David S.
[J]. ASIAN JOURNAL OF CONTROL, 2015, 17 (04) : 1105 - 1124
[2] Multi-robot perimeter patrol in adversarial settings
Agmon, Noa
Kraus, Sarit
Kaminka, Gal A.
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 2339 - 2345
[3] Persistent monitoring in discrete environments: Minimizing the maximum weighted latency between observations
Alamdari, Soroush
Fata, Elaheh
Smith, Stephen L.
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2014, 33 (01) : 138 - 154
[4] [Anonymous], 1976, PRINCIPLES MATH ANAL
[5] Maximizing the set of recurrent states of an MDP subject to convex constraints
Arvelo, Eduardo
Martins, Nuno C.
[J]. AUTOMATICA, 2014, 50 (03) : 994 - 998
[6] Asghar AB, 2016, P AMER CONTR CONF, P6435, DOI 10.1109/ACC.2016.7526682
[7] Probabilistic and Distributed Control of a Large-Scale Swarm of Autonomous Agents
Bandyopadhyay, Saptarshi
Chung, Soon-Jo
Hadaegh, Fred Y.
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2017, 33 (05) : 1103 - 1123
[8] Bertsekas D. P., 2016, THEORETICAL SOLUTION
[9] Boyd S., 2004, CONVEX OPTIMIZATION
[10] A Minimalist Algorithm for Multirobot Continuous Coverage
Cannata, Giorgio
Sgorbissa, Antonio
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2011, 27 (02) : 297 - 312

← 1 2 3 →