Monte-Carlo-based partially observable Markov decision process approximations for adaptive sensing

被引：18

作者：

Chong, Edwin K. P. ^{[1
,3
]}

Kreucher, Christopher M. ^{[2
]}

Hero, Alfred O., III ^{[3
]}

机构：

[1] Colorado State Univ, Ft Collins, CO 80523 USA

[2] Integr Applicat Incorp, Ann Arbor, MI USA

[3] Univ Michigan, Ann Arbor, MI 48109 USA

来源：

WODES' 08: PROCEEDINGS OF THE 9TH INTERNATIONAL WORKSHOP ON DISCRETE EVENT SYSTEMS | 2008年

关键词：

D O I：

10.1109/WODES.2008.4605941

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Adaptive sensing involves actively managing sensor resources to achieve a sensing task, such as object detection, classification, and tracking, and represents a promising direction for new applications of discrete event system methods. We describe an approach to adaptive sensing based on approximately solving a partially observable Markov decision process (POMDP) formulation of the problem. Such approximations are necessary because of the very large state space involved in practical adaptive sensing problems, precluding exact computation of optimal solutions. We review the theory of POMDPs and show how the theory applies to adaptive sensing problems. We then describe Monte-Carlo-based approximation methods, with an example to illustrate their application in adaptive sensing. The example also demonstrates the gains that are possible from nonmyopic methods relative to myopic methods.

引用

页码：173 / +

页数：2

共 20 条

[1]

Bertsekas D.P., 2001, DYNAMIC PROGRAMMING, V2

[2]

Bertsekas D. P., 2005, DYNAMIC PROGRAMMING, VI

[3] Rollout algorithms for stochastic scheduling problems [J].

Bertsekas, DP ;

Castañon, DA .

JOURNAL OF HEURISTICS, 1999, 5 (01) :89-108

[4]

BERTSEKAS DP, 2005, P JOINT 44 IEEE C DE

[5]

Castanon DA, 1997, IEEE DECIS CONTR P, P1202, DOI 10.1109/CDC.1997.657615

[6] Parallel rollout for online solution of partially observable Markov decision processes [J].

Chang, HS ;

Givan, R ;

Chong, EKP .

DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2004, 14 (03) :309-341

[7]

Chong EKP, 2000, IEEE DECIS CONTR P, P1433, DOI 10.1109/CDC.2000.912059

[8] Sensor scheduling for target tracking in sensor networks [J].

He, Y ;

Chong, EKP .

2004 43RD IEEE CONFERENCE ON DECISION AND CONTROL (CDC), VOLS 1-5, 2004, :743-748

[9] Sensor scheduling for target tracking: A Monte Carlo sampling approach [J].

He, Ying ;

Chong, Edwin K. P. .

DIGITAL SIGNAL PROCESSING, 2006, 16 (05) :533-545

[10]

Hero III A. O., 2007, FDN APPL SENSOR MANA

← 1 2 →