Planning under Uncertainty for Robotic Tasks with Mixed Observability

被引:144
作者
Ong, Sylvie C. W. [1 ]
Png, Shao Wei [2 ]
Hsu, David [1 ]
Lee, Wee Sun [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117417, Singapore
[2] McGill Univ, Sch Comp Sci, Montreal, PQ H3A 2A7, Canada
关键词
motion planning; motion planning with uncertainty; Markov decision process; partially observable Markov decision process;
D O I
10.1177/0278364910369861
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Partially observable Markov decision processes (POMDPs) provide a principled, general framework for robot motion planning in uncertain and dynamic environments. They have been applied to various robotic tasks. However, solving POMDPs exactly is computationally intractable. A major challenge is to scale up POMDP algorithms for complex robotic tasks. Robotic systems often have mixed observability : even when a robot's state is not fully observable, some components of the state may still be so. We use a factored model to represent separately the fully and partially observable components of a robot's state and derive a compact lower-dimensional representation of its belief space. This factored representation can be combined with any point-based algorithm to compute approximate POMDP solutions. Experimental results show that on standard test problems, our approach improves the performance of a leading point-based POMDP algorithm by many times.
引用
收藏
页码:1053 / 1068
页数:16
相关论文
共 29 条
[1]  
BOYAN JA, 2001, ADV NEURAL INFORM PR, V13
[2]   Parametric POMDPs for planning in continuous state spaces [J].
Brooks, Alex ;
Makarenko, Alexei ;
Williams, Stefan ;
Durrant-Whyte, Hugh .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (11) :887-897
[3]   Efficient solution algorithms for factored MDPs [J].
Guestrin, C ;
Koller, D ;
Parr, R ;
Venkataraman, S .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2003, 19 :399-468
[4]  
Guestrin C., 2004, P 20 C UNC ART INT, P235
[5]  
GUESTRIN C, 2001, INT JOINT C ART INT, P67
[6]  
Hansen E. A., 2000, Proceedings of the Fifth International Conference on Artificial Intelligence Planning and Scheduling, P130
[7]   Value-function approximations for partially observable Markov decision processes [J].
Hauskrecht, M .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2000, 13 :33-94
[8]  
HAUSKRECHT M, 1998, P 9 INT WORKSH PRINC, P182
[9]  
HOEY J, 2007, P INT C VIS SYST
[10]   Grasping POMDPs [J].
Hsiao, Kaijen ;
Kaelbling, Leslie Pack ;
Lozano-Perez, Tomas .
PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-10, 2007, :4685-+