Knowledge-Based Hierarchical POMDPs for Task Planning

被引:10
作者
Serrano, Sergio A. [1 ]
Santiago, Elizabeth [2 ]
Martinez-Carranza, Jose [1 ,3 ]
Morales, Eduardo F. [1 ,4 ]
Enrique Sucar, L. [1 ]
机构
[1] Inst Nacl Astrofis Opt & Electr, Comp Sci Dept, Puebla, Mexico
[2] Inst Nacl Enfermedades Resp, Lab Computat Biol, Mexico City, DF, Mexico
[3] Univ Bristol, Comp Sci Dept, Bristol, Avon, England
[4] Ctr Invest Matemat, Guanajuato, Mexico
关键词
Task planning; Hierarchical POMDP; Service robotics; Decision making; Knowledge-based systems;
D O I
10.1007/s10846-021-01348-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main goal in task planning is to build a sequence of actions that takes an agent from an initial state to a goal state. In robotics, this is particularly difficult because actions usually have several possible results, and sensors are prone to produce measurements with error. Partially observable Markov decision processes (POMDPs) are commonly employed, thanks to their ability to model the uncertainty of actions that modify and monitor the state of a system. However, since solving a POMDP is computationally expensive, their usage becomes prohibitive for most robotic applications. In this article, we propose a task planning architecture for service robotics. In the context of service robot design, we present a scheme to encode knowledge about the robot and its environment, that promotes the modularity and reuse of information. Also, we introduce a new recursive definition of a POMDP that enables our architecture to autonomously build a hierarchy of POMDPs, so that it can be used to generate and execute plans that solve the task at hand. Experimental results show that, in comparison to baseline methods, by following a recursive hierarchical approach the architecture is able to significantly reduce the planning time, while maintaining (or even improving) the robustness under several scenarios that vary in uncertainty and size.
引用
收藏
页数:30
相关论文
共 36 条
[1]  
Balai E, 2013, LECT NOTES COMPUT SC, V8148, P135, DOI 10.1007/978-3-642-40564-8_14
[2]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[3]   AUTOMATED VISUAL INSPECTION - A SURVEY [J].
CHIN, RT ;
HARLOW, CA .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1982, 4 (06) :557-573
[4]   STRIPS - NEW APPROACH TO APPLICATION OF THEOREM PROVING TO PROBLEM SOLVING [J].
FIKES, RE ;
NILSSON, NJ .
ARTIFICIAL INTELLIGENCE, 1971, 2 (3-4) :189-208
[5]   The hierarchical hidden Markov model: Analysis and applications [J].
Fine, S ;
Singer, Y ;
Tishby, N .
MACHINE LEARNING, 1998, 32 (01) :41-62
[6]   PDDL2.1: An extension to PDDL for expressing temporal planning domains [J].
Fox, M ;
Long, D .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2003, 20 :61-124
[7]  
Gelfond M, 2014, KNOWLEDGE REPRESENTATION, REASONING, AND THE DESIGN OF INTELLIGENT AGENTS: THE ANSWER-SET PROGRAMMING APPROACH, P1, DOI 10.1017/CBO9781139342124
[8]   Robot task planning and explanation in open and uncertain worlds [J].
Hanheide, Marc ;
Goebelbecker, Moritz ;
Horn, Graham S. ;
Pronobis, Andrzej ;
Sjoeoe, Kristoffer ;
Aydemir, Alper ;
Jensfelt, Patric ;
Gretton, Charles ;
Dearden, Richard ;
Janicek, Miroslav ;
Zender, Hendrik ;
Kruijff, Geert-Jan ;
Hawes, Nick ;
Wyatt, Jeremy L. .
ARTIFICIAL INTELLIGENCE, 2017, 247 :119-150
[9]  
Herrmann P., 1982, J MANUF SYST, V1, P148, DOI 10.1016/S0278-6125(82)80024-1
[10]   Deliberation for autonomous robots: A survey [J].
Ingrand, Felix ;
Ghallab, Malik .
ARTIFICIAL INTELLIGENCE, 2017, 247 :10-44