Learning Safe Numeric Action Models

被引:0
作者
Mordoch, Argaman [1 ]
Juba, Brendan [2 ]
Stern, Roni [1 ]
机构
[1] Ben Gurion Univ Negev, Beer Sheva, Israel
[2] Washington Univ, St Louis, MO USA
来源
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10 | 2023年
关键词
FF;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Powerful domain-independent planners have been developed to solve various types of planning problems. These planners often require a model of the acting agent's actions, given in some planning domain description language. Yet obtaining such an action model is a notoriously hard task. This task is even more challenging in mission-critical domains, where a trial-and-error approach to learning how to act is not an option. In such domains, the action model used to generate plans must be safe, in the sense that plans generated with it must be applicable and achieve their goals. Learning safe action models for planning has been recently explored for domains in which states are sufficiently described with Boolean variables. In this work, we go beyond this limitation and propose the Numeric Safe Action Model Learning (N-SAM) algorithm. N-SAM runs in time that is polynomial in the number of observations and, under certain conditions, is guaranteed to return safe action models. We analyze its worst-case sample complexity, which may be intractable for some domains. Empirically, however, N-SAM can quickly learn a safe action model that can solve most problems in the domain.
引用
收藏
页码:12079 / 12086
页数:8
相关论文
共 33 条
[1]  
Aeronautiques C., 1998, Technical Report, Tech. Rep
[2]   Learning action models with minimal observability [J].
Aineto, Diego ;
Jimenez Celorrio, Sergio ;
Onaindia, Eva .
ARTIFICIAL INTELLIGENCE, 2019, 275 :104-137
[3]  
Anthony M., 1996, Comb. Probab. Comput., V5, P191
[4]  
Coles A, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P1671
[5]  
Cresswell S., 2011, INT C AUTOMATED PLAN, V42, P49
[6]   Acquiring planning domain models using LOCM [J].
Cresswell, Stephen N. ;
McCluskey, Thomas L. ;
West, Margaret M. .
KNOWLEDGE ENGINEERING REVIEW, 2013, 28 (02) :195-213
[7]   STRIPS - NEW APPROACH TO APPLICATION OF THEOREM PROVING TO PROBLEM SOLVING [J].
FIKES, RE ;
NILSSON, NJ .
ARTIFICIAL INTELLIGENCE, 1971, 2 (3-4) :189-208
[8]   PDDL2.1: An extension to PDDL for expressing temporal planning domains [J].
Fox, M ;
Long, D .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2003, 20 :61-124
[9]   Modelling mixed discrete-continuous domains for planning [J].
Fox, Maria ;
Long, Derek .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2006, 27 :235-297
[10]  
Ghallab M, 2016, Automated Planning and Acting