Compact Policies for Fully-Observable Non-Deterministic Planning as SAT

被引:0
作者
Geffner, Tomas [1 ]
Geffner, Hector [2 ,3 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
[2] ICREA, Barcelona, Spain
[3] Univ Pompeu Fabra, Barcelona, Spain
来源
TWENTY-EIGHTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING (ICAPS 2018) | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fully observable non-deterministic (FOND) planning is becoming increasingly important as an approach for computing proper policies in probabilistic planning, extended temporal plans in LTL planning, and general plans in generalized planning. In this work, we introduce a SAT encoding for FOND planning that is compact and can produce compact strong cyclic policies. Simple variations of the encodings are also introduced for strong planning and for what we call, dual FOND planning, where some non-deterministic actions are assumed to be fair (e.g., probabilistic) and others unfair (e.g., adversarial). The resulting FOND planners are compared empirically with existing planners over existing and new benchmarks. The notion of "probabilistic interesting problems" is also revisited to yield a more comprehensive picture of the strengths and limitations of current FOND planners and the proposed SAT approach.
引用
收藏
页码:88 / 96
页数:9
相关论文
共 50 条
[21]   LTL Synthesis via Non-deterministic Planning [J].
Lu, Xu ;
Yu, Bin ;
Tian, Cong ;
Duan, Zhen-Hua .
Ruan Jian Xue Bao/Journal of Software, 2022, 33 (08) :2769-2781
[22]   Bounded non-deterministic planning for multimedia adaptation [J].
Lopez, Fernando ;
Jannach, Dietmar ;
Martinez, Jose M. ;
Timmerer, Christian ;
Garcia, Narciso ;
Hellwagner, Hermann .
APPLIED INTELLIGENCE, 2012, 36 (01) :29-60
[23]   Workspace planning in construction: non-deterministic factors [J].
Hosny, Abdelhady ;
Nik-Bakht, Mazdak ;
Moselhi, Osama .
AUTOMATION IN CONSTRUCTION, 2020, 116
[24]   Parallel Algorithms for Generating Distinguishing Sequences for Observable Non-deterministic FSMs [J].
Hierons, Robert M. ;
Turker, Uraz Cengiz .
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2017, 26 (01)
[25]   Safety Verification of Non-Deterministic Policies in Reinforcement Learning [J].
Kwon, Ryeonggu ;
Kwon, Gihwon .
IEEE ACCESS, 2024, 12 :185768-185788
[26]   Further research on observation reduction in non-deterministic planning [J].
Rao, Dong-Ning ;
Jiang, Zhi-Hua ;
Jiang, Yun-Fei ;
Zhu, Hui-Quan .
Ruan Jian Xue Bao/Journal of Software, 2009, 20 (05) :1254-1268
[27]   Risk analysis for non-deterministic mission planning and sequencing [J].
Cheung, Kar-Ming ;
Ko, Adans ;
Dang, Van ;
Heckman, David .
2005 IEEE Aerospace Conference, Vols 1-4, 2005, :147-158
[28]   Compiling Uncertainty Away in Non-Deterministic Conformant Planning [J].
Albore, Alexandre ;
Palacios, Hector ;
Geffner, Hector .
ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 :465-470
[29]   AN EFFICIENT FULLY SYMBOLIC BISIMULATION ALGORITHM FOR NON-DETERMINISTIC SYSTEMS [J].
Mumme, Malcolm ;
Ciardo, Gianfranco .
INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2013, 24 (02) :263-282
[30]   Generalized Planning: Non-Deterministic Abstractions and Trajectory Constraints [J].
Bonet, Blai ;
De Giacomo, Giuseppe ;
Geffner, Hector ;
Rubin, Sasha .
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, :873-879