Goal Recognition as Reinforcement Learning

被引:0
作者
Amado, Leonardo [1 ]
Mirsky, Reuth [2 ,3 ]
Meneguzzi, Felipe [1 ,4 ]
机构
[1] Pontificia Univ Catolica Rio Grande do Sul, Porto Alegre, RS, Brazil
[2] Bar Ilan Univ, Ramat Gan, Israel
[3] Univ Texas Austin, Austin, TX USA
[4] Univ Aberdeen, Aberdeen, Scotland
来源
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most approaches for goal recognition rely on specifications of the possible dynamics of the actor in the environment when pursuing a goal. These specifications suffer from two key issues. First, encoding these dynamics requires careful design by a domain expert, which is often not robust to noise at recognition time. Second, existing approaches often need costly real-time computations to reason about the likelihood of each potential goal. In this paper, we develop a framework that combines model-free reinforcement learning and goal recognition to alleviate the need for careful, manual domain design, and the need for costly online executions. This framework consists of two main stages: offline learning of policies or utility functions for each potential goal, and online inference. We provide a first instance of this framework using tabular Q-learning for the learning stage, as well as three mechanisms for the inference stage. The resulting instantiation achieves state-of-the-art performance against goal recognizers on standard evaluation domains and superior performance in noisy environments.
引用
收藏
页码:9644 / 9651
页数:8
相关论文
共 30 条
[1]  
Amado L., 2018, Advances in Electronic Government, Digital Divide, and Regional Development, P1, DOI 10.1109/IJCNN.2018.8489653
[2]  
Amado L. R., 2019, INT C AUT PLANN SCHE
[3]   Learning Partially Observable Deterministic Action Models [J].
Amir, Eyal ;
Chang, Allen .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2008, 33 :349-402
[4]  
[Anonymous], 1994, MACHINE LEARNING P 1
[5]   A review of learning planning action models [J].
Arora, Ankuj ;
Fiorino, Humbert ;
Pellier, Damien ;
Metivier, Marc ;
Pesty, Sylvie .
KNOWLEDGE ENGINEERING REVIEW, 2018, 33
[6]  
Asai M, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2676
[7]  
Bishop J., 2020, 2020 SYST INF ENG DE, P1
[8]   An integrated approach of learning, planning, and execution [J].
García-Martínez, R ;
Borrajo, D .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2000, 29 (01) :47-78
[9]   Object-Based Goal Recognition Using Real-World Data [J].
Granada, Roger ;
Monteiro, Juarez ;
Gavenski, Nathan ;
Meneguzzi, Felipe .
ADVANCES IN SOFT COMPUTING, MICAI 2020, PT I, 2020, 12468 :325-337
[10]   The Fast Downward planning system [J].
Helmert, Malte .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2006, 26 :191-246