Goal Recognition as Reinforcement Learning

Authors:

Amado, Leonardo [1]

Mirsky, Reuth [2,3]

Meneguzzi, Felipe [1,4]

Affiliations:

[1] Pontificia Univ Catolica Rio Grande do Sul, Porto Alegre, RS, Brazil

[2] Bar Ilan Univ, Ramat Gan, Israel

[3] Univ Texas Austin, Austin, TX USA

[4] Univ Aberdeen, Aberdeen, Scotland

Source:

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022

DOI: not available

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Most approaches for goal recognition rely on specifications of the possible dynamics of the actor in the environment when pursuing a goal. These specifications suffer from two key issues. First, encoding these dynamics requires careful design by a domain expert, which is often not robust to noise at recognition time. Second, existing approaches often need costly real-time computations to reason about the likelihood of each potential goal. In this paper, we develop a framework that combines model-free reinforcement learning and goal recognition to alleviate the need for careful, manual domain design, and the need for costly online executions. This framework consists of two main stages: offline learning of policies or utility functions for each potential goal, and online inference. We provide a first instance of this framework using tabular Q-learning for the learning stage, as well as three mechanisms for the inference stage. The resulting instantiation achieves state-of-the-art performance against goal recognizers on standard evaluation domains and superior performance in noisy environments.
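The two stages named in the abstract (offline learning of a utility function per candidate goal, then online inference) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration only: the one-dimensional corridor domain, the reward scheme, the hyperparameters, and the function names `learn_q` and `recognize` are hypothetical, and the summed-Q-value score is just one plausible inference measure; the abstract does not specify its three inference mechanisms, so this is not presented as any of them.

```python
import random

N_STATES = 7          # corridor cells 0..6 (hypothetical toy domain)
ACTIONS = (-1, +1)    # step left or step right


def step(state, action):
    """Deterministic move, clipped to the corridor bounds."""
    return min(max(state + action, 0), N_STATES - 1)


def learn_q(goal, episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Offline stage: tabular Q-learning for one candidate goal."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        s = rng.randrange(N_STATES)
        for _ in range(50):                     # cap on episode length
            if s == goal:
                break
            if rng.random() < epsilon:          # explore
                a = rng.choice(ACTIONS)
            else:                               # exploit current estimate
                a = max(ACTIONS, key=lambda act: q[(s, act)])
            s2 = step(s, a)
            reward = 1.0 if s2 == goal else 0.0
            # The goal is terminal, so no future value is bootstrapped from it.
            future = 0.0 if s2 == goal else max(q[(s2, b)] for b in ACTIONS)
            q[(s, a)] += alpha * (reward + gamma * future - q[(s, a)])
            s = s2
    return q


def recognize(q_tables, observations):
    """Online stage: score each goal by the summed Q-values of observed pairs.

    This scoring rule is an assumption, not the paper's stated mechanism.
    """
    scores = {g: sum(q[(s, a)] for s, a in observations)
              for g, q in q_tables.items()}
    return max(scores, key=scores.get), scores


# One Q-table per candidate goal (the two corridor ends).
q_tables = {goal: learn_q(goal) for goal in (0, 6)}

# The actor is observed moving right from the middle of the corridor.
observed = [(3, +1), (4, +1)]
predicted, scores = recognize(q_tables, observed)
print(predicted, scores)
```

In this sketch the expensive part (learning) happens once per goal before any observation arrives, while recognition reduces to table lookups, which mirrors the abstract's motivation of avoiding costly online computation.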

Pages: 9644-9651

Page count: 8
