Information gathering in POMDPs using active inference

Cited by: 0

Authors:

Walraven, Erwin [1]

Sijs, Joris [2]

Burghouts, Gertjan J. [1]

Affiliations:

[1] Netherlands Organisation for Applied Scientific Research, The Hague, Netherlands

[2] Delft University of Technology, Delft, Netherlands

Source:

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS | 2025, Vol. 39, Issue 1

Keywords:

Planning under uncertainty; POMDP; Information gathering; Active inference

DOI:

10.1007/s10458-024-09683-4

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Gathering information about the environment state is the main goal in several planning tasks for autonomous agents, such as surveillance, inspection and tracking of objects. Such planning tasks are typically modeled using a Partially Observable Markov Decision Process (POMDP), and in the literature several approaches have emerged to consider information gathering during planning and execution. Similar developments can be seen in the field of active inference, which focuses on active information collection in order to be able to reach a goal. Both fields use POMDPs to model the environment, but the underlying principles for action selection are different. In this paper we create a bridge between both research fields by discussing how they relate to each other and how they can be used for information gathering. Our contribution is a tailored approach to model information gathering tasks directly in the active inference framework. A series of experiments demonstrates that our approach enables agents to gather information about the environment state. As a result, active inference becomes an alternative to common POMDP approaches for information gathering, which opens the door towards more cross-cutting research at the intersection of both fields. This is advantageous, because recent advancements in POMDP solvers may be used to accelerate active inference, and the principled active inference framework may be used to model POMDP agents that operate in a neurobiologically plausible fashion.


Pages: 22
