Monte Carlo Planning in Hybrid Belief POMDPs

被引：1

作者：

Barenboim, Moran ^{[1
]}

Shienman, Moshe ^{[1
]}

Indelman, Vadim ^{[2
]}

机构：

[1] Technion Israel Inst Technol, Technion Autonomous Syst Program TASP, IL-32000 Haifa, Israel

[2] Technion Israel Inst Technol, Dept Aerosp Engn, IL-32000 Haifa, Israel

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2023年 / 8卷 / 08期

关键词：

Planning; Random variables; Monte Carlo methods; Inference algorithms; Markov processes; Data models; History; Planning under uncertainty; autonomous agents;

D O I：

10.1109/LRA.2023.3282773

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Real-world problems often require reasoning about hybrid beliefs, over both discrete and continuous random variables. Yet, such a setting has hardly been investigated in the context of planning. Moreover, existing online partially observable Markov decision processes (POMDPs) solvers do not support hybrid beliefs directly. In particular, these solvers do not address the added computational burden due to an increasing number of hypotheses with the planning horizon, which can grow exponentially. As part of this work, we present a novel algorithm, Hybrid Belief Monte Carlo Planning (HB-MCP) that utilizes the Monte Carlo Tree Search (MCTS) algorithm to solve a POMDP while maintaining a hybrid belief. We illustrate how the upper confidence bound (UCB) exploration bonus can be leveraged to guide the growth of hypotheses trees alongside the belief trees. We then evaluate our approach in highly aliased simulated environments where unresolved data association leads to multi-modal belief hypotheses.

引用

页码：4410 / 4417

页数：8

共 16 条

[1] Barenboim M., 2022, 31 INT JOINT C ART I
[2] Barenboim M., MONTE CARLO PLANNING
[3] Dellaert F., 2012, FACTOR GRAPHS GTSAM
[4] ARAS: Ambiguity-aware Robust Active SLAM based on Multi-hypothesis State and Map Estimations
Hsiao, Ming
Mangelson, Joshua G.
Suresh, Sudharshan
Debrunner, Christian
Kaess, Michael
[J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5037 - 5044
[5] Hsiao M, 2019, IEEE INT CONF ROBOT, P1274, DOI [10.1109/icra.2019.8793854, 10.1109/ICRA.2019.8793854]
[6] Kennedy T., 2016, Monte Carlo Methods-a Spec. Top. Course 167.
[7] Bandit based Monte-Carlo planning
Kocsis, Levente
Szepesvari, Csaba
[J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 282 - 293
[8] Factor graphs and the sum-product algorithm
Kschischang, FR
Frey, BJ
Loeliger, HA
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2001, 47 (02) : 498 - 519
[9] Modeling Perceptual Aliasing in SLAM via Discrete-Continuous Graphical Models
Lajoie, Pierre-Yves
Hu, Siyi
Beltrame, Giovanni
Carlone, Luca
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) : 1232 - 1239
[10] A unified framework for data association aware robust belief space planning and perception
Pathak, Shashank
Thomas, Antony
Indelman, Vadim
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2018, 37 (2-3) : 287 - 315

← 1 2 →