Monte Carlo Planning in Hybrid Belief POMDPs

被引:1
作者
Barenboim, Moran [1 ]
Shienman, Moshe [1 ]
Indelman, Vadim [2 ]
机构
[1] Technion Israel Inst Technol, Technion Autonomous Syst Program TASP, IL-32000 Haifa, Israel
[2] Technion Israel Inst Technol, Dept Aerosp Engn, IL-32000 Haifa, Israel
关键词
Planning; Random variables; Monte Carlo methods; Inference algorithms; Markov processes; Data models; History; Planning under uncertainty; autonomous agents;
D O I
10.1109/LRA.2023.3282773
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Real-world problems often require reasoning about hybrid beliefs, over both discrete and continuous random variables. Yet, such a setting has hardly been investigated in the context of planning. Moreover, existing online partially observable Markov decision processes (POMDPs) solvers do not support hybrid beliefs directly. In particular, these solvers do not address the added computational burden due to an increasing number of hypotheses with the planning horizon, which can grow exponentially. As part of this work, we present a novel algorithm, Hybrid Belief Monte Carlo Planning (HB-MCP) that utilizes the Monte Carlo Tree Search (MCTS) algorithm to solve a POMDP while maintaining a hybrid belief. We illustrate how the upper confidence bound (UCB) exploration bonus can be leveraged to guide the growth of hypotheses trees alongside the belief trees. We then evaluate our approach in highly aliased simulated environments where unresolved data association leads to multi-modal belief hypotheses.
引用
收藏
页码:4410 / 4417
页数:8
相关论文
共 16 条
  • [1] Barenboim M., 2022, 31 INT JOINT C ART I
  • [2] Barenboim M., MONTE CARLO PLANNING
  • [3] Dellaert F., 2012, FACTOR GRAPHS GTSAM
  • [4] ARAS: Ambiguity-aware Robust Active SLAM based on Multi-hypothesis State and Map Estimations
    Hsiao, Ming
    Mangelson, Joshua G.
    Suresh, Sudharshan
    Debrunner, Christian
    Kaess, Michael
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5037 - 5044
  • [5] Hsiao M, 2019, IEEE INT CONF ROBOT, P1274, DOI [10.1109/icra.2019.8793854, 10.1109/ICRA.2019.8793854]
  • [6] Kennedy T., 2016, Monte Carlo Methods-a Spec. Top. Course 167.
  • [7] Bandit based Monte-Carlo planning
    Kocsis, Levente
    Szepesvari, Csaba
    [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 282 - 293
  • [8] Factor graphs and the sum-product algorithm
    Kschischang, FR
    Frey, BJ
    Loeliger, HA
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2001, 47 (02) : 498 - 519
  • [9] Modeling Perceptual Aliasing in SLAM via Discrete-Continuous Graphical Models
    Lajoie, Pierre-Yves
    Hu, Siyi
    Beltrame, Giovanni
    Carlone, Luca
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) : 1232 - 1239
  • [10] A unified framework for data association aware robust belief space planning and perception
    Pathak, Shashank
    Thomas, Antony
    Indelman, Vadim
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2018, 37 (2-3) : 287 - 315