Multirobot Navigation Using Partially Observable Markov Decision Processes with Belief-Based Rewards

被引:0
|
作者
Tzikas, Alexandros E. [1 ]
Knowles, Derek [2 ]
Gao, Grace X. [1 ]
Kochenderfer, Mykel J. [1 ]
机构
[1] Stanford Univ, Dept Aeronaut & Astronaut, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USA
来源
关键词
Extended Kalman Filter; Guidance and Navigational Algorithms; Multi-Robot Systems; Markov Decision Process;
D O I
10.2514/1.I011146
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
For the safe deployment of multirobot systems, accurate position estimation is necessary. We present a multirobot navigation algorithm that accomplishes the dual objective of reaching designated goal positions and maintaining low position uncertainty. We pose the problem as a ?-partially observable Markov decision process (?POMDP) to capture position uncertainty through a belief-based reward. The multirobot system employs a centralized extended Kalman filter (EKF) for state estimation, whose output is used as the observation as well as the belief in the ?POMDP. Using the EKF output as the observation and belief allows for the integration of the EKF in the action planning module of the system. To solve the ?POMDP, we adapt the online information particle filter tree algorithm to be compatible with the EKF closed-form output being used as both the observation and belief. We present results for the proposed navigation algorithm that show emergent behaviors for a multirobot system, where the robots move to provide inter-ranging measurements that minimize position uncertainty for all robots in the system. We also demonstrate that the proposed method outperforms a state-of-the-art solver (partially observable Monte Carlo planning with observation widening) and a direct-to-goal planner in terms of minimizing position uncertainty.
引用
收藏
页码:437 / 446
页数:10
相关论文
共 50 条
  • [1] Using rewards for belief state updates in partially observable Markov decision processes
    Izadi, MT
    Precup, D
    MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 593 - 600
  • [2] MEDICAL TREATMENTS USING PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES
    Goulionis, John E.
    JP JOURNAL OF BIOSTATISTICS, 2009, 3 (02) : 77 - 97
  • [3] Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions
    Omidshafiei, Shayegan
    Agha-mohammadi, Ali-akbar
    Amato, Christopher
    How, Jonathan P.
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 5962 - 5969
  • [4] Partially Observable Markov Decision Processes and Robotics
    Kurniawati, Hanna
    ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 253 - 277
  • [5] A tutorial on partially observable Markov decision processes
    Littman, Michael L.
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2009, 53 (03) : 119 - 125
  • [6] Quantum partially observable Markov decision processes
    Barry, Jennifer
    Barry, Daniel T.
    Aaronson, Scott
    PHYSICAL REVIEW A, 2014, 90 (03):
  • [7] PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES WITH PARTIALLY OBSERVABLE RANDOM DISCOUNT FACTORS
    Martinez-Garcia, E. Everardo
    Minjarez-Sosa, J. Adolfo
    Vega-Amaya, Oscar
    KYBERNETIKA, 2022, 58 (06) : 960 - 983
  • [8] Position Validation Strategies using Partially Observable Markov Decision Processes
    Kochenderfer, Mykel J.
    Shih, Kevin J.
    Chryssanthacopoulos, James P.
    Rose, Charles E.
    Elder, Tomas R.
    2011 IEEE/AIAA 30TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2011,
  • [9] POSITION VALIDATION STRATEGIES USING PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES
    Kochenderfer, Mykel J.
    Shih, Kevin J.
    Chryssanthacopoulos, James P.
    Rose, Charles E.
    Elder, Tomas R.
    2011 IEEE/AIAA 30TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2011,
  • [10] Modeling speech using Partially Observable Markov Decision Processes (POMDP)
    Jonas, M
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4016 - 4016