Real-Time Planning as Decision-Making under Uncertainty

被引:0
|
作者
Mitchell, Andrew [1 ]
Ruml, Wheeler [1 ]
Spaniol, Fabian [2 ]
Hoffmann, Joerg [2 ]
Petrik, Marek [1 ]
机构
[1] Univ New Hampshire, Dept Comp Sci, Durham, NH 03824 USA
[2] Saarland Univ, Dept Comp Sci, Saarbrucken, Germany
来源
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real-time planning, an agent must select the next action to take within a fixed time bound. Many popular real-time heuristic search methods approach this by expanding nodes using time-limited A* and selecting the action leading toward the frontier node with the lowest f value. In this paper, we reconsider real-time planning as a problem of decision-making under uncertainty. We propose treating heuristic values as uncertain evidence and we explore several backup methods for aggregating this evidence. We then propose a novel lookahead strategy that expands nodes to minimize risk, the expected regret in case a non-optimal action is chosen. We evaluate these methods in a simple synthetic benchmark and the sliding tile puzzle and find that they outperform previous methods. This work illustrates how uncertainty can arise even when solving deterministic planning problems, due to the inherent ignorance of time-limited search algorithms about those portions of the state space that they have not computed, and how an agent can benefit from explicitly metareasoning about this uncertainty.
引用
收藏
页码:2338 / 2345
页数:8
相关论文
共 50 条
  • [1] Real-time data stream learning for emergency decision-making under uncertainty
    Wang, Kun
    Xiong, Li
    Xue, Rudan
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 633
  • [2] A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty
    Malikopoulos, Andreas A.
    Papalambros, Panos Y.
    Assanis, Dennis N.
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2009, 131 (04): : 1 - 8
  • [3] Constrained optimization under uncertainty for decision-making problems: Application to Real-Time Strategy games
    Antuori, Valentin
    Richoux, Florian
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 458 - 465
  • [4] A Bayesian approach to decision-making under uncertainty: An application to real-time forecasting in the river Rhine
    Reggiani, P.
    Weerts, A. H.
    JOURNAL OF HYDROLOGY, 2008, 356 (1-2) : 56 - 69
  • [5] DECISION-MAKING UNDER UNCERTAINTY IN SEASONAL OPERATIONS PLANNING
    CARVALHO, VF
    PESSIONE, GF
    FRANCES, DM
    ELKADY, MA
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 1989, 11 (03) : 170 - 175
  • [6] A state-space representation model and learning algorithm for real-time decision-making under uncertainty
    Malikopoulos, Andreas A.
    Assanis, Dennis N.
    Papalambros, Panos Y.
    PROCEEDINGS OF THE ASME INTERNATIONAL MECHANICAL ENGINERING CONGRESS AND EXPOSITION 2007, VOL 9, PTS A-C: MECHANICAL SYSTEMS AND CONTROL, 2008, : 575 - 584
  • [7] REAL-TIME DATA-PROCESSING AND REAL-TIME DECISION-MAKING
    KENNEDY, MH
    HOFFER, JA
    JOURNAL OF SYSTEMS MANAGEMENT, 1978, 29 (10): : 21 - 25
  • [8] Data Acquisition for Real-time Decision-making under Freshness Constraints
    Hu, Shaohan
    Yao, Shuochao
    Jin, Haiming
    Zhao, Yiran
    Hu, Yitao
    Liu, Xiaochen
    Naghibolhosseini, Nooreddin
    Li, Shen
    Kapoor, Akash
    Dron, William
    Su, Lu
    Bar-Noy, Amotz
    Szekely, Pedro
    Govindan, Ramesh
    Hobbs, Reginald
    Abdelzaher, Tarek F.
    2015 IEEE 36TH REAL-TIME SYSTEMS SYMPOSIUM (RTSS 2015), 2015, : 185 - 194
  • [9] Intelligent Sensors for Real-Time Decision-Making
    Coito, Tiago
    Firme, Bernardo
    Martins, Miguel S. E.
    Vieira, Susana M.
    Figueiredo, Joao
    Sousa, Joao M. C.
    AUTOMATION, 2021, 2 (02): : 62 - 82
  • [10] Real-time decision making under uncertainty of self-localization results
    Fukase, T
    Kobayashi, Y
    Ueda, R
    Kawabe, T
    Arai, T
    ROBOCUP 2002: ROBOT SOCCER WORLD CUP VI, 2003, 2752 : 375 - 383