The BAO* algorithm for stochastic Shortest Path Problems with Dynamic Learning

被引：0

作者：

Aksakalli, Vural ^{[1
]}

机构：

[1] Johns Hopkins Univ, Dept Appl Math & Stat, Baltimore, MD 21218 USA

来源：

PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14 | 2007年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Suppose a spatial arrangement of possible obstacles needs to be traversed as swiftly as possible, and the status of the obstacles may be disambiguated en route at a cost The goal is to find a protocol that decides what and where to disambiguate en route so as to minimize the expected length of the traversal. We call this problem the Stochastic Shortest Path Problem with Dynamic Learning (SDL), which has been shown to be intractable in many broad settings. In this manuscript, we establish a framework for SDL in both continuous and discrete settings and cast the problem as a Markov decision process. The state space, however, is too large to efficiently utilize the stochastic dynamic programming paradigm. We introduce an algorithm for a discretized version of the continuous setting, called the BAO* Algorithm, which is a new improvement on the AO* search algorithm that employs stronger pruning techniques, including utilization of upper bounds on path lengths (in addition to lower bounds as in AO*), and uses significantly less computational resources. The BAO* Algorithm is not polynomial-time, but it can dramatically shorten the execution time needed to find an exact solution to moderately-sized instances of the problem.

引用

页码：2460 / 2465

页数：6

共 50 条

[21] An iterative stochastic algorithm based on distributed learning automata for finding the stochastic shortest path in stochastic graphs [J].

Hamid Beigy ;

Mohammad Reza Meybodi .

The Journal of Supercomputing, 2020, 76 :5540-5562

[22] Q-learning and policy iteration algorithms for stochastic shortest path problems [J].

Huizhen Yu ;

Dimitri P. Bertsekas .

Annals of Operations Research, 2013, 208 :95-132

[23] Q-learning and policy iteration algorithms for stochastic shortest path problems [J].

Yu, Huizhen ;

Bertsekas, Dimitri P. .

ANNALS OF OPERATIONS RESEARCH, 2013, 208 (01) :95-132

[24] The Coupled EigenAnt algorithm for shortest path problems [J].

Kaszkurewicz, Eugenius ;

Bhaya, Amit ;

Jayadeva ;

Meirelles da Silva, Joao Marcos .

2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, :1729-1735

[25] Minimizing risk models in stochastic shortest path problems [J].

Yoshio Ohtsubo .

Mathematical Methods of Operations Research, 2003, 57 :79-88

[26] Shortest path network problems with stochastic arc weights [J].

Jordan, Jeremy D. ;

Uryasev, Stan .

OPTIMIZATION LETTERS, 2021, 15 (08) :2793-2812

[27] Minimizing risk models in stochastic shortest path problems [J].

Ohtsubo, Y .

MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2003, 57 (01) :79-88

[28] An FPTAS for Dynamic Multiobjective Shortest Path Problems [J].

Maristany de las Casas, Pedro ;

Borndoerfer, Ralf ;

Kraus, Luitgard ;

Sedeno-Noda, Antonio .

ALGORITHMS, 2021, 14 (02)

[29] Stochastic shortest path problems with associative accumulative criteria [J].

Ohtsubo, Yoshio .

APPLIED MATHEMATICS AND COMPUTATION, 2008, 198 (01) :198-208

[30] Shortest path network problems with stochastic arc weights [J].

Jeremy D. Jordan ;

Stan Uryasev .

Optimization Letters, 2021, 15 :2793-2812

← 1 2 3 4 5 →