Proximity-Based Non-uniform Abstractions for Approximate Planning

Cited by: 2
Authors
Baum, Jiri [1]
Nicholson, Ann E. [1]
Dix, Trevor I. [1]
Affiliations
[1] Monash Univ, Fac Informat Technol, Clayton, Vic, Australia
Source
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH | 2012 / Vol. 43
Keywords
ALGORITHMS;
DOI
10.1613/jair.3414
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In a deterministic world, a planning agent can be certain of the consequences of its planned sequence of actions. Not so, however, in dynamic, stochastic domains where Markov decision processes are commonly used. Unfortunately, these suffer from the 'curse of dimensionality': if the state space is a Cartesian product of many small sets ('dimensions'), planning is exponential in the number of those dimensions. Our new technique exploits the intuitive strategy of selectively ignoring various dimensions in different parts of the state space. The resulting non-uniformity has strong implications, since the approximation is no longer Markovian, requiring the use of a modified planner. We also use a spatial and temporal proximity measure, which responds to continued planning as well as movement of the agent through the state space, to dynamically adapt the abstraction as planning progresses. We present qualitative and quantitative results across a range of experimental domains showing that an agent exploiting this novel approximation method successfully finds solutions to the planning problem using much less than the full state space. We assess and analyse the features of domains which our method can exploit.
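The state-count argument in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: the four dimensions, the `ignored_dims` rule, and all names below are hypothetical, chosen only to show how ignoring some dimensions in one part of the state space shrinks the number of distinct states a planner would have to consider.

```python
from itertools import product

# Hypothetical toy domain: the state space is the Cartesian product
# of four small sets ("dimensions").
DIMS = {
    "x": range(4),
    "y": range(4),
    "door_open": (False, True),
    "has_key": (False, True),
}

def ignored_dims(state):
    """Assumed rule: in the region x >= 2, ignore the two boolean dimensions."""
    return {"door_open", "has_key"} if state["x"] >= 2 else set()

def abstract(state):
    """Map a full state to its abstract state by dropping the ignored dimensions."""
    drop = ignored_dims(state)
    return tuple((name, value) for name, value in state.items() if name not in drop)

# Enumerate the full state space, then collapse it under the abstraction.
full_states = [dict(zip(DIMS, values)) for values in product(*DIMS.values())]
abstract_states = {abstract(s) for s in full_states}

print(len(full_states))      # 64 = 4 * 4 * 2 * 2
print(len(abstract_states))  # 40 = 32 (x < 2, all dimensions) + 8 (x >= 2, x and y only)
```

Each additional binary dimension doubles the full count, whereas the abstract count grows only in the regions where that dimension is kept. Because the set of ignored dimensions differs between regions, the abstraction here is non-uniform in the sense the abstract describes.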
Pages: 477-522
Page count: 46
Related papers
50 in total
  • [21] Characterizing the shapes of noisy, non-uniform, and disconnected point clusters in the plane
    Zhong, Xu
    Duckham, Matt
    COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2016, 57 : 48 - 58
  • [22] Non-Uniform Wavelet Sampling for RF Analog-to-Information Conversion
    Pelissier, Michael
    Studer, Christoph
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2018, 65 (02) : 471 - 484
  • [23] Uniform and Non-uniform Perturbations in Brain-Machine Interface Task Elicit Similar Neural Strategies
    Salas, Michelle Armenta
    Tillery, Stephen I. Helms
    FRONTIERS IN SYSTEMS NEUROSCIENCE, 2016, 10
  • [24] PERFORMANCE ANALYSIS OF ROOT-MUSIC-BASED DIRECTION-OF-ARRIVAL ESTIMATION FOR ARBITRARY NON-UNIFORM ARRAYS
    Ruebsamen, Michael
    Gershman, Alex B.
    2008 IEEE SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP, 2008, : 184 - 188
  • [25] Under-Determined DOA Estimation: A Method Based on Higher-Order Statistics and Non-Uniform Arrays
    Peng, Wei
    Li, Peng
    Wu, Xinyi
    Luo, Kai
    Zheng, Gan
    Li, Dong
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (11) : 15903 - 15914
  • [26] Higher-Order Statistics-Based Non-uniform Linear Array for Underdetermined DoA Estimation of Non-circular Signals
    Gupta, Payal
    Agrawal, Monika
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (05) : 2719 - 2749
  • [27] A new maximum power point tracking strategy for PV arrays under uniform and non-uniform insolation conditions
    Kouchaki, Alireza
    Iman-Eini, Hossein
    Asaei, Behzad
    SOLAR ENERGY, 2013, 91 : 221 - 232
  • [28] Segmentation of MRI brain scans using non-uniform partial volume densities
    Brouwer, Rachel M.
    Pol, Hilleke E. Hulshoff
    Schnack, Hugo G.
    NEUROIMAGE, 2010, 49 (01) : 467 - 477
  • [29] On the Bertrand Pairs of Open Non-Uniform Rational B-Spline Curves
    Incesu, Muhsin
    Evren, Sara Yilmaz
    Gursoy, Osman
    MATHEMATICAL ANALYSIS AND APPLICATIONS, MAA 2020, 2021, 381 : 167 - 184
  • [30] Single machine lot scheduling with non-uniform lot capacities and processing times
    Ying Chen
    Yongxi Cheng
    Guiqing Zhang
    Journal of Combinatorial Optimization, 2022, 43 : 1359 - 1367