Online belief tracking using regression for contingent planning

被引:12
作者
Brafman, Ronen I. [1 ]
Shani, Guy [1 ]
机构
[1] Ben Gurion Univ Negev, Beer Sheva, Israel
关键词
Contingent planning; Partial observability; Non-deterministic planning; Regression; Belief;
D O I
10.1016/j.artint.2016.08.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In online contingent planning under partial observability an agent decides at each time step on the next action to execute, given its initial knowledge of the world, the actions executed so far, and the observation made. Such agents require some representation of their belief state to determine which actions are valid, or whether the goal has been achieved. Efficient maintenance of a belief state is, given its potential exponential size, a key research challenge in this area. In this paper we develop the theory of regression as a useful tool for belief-state maintenance. We provide a formal description of regression, discussing various alternatives and optimization techniques, and analyze its space and time complexity. In particular, we show that, with some care, the regressed formula will contain variables relevant to the current query only, rather than all variables in the problem description. Consequently, under suitable assumptions, the complexity of regression queries is at most exponential in its contextual width. This parameter is always upper bounded by Bonet and Geffner's width parameter, introduced in their state-of-the-art factored belief tracking (FBT) method. In addition, we show how to obtain a poly-sized circuit representation for the online regression formula even with non-deterministic actions. We provide an empirical comparison of regression with FBT-based belief maintenance, showing the power of regression for online belief tracking. We also suggest caching techniques for regression, and demonstrate their value in reducing runtime in current benchmarks. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:131 / 152
页数:22
相关论文
共 50 条
  • [21] Online Bayesian shrinkage regression
    Waqas Jamil
    Abdelhamid Bouchachia
    Neural Computing and Applications, 2020, 32 : 17759 - 17767
  • [22] Online multiple instance regression
    Wang Zhi-Gang
    Zhao Zeng-Shun
    Zhang Chang-Shui
    CHINESE PHYSICS B, 2013, 22 (09)
  • [23] Online multiple instance regression
    王志岗
    赵增顺
    张长水
    Chinese Physics B, 2013, 22 (09) : 660 - 665
  • [24] Online regression with unbounded sampling
    Guo, Zheng-Chu
    Wang, Cheng
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2011, 88 (14) : 2936 - 2941
  • [25] Memory network for tracking with deep regression
    Deng, Ying
    Zheng, Hong
    2018 IEEE THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2018, : 273 - 278
  • [26] Online estimation of a smooth regression function
    Khasminskii, R
    Liptser, R
    THEORY OF PROBABILITY AND ITS APPLICATIONS, 2002, 47 (03) : 541 - 550
  • [27] Quantifying Uncertainty in Online Regression Forests
    Vasiloudis, Theodore
    Morales, Gianmarco De Francisci
    Bostrom, Henrik
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [28] Cost-Sensitive Probabilistic Contingent Planning for Web Service Composition
    Markou, George
    Refanidis, Ioannis
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2016, 25 (01)
  • [29] Data Actualization Using Regression Models in Decision Support System for Urban Transport Planning
    Yatskiv , Irina
    Yurshevich, Elena
    THEORY AND ENGINEERING OF COMPLEX SYSTEMS AND DEPENDABILITY, 2015, 365 : 553 - 561
  • [30] Nitrogen Oxide Emission Modeling for Boiler Combustion Using Accurate Online Support Vector Regression
    Zhou, Jianxin
    Ji, Yinxin
    Qiao, Zongliang
    Si, Fengqi
    Xu, Zhigao
    2013 10TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2013, : 989 - 993