Online belief tracking using regression for contingent planning

被引:12
作者
Brafman, Ronen I. [1 ]
Shani, Guy [1 ]
机构
[1] Ben Gurion Univ Negev, Beer Sheva, Israel
关键词
Contingent planning; Partial observability; Non-deterministic planning; Regression; Belief;
D O I
10.1016/j.artint.2016.08.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In online contingent planning under partial observability an agent decides at each time step on the next action to execute, given its initial knowledge of the world, the actions executed so far, and the observation made. Such agents require some representation of their belief state to determine which actions are valid, or whether the goal has been achieved. Efficient maintenance of a belief state is, given its potential exponential size, a key research challenge in this area. In this paper we develop the theory of regression as a useful tool for belief-state maintenance. We provide a formal description of regression, discussing various alternatives and optimization techniques, and analyze its space and time complexity. In particular, we show that, with some care, the regressed formula will contain variables relevant to the current query only, rather than all variables in the problem description. Consequently, under suitable assumptions, the complexity of regression queries is at most exponential in its contextual width. This parameter is always upper bounded by Bonet and Geffner's width parameter, introduced in their state-of-the-art factored belief tracking (FBT) method. In addition, we show how to obtain a poly-sized circuit representation for the online regression formula even with non-deterministic actions. We provide an empirical comparison of regression with FBT-based belief maintenance, showing the power of regression for online belief tracking. We also suggest caching techniques for regression, and demonstrate their value in reducing runtime in current benchmarks. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:131 / 152
页数:22
相关论文
共 50 条
[41]   Achieving Online Regression Performance of LSTMs With Simple RNNs [J].
Vural, N. Mert ;
Ilhan, Fatih ;
Yilmaz, Selim F. ;
Ergut, Salih ;
Kozat, Suleyman Serdar .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) :7632-7643
[42]   Kernel-based online regression with canal loss [J].
Liang, Xijun ;
Zhang, Zhipeng ;
Song, Yunquan ;
Jian, Ling .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 297 (01) :268-279
[43]   PLANNING THE FUTURE OF EMERGENCY DEPARTMENTS: FORECASTING ED PATIENT ARRIVALS BY USING REGRESSION AND NEURAL NETWORK MODELS [J].
Gul, Muhammet ;
Guneri, Ali Fuat .
INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING-THEORY APPLICATIONS AND PRACTICE, 2016, 23 (02) :137-154
[44]   Online Boosting Algorithm for Regression with Additive and Multiplicative Updates [J].
Mirza, Ali H. .
2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
[45]   A metapsychological exploration of the role of popular media in engineering public belief on planning issues [J].
Gunder, Michael .
PLANNING THEORY, 2011, 10 (04) :325-343
[46]   Constructing A Regression Model to Determine the Target Audience for the Online Course [J].
Plotnikova, Ekaterina V. ;
Kuratova, Anna N. .
VISION 2025: EDUCATION EXCELLENCE AND MANAGEMENT OF INNOVATIONS THROUGH SUSTAINABLE ECONOMIC COMPETITIVE ADVANTAGE, 2019, :1766-1779
[47]   Online Infinite-Dimensional Regression: Learning Linear Operators [J].
Raman, Vinod ;
Subedi, Unique ;
Tewari, Ambuj .
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
[48]   In-Network Online Asynchronous Regression Over a Wireless Network [J].
Meshram, Rahul H. .
2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,
[49]   Influence of Online Social Support on the Public's Belief in Overcoming COVID-19 [J].
Yao, Zhong ;
Tang, Pengfei ;
Fan, Jiarong ;
Luan, Jing .
INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
[50]   Reasoning about uncertain parameters and agent behaviors through encoded experiences and belief planning [J].
Hayashi, Akinobu ;
Ruiken, Dirk ;
Hasegawa, Tadaaki ;
Goerick, Christian .
ARTIFICIAL INTELLIGENCE, 2020, 280