Piecewise linear value function approximation for factored MDPs

被引：0

作者：

Poupart, P ^{[1
]}

Boutilier, C ^{[1
]}

Patrascu, R ^{[1
]}

Schuurmans, D ^{[1
]}

机构：

[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3H5, Canada

来源：

EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS | 2002年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A number of proposals have been put forth in recent years for the solution of Markov decision processes (MDPs) whose state (and sometimes action) spaces are factored. One recent class of methods involves linear value function approximation, where the optimal value function is assumed to be a linear combination of some set of basis functions, with the aim of finding suitable weights. While sophisticated techniques have been developed for finding the best approximation within this constrained space, few methods have been proposed for choosing a suitable basis set, or modifying it if solution quality is found wanting. We propose a general framework, and specific proposals, that address. both of,these questions. In particular, we examine weakly coupled MDPS where a number of subtasks can be viewed independently modulo resource constraints. We then describe. methods for constructing a piecewise linear combination of the subtask value. functions, using greedy decision tree techniques. We argue that this architecture is suitable for many types of MDPs whose combinatorics are determined largely by the existence multiple conflicting objectives.

引用

页码：292 / 299

页数：8

共 50 条

[41] PIECEWISE-LINEAR APPROXIMATION OF REFERENCE FUNCTION FOR APERTURE DIGITAL SYNTHESIZING
ZRAENKO, SM
KOBERNICHENKO, VG
IZVESTIYA VYSSHIKH UCHEBNYKH ZAVEDENII RADIOELEKTRONIKA, 1987, 30 (08): : 8 - 12
[42] Optimal Piecewise Linear Function Approximation for GPU-Based Applications
Berjon, Daniel
Gallego, Guillermo
Cuevas, Carlos
Moran, Francisco
Garcia, Narciso
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (11) : 2584 - 2595
[43] Symmetrical Piecewise Linear Functions Composed by Absolute Value Function
Mitsuishi, Takashi
FORMALIZED MATHEMATICS, 2023, 31 (01): : 299 - 308
[44] Scaling Up Robust MDPs using Function Approximation
Tamar, Aviv
Mannor, Shie
Xu, Huan
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 181 - 189
[45] Near-optimal Reinforcement Learning in Factored MDPs
Osband, Ian
Van Roy, Benjamin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
[46] Resource-based Dynamic Rewards for Factored MDPs
Killough, Ronan
Bauters, Kim
McAreavey, Kevin
Liu, Weiru
Hong, Jun
2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 1320 - 1327
[47] Efficient solutions to factored MDPs with imprecise transition probabilities
Delgado, Karina Valdivia
Sanner, Scott
de Barros, Leliane Nunes
ARTIFICIAL INTELLIGENCE, 2011, 175 (9-10) : 1498 - 1527
[48] Approximation by means of piecewise linear functions
Kacsó D.P.
Results in Mathematics, 1999, 35 (1-2) : 89 - 102
[49] Exploiting Additive Structure in Factored MDPs for Reinforcement Learning
Degris, Thomas
Sigaud, Olivier
Wuillemin, Pierre-Henri
RECENT ADVANCES IN REINFORCEMENT LEARNING, 2008, 5323 : 15 - 26
[50] Solving factored MDPs with hybrid state and action variables
Kveton, Branislav
Hauskrecht, Milos
Guestrin, Carlos
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2006, 27 (153-201): : 153 - 201

← 1 2 3 4 5 →