Solving very large weakly coupled Markov decision processes
被引:0
作者:
Meuleau, N
论文数: 0引用数: 0
h-index: 0
机构:
Brown Univ, Dept Comp Sci, Providence, RI 02912 USABrown Univ, Dept Comp Sci, Providence, RI 02912 USA
Meuleau, N
[1
]
Hauskrecht, M
论文数: 0引用数: 0
h-index: 0
机构:
Brown Univ, Dept Comp Sci, Providence, RI 02912 USABrown Univ, Dept Comp Sci, Providence, RI 02912 USA
Hauskrecht, M
[1
]
Kim, KE
论文数: 0引用数: 0
h-index: 0
机构:
Brown Univ, Dept Comp Sci, Providence, RI 02912 USABrown Univ, Dept Comp Sci, Providence, RI 02912 USA
Kim, KE
[1
]
Peshkin, L
论文数: 0引用数: 0
h-index: 0
机构:
Brown Univ, Dept Comp Sci, Providence, RI 02912 USABrown Univ, Dept Comp Sci, Providence, RI 02912 USA
Peshkin, L
[1
]
Kaelbling, LP
论文数: 0引用数: 0
h-index: 0
机构:
Brown Univ, Dept Comp Sci, Providence, RI 02912 USABrown Univ, Dept Comp Sci, Providence, RI 02912 USA
Kaelbling, LP
[1
]
Dean, T
论文数: 0引用数: 0
h-index: 0
机构:
Brown Univ, Dept Comp Sci, Providence, RI 02912 USABrown Univ, Dept Comp Sci, Providence, RI 02912 USA
Dean, T
[1
]
Boutilier, C
论文数: 0引用数: 0
h-index: 0
机构:
Brown Univ, Dept Comp Sci, Providence, RI 02912 USABrown Univ, Dept Comp Sci, Providence, RI 02912 USA
Boutilier, C
[1
]
机构:
[1] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
来源:
FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS
|
1998年
关键词:
D O I:
暂无
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes(MDPs). We exploit two key properties to avoid explicitly enumerating the very large state and action spaces associated with these problems. First, the problems are composed of multiple tasks whose utilities are independent. Second, the actions taken with respect to (or resources allocated to) a task do not influence the status of any other task. We can therefore view each task as an MDP. However, these MDPs are weakly coupled by resource constraints: actions selected for one MDP restrict the actions available to others. We describe heuristic techniques for dealing with several classes of constraints that use the solutions for individual Mops to construct an approximate global solution. We demonstrate this technique on problems involving thousands of tasks, approximating the solution to problems that are far beyond the reach of standard methods.