共 4 条
[1]
Blackwell D., 1965, ANN MATH STAT, V36, P226
[3]
SEMI-MARKOV DECISION PROCESSES WITH UNBOUNDED REWARDS
[J].
MANAGEMENT SCIENCE SERIES A-THEORY,
1973, 19 (07)
:717-731
[4]
DYNAMIC-PROGRAMMING WITH UNBOUNDED REWARDS
[J].
MANAGEMENT SCIENCE SERIES A-THEORY,
1975, 21 (11)
:1225-1233