Approximate dynamic programming strategies and their applicability for process control: A review and future directions

被引：1

作者：

Lee, JM ^{[1
]}

Lee, JH ^{[1
]}

机构：

[1] Georgia Inst Technol, Sch Chem & Biomol Engn, Atlanta, GA 30332 USA

来源：

INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS | 2004年 / 2卷 / 03期

关键词：

approximate dynamic programming; reinforcement learning; neuro-dynamic programming; optimal control; function approximation;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper reviews dynamic programming (DP), surveys approximate solution methods for it, and considers their applicability to process control problems. Reinforcement Learning (RL) and Neuro-Dynamic Programming (NDP), which can be viewed as approximate DP techniques, are already established techniques for solving difficult multi-stage decision problems in the fields of operations research, computer science, and robotics. Owing to the significant disparity of problem formulations and objective, however, the algorithms and techniques available from these fields are not directly applicable to process control problems, and reformulations based on accurate understanding of these techniques are needed. We categorize the currently available approximate solution techniques for dynamic programming and identify those most suitable for process control problems. Several open issues are also identified and discussed.

引用

页码：263 / 278

页数：16

共 105 条

[11] Atkeson C., 1997, P INT C ROB AUT
[12] Atkeson C. G., 1997, P 14 INT C MACH LEAR, P12
[13] Baird L, 1995, MACHINE LEARNING P 1, P30
[14] LEARNING TO ACT USING REAL-TIME DYNAMIC-PROGRAMMING
BARTO, AG
BRADTKE, SJ
SINGH, SP
[J]. ARTIFICIAL INTELLIGENCE, 1995, 72 (1-2) : 81 - 138
[15] NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS
BARTO, AG
SUTTON, RS
ANDERSON, CW
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05): : 834 - 846
[16] Bellman R., 1957, DYNAMIC PROGRAMMING
[17] Bertsekas D. P., 1992, DATA NETWORKS
[18] Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st
[19] Bertsekas Dimitri P., 1989, PARALLEL DISTRIBUTED
[20] ADAPTIVE AGGREGATION METHODS FOR INFINITE HORIZON DYNAMIC-PROGRAMMING
BERTSEKAS, DP
CASTANON, DA
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1989, 34 (06) : 589 - 598

← 1 2 3 4 5 6 7 8 9 10 →