Approximate dynamic programming strategies and their applicability for process control: A review and future directions

Cited by: 0

Authors:

Lee, JM ^{[1]}

Lee, JH ^{[1]}

Affiliations:

[1] Georgia Inst Technol, Sch Chem & Biomol Engn, Atlanta, GA 30332 USA

Source:

INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS | 2004 / Vol. 2 / No. 03

Keywords:

approximate dynamic programming; reinforcement learning; neuro-dynamic programming; optimal control; function approximation;

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

This paper reviews dynamic programming (DP), surveys approximate solution methods for it, and considers their applicability to process control problems. Reinforcement Learning (RL) and Neuro-Dynamic Programming (NDP), which can be viewed as approximate DP techniques, are already established techniques for solving difficult multi-stage decision problems in the fields of operations research, computer science, and robotics. Owing to the significant disparity in problem formulations and objectives, however, the algorithms and techniques available from these fields are not directly applicable to process control problems, and reformulations based on an accurate understanding of these techniques are needed. We categorize the currently available approximate solution techniques for dynamic programming and identify those most suitable for process control problems. Several open issues are also identified and discussed.

References

Pages: 263 - 278

Page count: 16

105 entries in total

[61] Kernel-based reinforcement learning in average-cost problems
Ormoneit, D
Glynn, P
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2002, 47 (10) : 1624 - 1636
[62] Kernel-based reinforcement learning
Ormoneit, D
Sen, S
[J]. MACHINE LEARNING, 2002, 49 (2-3) : 161 - 178
[63] ESTIMATION OF A PROBABILITY DENSITY-FUNCTION AND MODE
PARZEN, E
[J]. ANNALS OF MATHEMATICAL STATISTICS, 1962, 33 (03) : 1065 - &
[64] Peng J., 1993, ADAPT BEHAV, V1, P437, DOI 10.1177/105971239300100403
[65] Adaptive critic designs
Prokhorov, DV
Wunsch, DC
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1997, 8 (05) : 997 - 1007
[66] Puterman ML., 1994, Wiley Series in Probability and Statistics, DOI 10.1002/9780470316887
[67] A survey of industrial model predictive control technology
Qin, SJ
Badgwell, TA
[J]. CONTROL ENGINEERING PRACTICE, 2003, 11 (07) : 733 - 764
[68] Rüde U., 1993, MATH COMPUTATIONAL T
[69] Rummery G, 1994, 166 CUED FINFENG TR
[70] SABES P, 1993, P 4 CONN MOD SUMM SC
