Approximate dynamic programming strategies and their applicability for process control: A review and future directions

Cited by: 0
Authors
Lee, JM [1]
Lee, JH [1]
Affiliation
[1] Georgia Inst Technol, Sch Chem & Biomol Engn, Atlanta, GA 30332 USA
Keywords
approximate dynamic programming; reinforcement learning; neuro-dynamic programming; optimal control; function approximation
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper reviews dynamic programming (DP), surveys approximate solution methods for it, and considers their applicability to process control problems. Reinforcement Learning (RL) and Neuro-Dynamic Programming (NDP), which can be viewed as approximate DP techniques, are already established techniques for solving difficult multi-stage decision problems in the fields of operations research, computer science, and robotics. Owing to the significant disparity in problem formulations and objectives, however, the algorithms and techniques available from these fields are not directly applicable to process control problems, and reformulations based on an accurate understanding of these techniques are needed. We categorize the currently available approximate solution techniques for dynamic programming and identify those most suitable for process control problems. Several open issues are also identified and discussed.
Pages: 263-278
Page count: 16