Multiple Model-Based Control Using Finite Controlled Markov Chains

被引:6
作者
Ikonen, Enso [1 ]
Najim, Kaddour [2 ]
机构
[1] Univ Oulu, Syst Engn Lab, Oulu 90014, Finland
[2] ENSIACET, Toulouse, France
关键词
Dynamic programming; Intelligent control; Markov decision processes; Multiple models; Physical models; Probabilistic models;
D O I
10.1007/s12559-009-9020-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cognition and control processes share many similar characteristics, and decisionmaking and learning under the paradigm of multiple models has increasingly gained attention in both fields. The controlled finite Markov chain (CFMC) approach enables to deal with a large variety of signals and systems with multivariable, nonlinear, and stochastic nature. In this article, adaptive control based on multiple models is considered. For a set of candidate plant models, CFMC models (and controllers) are constructed off-line. The outcomes of the CFMC models are compared with frequentist information obtained from on-line data. The best model (and controller) is chosen based on the Kullback-Leibler information. This approach to adaptive control emphasizes the use of physical models as the basis of reliable plant identification. Three series of simulations are conducted: to examine the performace of the developed Matlab-tools; to illustrate the approach in the control of a nonlinear nonminimum phase van der Vusse CSTR plant; and to examine the suggested model selection method for the adaptive control.
引用
收藏
页码:234 / 243
页数:10
相关论文
共 33 条
  • [1] Berthiaux H, 2004, CAN J CHEM ENG, V82, P1143
  • [2] Bertsekas Dimitri P., 2007, Dynamic Programming and Optimal Control, VII
  • [3] Chen H., 1995, Proceedings of the Third European Control Conference. ECC 95, P3247
  • [4] Model bank based intelligent control
    Filev, D
    [J]. 2002 ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY PROCEEDINGS, 2002, : 583 - 586
  • [5] GHAHRAMANI Z, 1994, 1509 MIT CTR BIOL CO
  • [6] Reinforcement Learning: A Tutorial Survey and Recent Advances
    Gosavi, Abhijit
    [J]. INFORMS JOURNAL ON COMPUTING, 2009, 21 (02) : 178 - 192
  • [7] Haggstrom Olle, 2002, Finite Markov Chains and Algorithmic Applications, V52
  • [8] Hsu C.S., 1987, CELL TO CELL MAPPING
  • [9] Hussain A, 2008, LECT NOTES COMPUT SC, V5164, P949, DOI 10.1007/978-3-540-87559-8_98
  • [10] HYOTYNIEMI H, 2006, 151 HELS U TECHN CON