Higher level application of ADP: A next phase for the control field?

Cited by: 17
Author
Lendaris, George G. [1]
Affiliation
[1] Portland State Univ, Dept Elect & Comp Engn, NW Computat Intelligence Lab, Syst Sci Grad Program, Portland, OR 97207 USA
Source
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2008 / Vol. 38 / No. 04
Funding
National Science Foundation (USA);
Keywords
approximate dynamic programming (ADP); artificial intelligence (AI); context; context discernment; experience-based identification and control (EBIC); neural networks (NNs); optimal control; reinforcement learning (RL); system identification (SID);
DOI
10.1109/TSMCB.2008.918073
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812 ;
Abstract
Two distinguishing features of humanlike control vis-a-vis current technological control are the ability to make use of experience while selecting a control policy for distinct situations and the ability to do so faster and faster as more experience is gained (in contrast to current technological implementations that slow down as more knowledge is stored). The notions of context and context discernment are important to understanding this human ability. Whereas methods known as adaptive control and learning control focus on modifying the design of a controller as changes in context occur, experience-based (EB) control entails selecting a previously designed controller that is appropriate to the current situation. Developing the EB approach entails a shift of the technologist's focus "up a level" away from designing individual (optimal) controllers to that of developing online algorithms that efficiently and effectively select designs from a repository of existing controller solutions. A key component of the notions presented here is that of higher level learning algorithm. This is a new application of reinforcement learning and, in particular, approximate dynamic programming, with its focus shifted to the posited higher level, and is employed, with very promising results. The author's hope for this paper is to inspire and guide future work in this promising area.
Pages: 901-912
Page count: 12
Related papers
50 in total
  • [1] PHILIPPINE COMMUNISM - A NEXT HIGHER LEVEL
    VANDERKROEF, JM
    ISSUES & STUDIES, 1987, 23 (11): : 115 - 137
  • [2] Phase control of higher-order squeezing of a quantum field
    Xie, RH
    Rao, Q
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2002, 312 (3-4) : 421 - 430
  • [3] Application of ADP to intersection signal control
    Li, Tao
    Zhao, Dongbin
    Yi, Jianqiang
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 374 - +
  • [4] Using ADP to understand and replicate brain intelligence: the next level design
    Werbos, Paul J.
    2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, 2007, : 209 - 216
  • [5] Phase control of higher spectral components in the presence of a static electric field
    Zhang, Chaojin
    Yang, Weifeng
    Song, Xiaohong
    Xu, Zhizhan
    JOURNAL OF PHYSICS B-ATOMIC MOLECULAR AND OPTICAL PHYSICS, 2009, 42 (05)
  • [6] Taking Change Control to the Next Level
    Doppler, J. M.
    Foss, M.
    Basiri, M.
    Bundy, K. L.
    Stubbs, J.
    TRANSFUSION, 2010, 50 : 253A - 254A
  • [7] ACCURATE FIELD LEVEL AND PHASE CONTROL AND MONITORING FOR A PROTON LINEAR ACCELERATOR
    BATCHELOR, K
    GALLAGHE.G
    IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 1965, NS12 (03) : 195 - +
  • [8] VMM application packages: the next level of productivity
    Bergeron, Janick
    EDN, 2008, 53 (04) : 49 - +
  • [9] The Baylor Project: Taking Christian Higher Education to the Next Level
    Ward, Roger
    CHRISTIAN HIGHER EDUCATION, 2008, 8 (01) : 74 - 81
  • [10] Molecularly Engineered "Janus GroEL": Application to Supramolecular Copolymerization with a Higher Level of Sequence Control
    Kashiwagi, Daiki
    Shen, Hao K.
    Sim, Seunghyun
    Sano, Koki
    Ishida, Yasuhiro
    Kimura, Ayumi
    Niwa, Tatsuya
    Taguchi, Hideki
    Aida, Takuzo
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2020, 142 (31) : 13310 - 13315