Higher level application of ADP: A next phase for the control field?

被引：17

作者：

Lendaris, George G. ^{[1
]}

机构：

[1] Portland State Univ, Dept Elect & Comp Engn, NW Computat Intelligence Lab, Syst Sci Grad Program, Portland, OR 97207 USA

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2008年 / 38卷 / 04期

基金：

美国国家科学基金会;

关键词：

approximate dynamic programming (ADP); artificial intelligence (AI); context; context discernment; experience-based identification and control (EBIC); neural networks (NNs); optimal control; reinforcement learning (RL); system identification (SID);

D O I：

10.1109/TSMCB.2008.918073

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Two distinguishing features of humanlike control vis-a-vis current technological control are the ability to make use of experience while selecting a control policy for distinct situations and the ability to do so faster and faster as more experience is gained (in contrast to current technological implementations that slow down as more knowledge is stored). The notions of context and context discernment are important to understanding this human ability. Whereas methods known as adaptive control and learning control focus on modifying the design of a controller as changes in context occur, experience-based (EB) control entails selecting a previously designed controller that is appropriate to the current situation. Developing the EB approach entails a shift of the technologist's focus "up a level" away from designing individual (optimal) controllers to that of developing online algorithms that efficiently and effectively select designs from a repository of existing controller solutions. A key component of the notions presented here is that of higher level learning algorithm. This is a new application of reinforcement learning and, in particular, approximate dynamic programming, with its focus shifted to the posited higher level, and is employed, with very promising results. The author's hope for this paper is to inspire and guide future work in this promising area.

引用

页码：901 / 912

页数：12

共 50 条

[1] PHILIPPINE COMMUNISM - A NEXT HIGHER LEVEL
VANDERKROEF, JM
ISSUES & STUDIES, 1987, 23 (11): : 115 - 137
[2] Phase control of higher-order squeezing of a quantum field
Xie, RH
Rao, Q
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2002, 312 (3-4) : 421 - 430
[3] Application of ADP to intersection signal control
Li, Tao
Zhao, Dongbin
Yi, Jianqiang
ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 374 - +
[4] Using ADP to understand and replicate brain intelligence: the next level design
Werbos, Paul J.
2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, 2007, : 209 - 216
[5] Phase control of higher spectral components in the presence of a static electric field
Zhang, Chaojin
Yang, Weifeng
Song, Xiaohong
Xu, Zhizhan
JOURNAL OF PHYSICS B-ATOMIC MOLECULAR AND OPTICAL PHYSICS, 2009, 42 (05)
[6] Taking Change Control to the Next Level
Doppler, J. M.
Foss, M.
Basiri, M.
Bundy, K. L.
Stubbs, J.
TRANSFUSION, 2010, 50 : 253A - 254A
[7] ACCURATE FIELD LEVEL AND PHASE CONTROL AND MONITORING FOR A PROTON LINEAR ACCELERATOR
BATCHELOR, K
GALLAGHE.G
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 1965, NS12 (03) : 195 - +
[8] VMM application packages: the next level of productivity
Bergeron, Janick
EDN, 2008, 53 (04) : 49 - +
[9] The Baylor Project: Taking Christian Higher Education to the Next Level
Ward, Roger
CHRISTIAN HIGHER EDUCATION, 2008, 8 (01) : 74 - 81
[10] Molecularly Engineered "Janus GroEL": Application to Supramolecular Copolymerization with a Higher Level of Sequence Control
Kashiwagi, Daiki
Shen, Hao K.
Sim, Seunghyun
Sano, Koki
Ishida, Yasuhiro
Kimura, Ayumi
Niwa, Tatsuya
Taguchi, Hideki
Aida, Takuzo
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2020, 142 (31) : 13310 - 13315

← 1 2 3 4 5 →