GA-based fuzzy reinforcement learning for control of a magnetic bearing system

被引:66
作者
Lin, CT [1 ]
Jou, CP [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Control Engn, Hsinchu 300, Taiwan
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2000年 / 30卷 / 02期
关键词
action network; active magnetic bearing; adaptive heuristic critic; critic network;
D O I
10.1109/3477.836376
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a TD (temporal difference) and GA (genetic algorithm)-based reinforcement (TDGAR) learning method and applies it to the control of a real magnetic bearing system. The TDGAR learning scheme is a new hybrid GA, which integrates the TD prediction method and the GA to perform the reinforcement learning task, The TDGAR learning system is composed of two integrated feedforward networks. One neural network acts as a critic network to guide the learning of the other network (the action network) which determines the outputs (actions) of the TDGAR learning system, The action network can be a normal neural network or a neural fuzzy network, Using the TD prediction method, the critic network can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the action network. The action network uses the GA to adapt itself according to the internal reinforcement signal, The key concept of the TDGAR learning scheme is to formulate the internal reinforcement signal as the fitness function for the GA such that the GA can evaluate the candidate solutions (chromosomes) regularly, even during periods without external feedback from the environment. This enables the GA to proceed to new generations regularly without waiting for the arrival of the external reinforcement signal. This can usually accelerate the GA learning since a reinforcement signal may only be available at a time long after a sequence of actions has occurred in the reinforcement learning problem, The proposed TDGAR learning system has been used to control an active magnetic bearing (AMB) system in practice. A systematic design procedure is developed to achieve successful integration of all the subsystems including magnetic suspension, mechanical structure, and controller training. The results show that the TDGAR learning scheme can successfully find a neural controller or a neural fuzzy controller for a self-designed magnetic bearing system.
引用
收藏
页码:276 / 289
页数:14
相关论文
共 55 条
[1]  
ADLER D, 1993, 1993 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, P1104, DOI 10.1109/ICNN.1993.298712
[2]  
AIGUO S, 1996, P IEE INT C INT ROB, P1353
[3]  
Anderson C. W., 1987, Proceedings of the Fourth International Workshop on Machine Learning, P103
[4]  
[Anonymous], P INT JOINT C NEUR N
[5]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[6]   PATTERN-RECOGNIZING STOCHASTIC LEARNING AUTOMATA [J].
BARTO, AG ;
ANANDAN, P .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (03) :360-375
[7]  
BARTO AG, 1987, 1ST P IEEE INT C NEU, V2, P629
[8]   LEARNING AND TUNING FUZZY-LOGIC CONTROLLERS THROUGH REINFORCEMENTS [J].
BERENJI, HR ;
KHEDKAR, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05) :724-740
[9]   Nonlinear control of a magnetic levitation system without premagnetization [J].
Charara, A ;
DeMiras, J ;
Caron, B .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 1996, 4 (05) :513-523
[10]  
Cox E., 1993, Proceedings. The Second Annual International Conference on Artificial Intelligence Applications on Wall Street: Tactical and Strategic Computing Technologies, P280