On the study of embedding fuzzy concept into reinforcement learning schemes

被引:1
作者
Su, SF [1 ]
Hsieh, SH [1 ]
Chuang, CC [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Elect Engn, Taipei 106, Taiwan
关键词
reinforcement learning; fuzzy; box-structure; learning control;
D O I
10.1080/02533839.2001.9670633
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The paper is to study how to correctly embed fuzzy concept into reinforcement learning schemes with the box-type structure. In fact, precious work has already proposed the idea of incorporating fuzzy concept into reinforcement learning systems. However, in that approach only two state variables were considered. In our implementation, when Four state variables were used with the fuzzification process, the results show a complete disaster. After analyzing the behaviors of the approach, three problems have been identified. The first one is the credit assignment problem about the learning process in the use of the fuzzification process. Since the considered system is of bang-ban:: control, all fired boxes may not all agree on the fired action. Thus, in the learning process, such a credit assignment problem must be taken into account. The second one is the weight domination problem. In the early stage, some learned boxes even with very small firing strengths might dominate the action generation process because other boxes are unlearned. Finally, the use of fuzzy schemes in generating the predicted evaluation under the temporal difference mechanism seems to be a negative factor in our example. The modifications to overcome those problems were also proposed in the paper. The results have shown the correctness of our modifications.
引用
收藏
页码:357 / 368
页数:12
相关论文
共 21 条
[1]  
Anderson C. W., 1989, IEEE Control Systems Magazine, V9, P31, DOI 10.1109/37.24809
[2]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[3]   LEARNING AND TUNING FUZZY-LOGIC CONTROLLERS THROUGH REINFORCEMENTS [J].
BERENJI, HR ;
KHEDKAR, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05) :724-740
[4]  
Grigore O, 2000, PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY 2000, VOLS 1 AND 2, P662
[5]  
HSIEH SH, 1997, THESIS NATL TAIWAN U
[6]  
KIM PL, 1995, THESIS NATL TAIWAN U
[7]   A SELF-LEARNING RULE-BASED CONTROLLER EMPLOYING APPROXIMATE REASONING AND NEURAL NET CONCEPTS [J].
LEE, CC .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1991, 6 (01) :71-93
[8]   GA-based fuzzy reinforcement learning for control of a magnetic bearing system [J].
Lin, CT ;
Jou, CP .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2000, 30 (02) :276-289
[9]   REINFORCEMENT STRUCTURE PARAMETER LEARNING FOR NEURAL-NETWORK-BASED FUZZY-LOGIC CONTROL-SYSTEMS [J].
LIN, CT ;
LEE, CSG .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1994, 2 (01) :46-63
[10]  
LIN CT, 1996, NEURAL FUZZY SYSTEM