Determining the minimally allowed rule-distance for the incremental rule-base construction phase of the FRIQ-learning

被引:0
作者
Tompa, Tamas [1 ]
Kovacs, Szilveszter [1 ]
机构
[1] Univ Miskolc, Dept Informat Technol, Miskolc, Hungary
来源
2018 19TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC) | 2018年
关键词
reinforcement learning; Q-learning; Fuzzy Q-learning; FRIQ-learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a novel minimally allowed rule-distance determination methodology for the incremental rule-base construction phase of the FRIQ-learning. The FRIQ-learning process starts from an empty Q-function represented by an empty fuzzy rule-base. Then it creates the fuzzy Q-function in an iterative incremental manner by adding new fuzzy rules in cases where the Q-function update seems to be inefficient by tuning the existing rule base. According to the original FRIQ-learning a new rule is inserted if the Q-function update is high, and even the closest existing fuzzy rule is farther than a preset limit. Improper choice of the minimal rule distance limit can lead to unnecessary high number of inserted rules during the initial steps of the iteration. The main goal of this paper is to introduce a new rule-distance limit calculation methodology which can avoid the unnecessary high number of inserted rules in the initial phase of the iteration. For demonstrating the efficiency of the proposed rule-base creation method, the "mountain car" benchmark application example is also discussed briefly in the paper.
引用
收藏
页码:480 / 483
页数:4
相关论文
共 12 条
[1]  
[Anonymous], 1998, REINFORCEMENT LEARNI
[2]   FUZZY-SETS AND VAGUE ENVIRONMENTS [J].
KLAWONN, F .
FUZZY SETS AND SYSTEMS, 1994, 66 (02) :207-221
[3]  
Kovacs S., 1997, Tatra Mountains Mathematical Publications, V12, P169
[4]  
Kovacs Sz., 1996, P 6 INT C INF PROC M, P477
[5]  
Kovacs Sz., 1997, P 7 INT FUZZ SYST AS, P144
[6]  
Krizsan Z., 2008, P 9 INT S HUNG RES C, P531
[7]  
Tompa Tamas, 2017, APPL MACH INT INF SA
[8]  
Vincze D., 2009, P 10 INT S HUNG RES, P533
[9]  
Vincze D., 2015, Recent Innovations in Mechatronics (RIiM), V2
[10]  
Vincze D, 2009, APPL COMP INT INF 20