Decomposition based Multi-Objective Evolutionary Algorithm in XCS for Multi-Objective Reinforcement Learning

被引:0
作者
Cheng, Xiu [1 ]
Browne, Will N. [1 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
来源
2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2018年
关键词
GENETIC OPTIMIZATION; SYSTEM;
D O I
10.1109/CEC.2018.8477857
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning Classifier Systems (LCSs) have been widely used to tackle Reinforcement Learning (RL) problems as they have a good generalization ability and provide a simple understandable rule-based solution. The accuracy-based LCS, XCS, has been most popularly used for single-objective RL problems. As many real-world problems exhibit multiple conflicting objectives recent work has sought to adapt XCS to Multi-Objective Reinforcement Learning (MORL) tasks. However, many of these algorithms need large storage or cannot discover the Pareto Optimal solutions. This is due to the complexity of finding a policy having multiple steps to multiple possible objectives. This paper aims to employ a decomposition strategy based on MOEA/D in XCS to approximate complex Pareto Fronts. In order to achieve multi-objective learning, a new MORL algorithm has been developed based on XCS and MOEA/D. The experimental results show that on complex bi-objective maze problems our MORL algorithm is able to learn a group of Pareto optimal solutions for MORL problems without huge storage. Analysis of the learned policies shows successful trade-offs between the distance to the reward versus the amount of reward itself.
引用
收藏
页码:622 / 629
页数:8
相关论文
共 15 条
[1]   A fast and efficient multi-objective evolutionary learning scheme for fuzzy rule-based classifiers [J].
Antonelli, Michela ;
Ducange, Pietro ;
Marcelloni, Francesco .
INFORMATION SCIENCES, 2014, 283 :36-54
[2]  
Butz M. V., 2000, TECH REP
[3]  
Cheng X, 2016, IEEE C EVOL COMPUTAT, P4007, DOI 10.1109/CEC.2016.7744298
[4]   Multiobjective optimization and evolutionary algorithms for the application mapping problem in multiprocessor system-on-chip design [J].
Erbas, Cagkan ;
Cerav-Erbas, Selin ;
Pimentel, Andy D. .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2006, 10 (03) :358-374
[5]   A multi-objective genetic optimization for fast, fuzzy rule-based credit classification with balanced accuracy and interpretability [J].
Gorzalczany, Marian B. ;
Rudzinski, Filip .
APPLIED SOFT COMPUTING, 2016, 40 :206-220
[6]  
Hu H., 2010, IEEE T NEURAL NETWOR, P1
[7]  
Lanzi P. L., 1998, THESIS
[8]   Multiobjective Optimization Problems With Complicated Pareto Sets, MOEA/D and NSGA-II [J].
Li, Hui ;
Zhang, Qingfu .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2009, 13 (02) :284-302
[9]  
Miettinen Kaisa., 2003, European Journal of Operational Research, V148, P229, DOI DOI 10.1016/S0377-2217(02)00303-X
[10]   A multi-objective genetic optimization of interpretability-oriented fuzzy rule-based classifiers [J].
Rudzinski, Filip .
APPLIED SOFT COMPUTING, 2016, 38 :118-133