An extended probabilistic model building genetic network programming using both of good and bad individuals

被引:10
作者
Li, Xianneng [1 ]
Mabu, Shingo [1 ]
Hirasawa, Kotaro [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Wakamatsu Ku, Kitakyushu, Fukuoka 8080135, Japan
基金
日本学术振兴会;
关键词
probabilistic modeling; estimation of distribution algorithms (EDAs); bad individuals; reinforcement learning; probabilistic model building genetic network programming;
D O I
10.1002/tee.21864
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Classical estimation of distribution algorithms (EDAs) generally use truncation selection to estimate the distribution of the good individuals while ignoring the bad ones. However, various researches in evolutionary algorithms (EAs) have reported that the bad individuals may affect and help solving the problem. This paper proposes a new method to use the bad individuals by studying the substructures rather than the entire individual structures to solve reinforcement learning (RL) problems, which generally factorize their entire solutions to the sequences of state-action pairs. This work was studied in a recent graph-based EDA named probabilistic model building genetic network programming (PMBGNP), which could solve RL problems successfully, to propose an extended PMBGNP. The effectiveness of this work is verified in an RL problem, namely robot control. Compared to other related work, results show that the proposed method can significantly speed up the evolution efficiency. (c) 2013 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.
引用
收藏
页码:339 / 347
页数:9
相关论文
共 31 条
[1]  
[Anonymous], 1994, Tech. Rep., DOI DOI 10.5555/865123
[2]   Approaches to Selection and their Effect on Fitness Modelling in an Estimation of Distribution Algorithm [J].
Brownlee, Alexander E. I. ;
McCall, John A. W. ;
Zhang, Qingfu ;
Brown, Deryck F. .
2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, :2621-+
[3]   A Hybrid EDA for Protein Folding Based on HP Model [J].
Chen, Benhui ;
Hu, Jinglu .
IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2010, 5 (04) :459-466
[4]  
Handa H., 2009, Proceedings of the 11th annual conference on genetic and evolutionary computation, P405
[5]   A Bayesian Network Approach to Program Generation [J].
Hasegawa, Yoshihiko ;
Iba, Hitoshi .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2008, 12 (06) :750-764
[6]  
Hirasawa K, 2001, IEEE C EVOL COMPUTAT, P1276, DOI 10.1109/CEC.2001.934337
[7]   A double-deck elevator group supervisory control system using genetic network programming [J].
Hirasawa, Kotaro ;
Eguchi, Toru ;
Zhou, Jin ;
Yu, Lu ;
Hu, Jinglu ;
Markon, Sandor .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (04) :535-550
[8]   Estimation of Distribution Algorithms Making Use of Both High Quality and Low Quality Individuals [J].
Hong, Yi ;
Zhu, Guopu ;
Kwong, Sam ;
Ren, Qingsheng .
2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, :1806-+
[9]  
Katagiri H, 2000, IEEE SYS MAN CYBERN, P3829, DOI 10.1109/ICSMC.2000.886607
[10]  
Larranaga P., 2001, Estimation of Distribution Algorithms: ANew Tool for Evolutionary Computation