Multiple Seeds Based Evolutionary Algorithm for Mining Boolean Association Rules

被引:0
作者
Kabir, Mir Md. Jahangir [1 ]
Xu, Shuxiang [1 ]
Kang, Byeong Ho [1 ]
Zhao, Zongyuan [1 ]
机构
[1] Univ Tasmania, Sch Engn & ICT, Hobart, Tas, Australia
来源
TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING (PAKDD 2016) | 2016年 / 9794卷
关键词
Multiple seeds based genetic algorithm; Boolean association rules; Initial population; Conditional probability; Evolutionary learning; GENETIC ALGORITHM;
D O I
10.1007/978-3-319-42996-0_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the association rule mining algorithms use a single seed for initializing a population without paying attention to the effectiveness of an initial population in an evolutionary learning. Recently, researchers show that an initial population has significant effects on producing good solutions over several generations of a genetic algorithm. There are two significant challenges raised by single seed based genetic algorithms for real world applications: (1) solutions of a genetic algorithm are varied, since different seeds generate different initial populations, (2) it is a hard process to define an effective seed for a specific application. To avoid these problems, in this paper we propose a new multiple seeds based genetic algorithm (MSGA) which generates multiple seeds from different domains of a solution space to discover high quality rules from a large data set. This approach introduces m-domain model and m-seeds selection process through which the whole solution space is subdivided into m-number of same size domains and from each domain it selects a seed. By using these seeds, this method generates an effective initial population to perform an evolutionary learning of the fitness value of each rule. As a result, this method obtains strong searching efficiency at the beginning of the evolution and achieves fast convergence along with the evolution. MSGA is tested with different mutation and crossover operators for mining interesting Boolean association rules from different real world data sets and compared the results with different single seeds based genetic algorithms.
引用
收藏
页码:61 / 72
页数:12
相关论文
共 14 条
[1]  
Borgelt C., 2003, IEEE ICDM WORKSH FRE, V90, P280
[2]   Dynamic diversity control in genetic algorithm for mining unsearched solution space in TSP problems [J].
Chang, Pei-Chann ;
Huang, Wei-Hsiu ;
Ting, Ching-Jung .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (03) :1863-1878
[3]   On the discovery of association rules by means of evolutionary algorithms [J].
del Jesus, Maria J. ;
Gamez, Jose A. ;
Gonzalez, Pedro ;
Puerta, Jose M. .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (05) :397-415
[4]  
Hipp Jochen, 2000, ACM SIGKDD Explorations, V2, P58, DOI [10.1145/360402.360421, DOI 10.1145/360402.360421]
[5]   A New Evolutionary Algorithm for Extracting a Reduced Set of Interesting Association Rules [J].
Kabir, Mir Md. Jahangir ;
Xu, Shuxiang ;
Kang, Byeong Ho ;
Zhao, Zongyuan .
NEURAL INFORMATION PROCESSING, PT II, 2015, 9490 :133-142
[6]  
Kabir MMJ, 2015, IEEE C EVOL COMPUTAT, P39, DOI 10.1109/CEC.2015.7256872
[7]   Quasi-random initial population for genetic algorithms [J].
Maaranen, H ;
Miettinen, K ;
Mäkelä, MM .
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2004, 47 (12) :1885-1895
[8]   On initial populations of a genetic algorithm for continuous optimization problems [J].
Maaranen, Heikki ;
Miettinen, Kaisa ;
Penttinen, Antti .
JOURNAL OF GLOBAL OPTIMIZATION, 2007, 37 (03) :405-436
[9]   A New Multiobjective Evolutionary Algorithm for Mining a Reduced Set of Interesting Positive and Negative Quantitative Association Rules [J].
Martin, Diana ;
Rosete, Alejandro ;
Alcala-Fdez, Jesus ;
Herrera, Francisco .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2014, 18 (01) :54-69
[10]  
Mukhopadhyay A, 2014, IEEE T EVOLUT COMPUT, V18, P4, DOI 10.1109/TEVC.2013.2290086