The best-so-far ABC with multiple patrilines for clustering problems

被引:22
作者
Banharnsakun, Anan [1 ]
Sirinaovakul, Booncharoen [1 ]
Achalakul, Tiranee [1 ]
机构
[1] King Mongkuts Univ Technol Thonburi, Dept Comp Engn, Bangkok, Thailand
关键词
Clustering; Best-so-far ABC; Multiple patrilines; Swarm intelligence; Optimization; Distributed environments; Parallel computing; INFORMATION-RETRIEVAL; COLONY APPROACH; OPTIMIZATION; ALGORITHM; PATTERN; RECOGNITION; NETWORKS; MODEL;
D O I
10.1016/j.neucom.2012.02.047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is an important process in many application domains such as machine learning, data mining, pattern recognition, image analysis, information retrieval, and bioinformatics. The main objective of clustering is to search for hidden patterns that may exist in datasets. Since the clustering problem is considered to be NP-hard, previous research has applied bio-inspired heuristic methods to solve such problems. In this paper we propose an effective method for clustering using an algorithm inspired by the decision making processes of bee swarms. The algorithm is called the Best-so-far Artificial Bee Colony with multiple patrilines. In the Best-so-far method, the solution direction is biased toward the Best-so-far solution rather than a neighboring solution proposed in the original Artificial Bee Colony algorithm. We introduce another bee-inspired concept called multiple patrilines to further improve the diversity of solutions and allow the calculations to be distributed among multiple computing units. We empirically assess the performance of our proposed method on several standard datasets taken from the UCI Machine Learning Repository. The results show that the proposed method produces solutions that are as good as or better than the current state-of-the-art clustering techniques reported in the literature. Furthermore, to demonstrate the computing performance and scalability of the algorithm, we assess the algorithm on a large disk drive manufacturing dataset. The results indicate that our distributed Best-so-far approach is scalable and produces good solutions while significantly improving the processing time. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:355 / 366
页数:12
相关论文
共 46 条
[1]   Application of machine learning techniques to analyse student interactions and improve the collaboration process [J].
Anaya, Antonio R. ;
Boticario, Jesus G. .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (02) :1171-1181
[2]  
[Anonymous], 2005, Technical Report-TR06
[3]  
[Anonymous], INT WORKSH TEMP SPAT
[4]  
[Anonymous], 1999, HPL-1999-124
[5]   An evolutionary technique based on K-Means algorithm for optimal clustering in RN [J].
Bandyopadhyay, S ;
Maulik, U .
INFORMATION SCIENCES, 2002, 146 (1-4) :221-237
[6]  
Banharnsakun A., 2010, 2010 Second World Congress on Nature and Biologically Inspired Computing (NaBIC 2011), P13, DOI 10.1109/NABIC.2010.5716309
[7]   Job Shop Scheduling with the Best-so-far ABC [J].
Banharnsakun, Anan ;
Sirinaovakul, Booncharoen ;
Achalakul, Tiranee .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2012, 25 (03) :583-593
[8]   The best-so-far selection in Artificial Bee Colony algorithm [J].
Banharnsakun, Anan ;
Achalakul, Tiranee ;
Sirinaovakul, Booncharoen .
APPLIED SOFT COMPUTING, 2011, 11 (02) :2888-2901
[9]   Long distance bigram models applied to word clustering [J].
Bassiou, Nikoletta ;
Kotropoulos, Constantine .
PATTERN RECOGNITION, 2011, 44 (01) :145-158
[10]   Ant colony optimization techniques for the vehicle routing problem [J].
Bell, JE ;
McMullen, PR .
ADVANCED ENGINEERING INFORMATICS, 2004, 18 (01) :41-48