Moth-Flame Optimization-Bat Optimization: Map-Reduce Framework for Big Data Clustering Using the Moth-Flame Bat Optimization and Sparse Fuzzy C-Means

被引:37
作者
Ravuri, Vasavi [1 ]
Vasundra, S. [2 ,3 ]
机构
[1] VNRVJIET, Hyderabad 500090, Telangana, India
[2] JNTUA Univ, Dept CSE, Ananthapuramu, India
[3] JNTUA Univ, Ananthapuramu, India
关键词
big data; big data clustering; fuzzy; optimization algorithm; spark architecture; ALGORITHM; SELECTION;
D O I
10.1089/big.2019.0125
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The technical advancements in big data have become popular and most desirable among users for storing, processing, and handling huge data sets. However, clustering using these big data sets has become a major challenge in big data analysis. The conventional clustering algorithms used scalable solutions for managing huge data sets. Thus, this study proposes a technique for big data clustering using the spark architecture. The proposed technique undergoes two steps for clustering the big data, involving feature selection and clustering, performed in the initial cluster nodes of spark architecture. At first, the initial cluster nodes read the big data from various distributed systems, and the optimal features are selected and placed in the feature vector based on the proposed moth-flame optimization-based bat (MFO-Bat) algorithm, which is designed by integrating MFO and Bat algorithms. Then, the selected features are fed to the final cluster nodes of spark, which uses the sparse-fuzzy C-means method for performing optimal clustering. The performance of proposed MFO-Bat outperformed other existing methods with a maximal classification accuracy of 95.806%, Dice coefficient of 99.181%, and Jaccard coefficient of 98.376%, respectively.
引用
收藏
页码:203 / 217
页数:15
相关论文
共 29 条
[1]   Memory-enriched big bang-big crunch optimization algorithm for data clustering [J].
Bijari, Kayvan ;
Zare, Hadi ;
Veisi, Hadi ;
Bobarshad, Hossein .
NEURAL COMPUTING & APPLICATIONS, 2018, 29 (06) :111-121
[2]   Combined First-Principles Calculations and Experimental Study of the Phonon Modes in the Multiferroic Compound GeV4S8 [J].
Cannuccia, Elena ;
Vinh Ta Phuoc ;
Briere, Benjamin ;
Cario, Laurent ;
Janod, Etienne ;
Corraze, Benoit ;
Lepetit, Marie Bernadette .
JOURNAL OF PHYSICAL CHEMISTRY C, 2017, 121 (06) :3522-3529
[3]   Sparse Regularization in Fuzzy c-Means for High-Dimensional Data Clustering [J].
Chang, Xiangyu ;
Wang, Qingnan ;
Liu, Yuewen ;
Wang, Yu .
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (09) :2616-2627
[4]  
Chormunge Smita, 2018, Journal of Electrical Systems and Information Technology, V5, P542, DOI 10.1016/j.jesit.2017.06.004
[5]  
Dash M, 2000, FEATURE SELECTION CL, P110
[6]   Multidimensional query reformulation with measure decomposition [J].
Diamantini, Claudia ;
Potena, Domenico ;
Storti, Emanuele .
INFORMATION SYSTEMS, 2018, 78 :23-39
[7]   Research and implementation of user clustering based on MapReduce in multimedia big data [J].
Fan, Tongke .
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (08) :10017-10031
[8]  
Gangurde HD, 2014, FEATURE SELECTION US, P1
[9]  
Gu Z, 2017, TRANSPORT RES C-EMER, V94, P151
[10]   Big data clustering with varied density based on MapReduce [J].
Heidari, Safanaz ;
Alborzi, Mahmood ;
Radfar, Reza ;
Afsharkazemi, Mohammad Ali ;
Ghatari, Ali Rajabzadeh .
JOURNAL OF BIG DATA, 2019, 6 (01)