Fractional Fuzzy Clustering and Particle Whale Optimization-Based MapReduce Framework for Big Data Clustering

被引:7
|
作者
Kulkarni, Omkaresh [1 ]
Jena, Sudarson [2 ]
Sanjay, C. H. [3 ]
机构
[1] GITAM Univ, Gandhi Inst Technol & Management, Hyderabad 502329, Telangana, India
[2] Sambalpur Univ Inst Informat Technol, Dept Comp Sci Engn & Applicat, Sambalpur, Orissa, India
[3] GITAM Univ, Hyderabad, India
关键词
Big data clustering; fractional theory; TSK clustering; MRF; PSO; WOA; DISCOVERY; ALGORITHM;
D O I
10.1515/jisys-2018-0117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent advancements in information technology and the web tend to increase the volume of data used in day-to-day life. The result is a big data era, which has become a key issue in research due to the complexity in the analysis of big data. This paper presents a technique called FPWhale-MRF for big data clustering using the MapReduce framework (MRF), by proposing two clustering algorithms. In FPWhale-MRF, the mapper function estimates the cluster centroids using the Fractional Tangential-Spherical Kernel clustering algorithm, which is developed by integrating the fractional theory into a Tangential-Spherical Kernel clustering approach. The reducer combines the mapper outputs to find the optimal centroids using the proposed Particle-Whale (P-Whale) algorithm, for the clustering. The P-Whale algorithm is proposed by combining Whale Optimization Algorithm with Particle Swarm Optimization, for effective clustering such that its performance is improved. Two datasets, namely localization and skin segmentation datasets, are used for the experimentation and the performance is evaluated regarding two performance evaluation metrics: clustering accuracy and DB-index. The maximum accuracy attained by the proposed FPWhale-MRF technique is 87.91% and 90% for the localization and skin segmentation datasets, respectively, thus proving its effectiveness in big data clustering.
引用
收藏
页码:1496 / 1513
页数:18
相关论文
共 50 条
  • [11] Survey on clustering methods : Towards fuzzy clustering for big data
    Ben Ayed, Abdelkarim
    Ben Halima, Mohamed
    Alimi, Adel M.
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 331 - 336
  • [12] MapReduce-based K-Prototypes Clustering Method for Big Data
    Ben HajKacem, Mohamed Aymen
    Ben N'cir, Chiheb-Eddine
    Essoussi, Nadia
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 1030 - 1036
  • [13] Mapreduce fuzzy c-means ensemble clustering with gentle adaboost for big data analytics
    Padmapriya K.M.
    Anandhi B.
    Vijayakumar M.
    International Journal of Business Intelligence and Data Mining, 2021, 19 (02): : 170 - 188
  • [14] A Modified Hybrid Fuzzy Clustering Method for Big Data
    Khoshkbarchi, Amir
    Kamali, Ali
    Amjadi, Mehdi
    Haeri, Maryam Amir
    2016 8TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2016, : 196 - 201
  • [15] A Particle Swarm Optimization-Based Heuristic for Software Module Clustering Problem
    Prajapati, Amarjeet
    Chhabra, Jitender Kumar
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) : 7083 - 7094
  • [16] MapReduce-Enhanced Fuzzy K-Least Medians for Qualitative Clustering of Document Big Data
    Sardar, Tanvir Habib
    Ansari, Zahid Ahmed
    Theerthagiri, Prasannavenkatesan
    Karthikeyan, P.
    Ayyasamy, Vadivel
    Saini, Dilip Kumar Jang Bahadur
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2025, 37 (4-5)
  • [17] Analysis of particle swarm optimization based hierarchical data clustering approaches
    Alam, Shafiq
    Dobbie, Gillian
    Rehman, Saeed Ur
    SWARM AND EVOLUTIONARY COMPUTATION, 2015, 25 : 36 - 51
  • [18] Overlapping Cluster Control Mechanism for Particle Swarm Optimization-based Clustering Algorithm
    Suharjono, Amin
    Wirawan
    Hendrantoro, G.
    2011 IEEE REGION 10 CONFERENCE TENCON 2011, 2011, : 124 - 127
  • [19] A New Hybrid Evolutionary-based Data Clustering Using Fuzzy Particle Swarm Optimization
    Youssef, Sherin M.
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 717 - 724
  • [20] Clustering of Mixed Data by Integrating Fuzzy, Probabilistic, and Collaborative Clustering Framework
    Pathak, Arkanath
    Pal, Nikhil R.
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2016, 18 (03) : 339 - 348