Scalable incremental fuzzy consensus clustering algorithm for handling big data

被引:0
|
作者
Preeti Jha
Aruna Tiwari
Neha Bharill
Milind Ratnaparkhe
Neha Nagendra
Mukkamalla Mounika
机构
[1] Indian Institute of Technology,
[2] Mahindra University,undefined
[3] Ecole Centrale School of Engineering,undefined
[4] ICAR-Indian Institute of Soybean Research,undefined
来源
Soft Computing | 2021年 / 25卷
关键词
Fuzzy consensus clustering; Space optimization; Incremental algorithm; Big data;
D O I
暂无
中图分类号
学科分类号
摘要
Consensus clustering can produce novel, stable, and robust clustering results. Consensus clustering intends to merge a few existing basic segments into a coordinated one, and this has been broadly perceived as a promising solution for heterogeneous data clustering for big data. Even though many clustering algorithms have been proposed, getting a decent quality segment with high effectiveness is still not yet decided. In this paper, we propose a scalable incremental fuzzy consensus clustering (SIFCC) algorithm for a big data framework. It has been implemented on Apache Spark cluster framework, a distributed data stream environment for handling big data by considering the data as a set of data subsets that are processed incrementally. Sparks work great for iterative algorithms by supporting in-memory calculations, scalability, etc. SIFCC not only facilitates efficient big data clustering, but also improves the quality of clusters, performs storage space optimization, and time complexity during clustering. To establish the comparison, we designed and implemented the scalable model of existing fuzzy consensus clustering (FCC) on Apache Spark cluster, named as a scalable fuzzy consensus clustering (SFCC). Extensive experiments on real-world datasets show that the SIFCC algorithm achieves the better potential for clustering of Big Data in comparison with SFCC.
引用
收藏
页码:8703 / 8719
页数:16
相关论文
共 50 条
  • [1] Scalable incremental fuzzy consensus clustering algorithm for handling big data
    Jha, Preeti
    Tiwari, Aruna
    Bharill, Neha
    Ratnaparkhe, Milind
    Nagendra, Neha
    Mounika, Mukkamalla
    SOFT COMPUTING, 2021, 25 (13) : 8703 - 8719
  • [2] Speeding up the large-scale consensus fuzzy clustering for handling Big Data
    Sassi Hidri, Minyar
    Zoghlami, Mohamed Ali
    Ben Ayed, Rahma
    FUZZY SETS AND SYSTEMS, 2018, 348 : 50 - 74
  • [3] Fuzzy Consensus Clustering With Applications on Big Data
    Wu, Junjie
    Wu, Zhiang
    Cao, Jie
    Liu, Hongfu
    Chen, Guoqing
    Zhang, Yanchun
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2017, 25 (06) : 1430 - 1445
  • [4] Research on incremental clustering algorithm for big data
    Yang X.
    Applied Mathematics and Nonlinear Sciences, 2023, 8 (02) : 169 - 180
  • [5] PBIRCH: A scalable parallel clustering algorithm for incremental data
    Garg, Ashwani
    Mangla, Ashish
    Gupta, Neelima
    Bhatnagar, Vasudha
    10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 315 - +
  • [6] A Novel Scalable Kernelized Fuzzy Clustering Algorithms Based on In-Memory Computation for Handling Big Data
    Jha, Preeti
    Tiwari, Aruna
    Bharill, Neha
    Ratnaparkhe, Milind
    Mounika, Mukkamalla
    Nagendra, Neha
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (06): : 908 - 919
  • [7] Sampling-Based Consensus Fuzzy Clustering on Big Data
    Zoghlami, Mohamed Ali
    Sassi Hidri, Minyar
    Ben Ayed, Rahma
    2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 1501 - 1508
  • [8] Consensus Clustering on Big Data
    Liu, Hongfu
    Cheng, Gong
    Wu, Junjie
    2015 12TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2015,
  • [9] IF-CLARANS: Intuitionistic Fuzzy Algorithm for Big Data Clustering
    Shili, Hechmi
    Ben Romdhane, Lotfi
    INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: THEORY AND FOUNDATIONS, PT II, 2018, 854 : 39 - 50
  • [10] Scalable Clustering Algorithms for Big Data: A Review
    Mahdi, Mahmoud A.
    Hosny, Khalid M.
    Elhenawy, Ibrahim
    IEEE ACCESS, 2021, 9 : 80015 - 80027