Data Clustering Algorithms: Experimentation and Comparison

被引:0
作者
Khandare, Anand [1 ]
Pawar, Rutika [1 ]
机构
[1] Thakur Coll Engn & Technol, Dept Comp Engn, Mumbai, Maharashtra, India
来源
INTELLIGENT COMPUTING AND NETWORKING, IC-ICN 2021 | 2022年 / 301卷
关键词
Clustering; Data mining; KDD; K-Means clustering; DBSCAN; Agglomerative; Clusters;
D O I
10.1007/978-981-16-4863-2_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to increasing databases of all kinds, clustering has become one of the most essential tasks to classify the data. Clustering means to group or divide the data points based on their similarity to each other. Clustering can be stated as an unsupervised data mining technique that describes the nature of datasets. The main objective of data clustering is to obtain groups of similar entities. There are various methods of clustering such as hierarchical, partition-based, method-based, grid-based, and model-based. This paper provides a detailed study about clustering, its working processes. Along with basic information, detailed information about validity measures required to evaluate algorithms is discussed. This paper reviews clustering algorithms like K-Means, Agglomerative, and DBSCAN. A tabular comparison of algorithms is represented to acquire in-depth knowledge. The results obtained after experimenting with algorithms are also been discussed at the end.
引用
收藏
页码:86 / 99
页数:14
相关论文
共 50 条
  • [1] Comparison of Data Mining Clustering Algorithms
    Shah, Chintan
    Jivani, Anjali
    2013 4TH NIRMA UNIVERSITY INTERNATIONAL CONFERENCE ON ENGINEERING (NUICONE 2013), 2013,
  • [2] COMPARISON OF ALGORITHMS FOR CLUSTERING INCOMPLETE DATA
    Matyja, Artur
    Siminski, Krzysztof
    FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2014, 39 (02) : 107 - 127
  • [3] Comparison of Clustering Algorithms for Revenue and Cost Analysis
    Boyko, Nataliya
    Hetman, Solomiya
    Kots, Iryna
    COLINS 2021: COMPUTATIONAL LINGUISTICS AND INTELLIGENT SYSTEMS, VOL I, 2021, 2870
  • [4] A survey on parallel clustering algorithms for Big Data
    Zineb Dafir
    Yasmine Lamari
    Said Chah Slaoui
    Artificial Intelligence Review, 2021, 54 : 2411 - 2443
  • [5] A survey on parallel clustering algorithms for Big Data
    Dafir, Zineb
    Lamari, Yasmine
    Slaoui, Said Chah
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (04) : 2411 - 2443
  • [6] Comparative study of Data Mining Clustering algorithms
    Venkatkumar, Iyer Aurobind
    Shardaben, Sanatkumar Jayantibhai Kondhol
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON DATA SCIENCE & ENGINEERING (ICDSE), 2016, : 72 - 78
  • [7] Research on data stream clustering algorithms
    Shifei Ding
    Fulin Wu
    Jun Qian
    Hongjie Jia
    Fengxiang Jin
    Artificial Intelligence Review, 2015, 43 : 593 - 600
  • [8] Research on data stream clustering algorithms
    Ding, Shifei
    Wu, Fulin
    Qian, Jun
    Jia, Hongjie
    Jin, Fengxiang
    ARTIFICIAL INTELLIGENCE REVIEW, 2015, 43 (04) : 593 - 600
  • [9] DBSCAN and CLARA Clustering Algorithms and their usage for the Soil Data Clustering
    Vukcevic, M.
    Popovic-Bugarin, V.
    Dervic, E.
    2019 8TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2019, : 456 - 461
  • [10] Distributed Data Clustering by Neural Network Algorithms
    Kholod, Ivan I.
    Rukavitsyn, Andrey N.
    Reva, Nikita
    Shorov, Andrey V.
    PROCEEDINGS OF THE 2019 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (EICONRUS), 2019, : 249 - 253