Comprehensive review on Clustering Techniques and its application on High Dimensional Data

被引:10
作者
Alam, Afroj [1 ]
Muqeem, Mohd [1 ]
Ahmad, Sultan [2 ]
机构
[1] Integral Univ, Dept Comp Applicat, Lucknow, UP, India
[2] Prince Sattam Bin Abdulaziz Univ, Dept Comp Sci, Coll Comp Engn & Sci, Al Kharj, Saudi Arabia
来源
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY | 2021年 / 21卷 / 06期
关键词
Data mining; Clustering; K-means; PAM; CLARA; ETL; High-dimensional datasets; curse of dimensionality;
D O I
10.22937/IJCSNS.2021.21.6.31
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is a most powerful un-supervised machine learning techniques for division of instances into homogenous group, which is called cluster. This Clustering is mainly used for generating a good quality of cluster through which we can discover hidden patterns and knowledge from the large datasets. It has huge application in different field like in medicine field, healthcare, gene-expression, image processing, agriculture, fraud detection, profitability analysis etc. The goal of this paper is to explore both hierarchical as well as partitioning clustering and understanding their problem with various approaches for their solution. Among different clustering K-means is better than other clustering due to its linear time complexity. Further this paper also focused on data mining that dealing with high-dimensional datasets with their problems and their existing approaches for their relevancy
引用
收藏
页码:237 / 244
页数:8
相关论文
共 28 条
[1]   Automatic subspace clustering of high dimensional data [J].
Agrawal, R ;
Gehrke, J ;
Gunopulos, D ;
Raghavan, P .
DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 11 (01) :5-33
[2]  
Ahmad Sultan, 2019, IEEE 2 INT C SMART S
[3]   Clustering high dimensional data [J].
Assent, Ira .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (04) :340-350
[4]  
Bansal A., 2017, ARTIC INT J COMPUT A, V157, P975, DOI 10.5120/ijca2017912719
[5]   Efficient agglomerative hierarchical clustering [J].
Bouguettaya, Athman ;
Yu, Qi ;
Liu, Xumin ;
Zhou, Xiangmin ;
Song, Andy .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (05) :2785-2797
[6]   Hierarchical Clustering: Objective Functions and Algorithms [J].
Cohen-Addad, Vincent ;
Kanade, Varun ;
Mallmann-Trenn, Frederik ;
Mathieu, Claire .
JOURNAL OF THE ACM, 2019, 66 (04)
[7]   Adaptive dimension reduction for clustering high dimensional data [J].
Ding, C ;
He, XF ;
Zha, HY ;
Simon, HD .
2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, :147-154
[8]  
Elavarasi SA., 2011, International Journal of Enterprise Computing and Business Systems, V1
[9]   How many clusters? Which clustering method? Answers via model-based cluster analysis [J].
Fraley, C ;
Raftery, AE .
COMPUTER JOURNAL, 1998, 41 (08) :578-588
[10]   Clustering-based short-term load forecasting for residential electricity under the increasing-block pricing tariffs in China [J].
Fu, Xin ;
Zeng, Xiao-Jun ;
Feng, Pengpeng ;
Cai, Xiuwen .
ENERGY, 2018, 165 :76-89