Data clustering: application and trends

被引:72
|
作者
Oyewole, Gbeminiyi John [1 ]
Thopil, George Alex [1 ]
机构
[1] Univ Pretoria, Dept Engn & Technol Management, Pretoria, South Africa
关键词
Clustering; Clustering classification; Clustering components; Industry applications; Clustering algorithms; Clustering trends; PATTERN-CLASSIFICATION; R PACKAGE; ALGORITHMS; SYSTEM; ICT; INFORMATION; EXPLORATION; INDICATORS; CHALLENGES; MANAGEMENT;
D O I
10.1007/s10462-022-10325-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering has primarily been used as an analytical technique to group unlabeled data for extracting meaningful information. The fact that no clustering algorithm can solve all clustering problems has resulted in the development of several clustering algorithms with diverse applications. We review data clustering, intending to underscore recent applications in selected industrial sectors and other notable concepts. In this paper, we begin by highlighting clustering components and discussing classification terminologies. Furthermore, specific, and general applications of clustering are discussed. Notable concepts on clustering algorithms, emerging variants, measures of similarities/dissimilarities, issues surrounding clustering optimization, validation and data types are outlined. Suggestions are made to emphasize the continued interest in clustering techniques both by scholars and Industry practitioners. Key findings in this review show the size of data as a classification criterion and as data sizes for clustering become larger and varied, the determination of the optimal number of clusters will require new feature extracting methods, validation indices and clustering techniques. In addition, clustering techniques have found growing use in key industry sectors linked to the sustainable development goals such as manufacturing, transportation and logistics, energy, and healthcare, where the use of clustering is more integrated with other analytical techniques than a stand-alone clustering technique.
引用
收藏
页码:6439 / 6475
页数:37
相关论文
共 50 条
  • [21] A New Measure of Stability of Clustering Solutions: Application to Data Partitioning
    Saha, Sriparna
    Bandyopadhyay, Sanghamitra
    PROCEEDINGS 2009 INTERNATIONAL CONFERENCE ON ADAPTIVE AND INTELLIGENT SYSTEMS, ICAIS 2009, 2009, : 181 - 186
  • [22] Data mining application in banking sector with clustering and classification methods
    Calis, Asli
    Boyaci, Ahmet
    Baynal, Kasim
    2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND OPERATIONS MANAGEMENT (IEOM), 2015,
  • [23] Clustering of periodic multichannel timeseries data with application to plasma fluctuations
    Haskey, S. R.
    Blackwell, B. D.
    Pretty, D. G.
    COMPUTER PHYSICS COMMUNICATIONS, 2014, 185 (06) : 1669 - 1680
  • [24] A New Line Symmetry Distance and Its Application to Data Clustering
    Saha, Sriparna
    Bandyopadhyay, Sanghamitra
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2009, 24 (03) : 544 - 556
  • [25] Stochastic Numerical P Systems With Application in Data Clustering Problems
    Yang, Jinyu
    Peng, Hong
    Luo, Xiaohui
    Wang, Jun
    IEEE ACCESS, 2020, 8 : 31507 - 31518
  • [26] Multiresolution clustering of dependent functional data with application to climate reconstruction
    Abramowicz, Konrad
    Schelin, Lina
    de Luna, Sara Sjostedt
    Strandberg, Johan
    STAT, 2019, 8 (01):
  • [27] A New Class Topper Optimization Algorithm with an Application to Data Clustering
    Das, Pranesh
    Das, Dushmanta Kumar
    Dey, Shouvik
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2020, 8 (04) : 948 - 959
  • [28] Feature Weighted Kernel Clustering with Application to Medical Data Analysis
    Jia, Hong
    Cheung, Yiu-ming
    BRAIN AND HEALTH INFORMATICS, 2013, 8211 : 496 - 505
  • [29] An improved clustering algorithm and its application in IoT data analysis
    Yao, Xuanxia
    Wang, Jiafei
    Shen, Mengyu
    Kong, Huafeng
    Ning, Huansheng
    COMPUTER NETWORKS, 2019, 159 : 63 - 72
  • [30] Regularized matrix data clustering and its application to image analysis
    Gao, Xu
    Shen, Weining
    Zhang, Liwen
    Hu, Jianhua
    Fortin, Norbert J.
    Frostig, Ron D.
    Ombao, Hernando
    BIOMETRICS, 2021, 77 (03) : 890 - 902