ClusterMPP: An unsupervised density-based clustering algorithm via Marked Point Process

被引:2
作者
Henni, Khadidja [1 ]
Alata, Olivier [2 ]
Zaoui, Lynda [1 ]
Vannier, Brigitte [3 ]
El Idrissi, Abdellatif [4 ]
Moussa, Ahmed [5 ]
机构
[1] Univ Sci & Technol, Dept Comp Sci, LSSD Lab, Oran, Algeria
[2] Jean Monnet Univ, CNRS, UMR 5516, Hubert Curien Lab, St Etienne, France
[3] Poitiers Univ, Receptors Regulat & Tumor Cells Lab, Poitiers, France
[4] Abdelmalek Essaadi Univ, ENSA Tangier, Tangier, Morocco
[5] Abdelmalek Essaadi Univ, ENSA Tangier, Syst & Data Engn Team, Tangier, Morocco
关键词
Unsupervised learning; density-based clustering; mode detection; Marked Point Process; non-parametric; multidimensional data; overlapping clusters; BIG DATA; EXTRACTION; MODEL;
D O I
10.3233/IDA-160010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional clustering algorithms optimize a single criterion, which may not conform to diverse needs of multidimensional data science. This paper proposes a new clustering algorithm that solves multiple clustering issues, called clustering by Marked Point Process (ClusterMPP). It is a new, efficient, scalable and unsupervised density-based clustering algorithm. ClusterMPP simulates a proposed Marked Point Process (MPP) to find clusters of complex shapes present in the raw data space. The outputs of this new algorithm, in the first step, are the observations belonging to each cluster mode called "prototypes". The classification process is achieved, in the second step, using an improved KNN algorithm. We conduct intensive experiments to compare ClusterMPP with the most well-known algorithms. The results of ClusterMPP proved its efficiency on high complex and scalable datasets.
引用
收藏
页码:827 / 847
页数:21
相关论文
共 46 条
  • [21] The K-Means-Type Algorithms Versus Imbalanced Data Distributions
    Liang, Jiye
    Bai, Liang
    Dang, Chuangyin
    Cao, Fuyuan
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (04) : 728 - 745
  • [22] A novel and quick SVM-based multi-class classifier
    Liu, Yiguang
    You, Zhisheng
    Cao, Liping
    [J]. PATTERN RECOGNITION, 2006, 39 (11) : 2258 - 2264
  • [23] An efficient and scalable density-based Clustering algorithm for datasets with complex structures
    Lv, Yinghua
    Ma, Tinghuai
    Tang, Meili
    Cao, Jie
    Tian, Yuan
    Al-Dhelaan, Abdullah
    Al-Rodhaan, Mznah
    [J]. NEUROCOMPUTING, 2016, 171 : 9 - 22
  • [24] A Markov random field model for mode detection in cluster analysis
    Moussa, Ahmed
    Sbihi, Abderrahmane
    Postaire, Jack-Gerard
    [J]. PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1197 - 1207
  • [25] Mouysset S., 2013, STUDIES CLASSIFICATI, V4, P153
  • [26] Ntoutsi I., 2012, SIAM SDM
  • [27] Models for association rules based on clustering and correlation
    Ordonez, Carlos
    [J]. INTELLIGENT DATA ANALYSIS, 2009, 13 (02) : 337 - 358
  • [28] Parimala LopezSenthilkumar., 2011, INT J ADV SCI TECHNO, V31, P59
  • [29] A simple and fast algorithm for K-medoids clustering
    Park, Hae-Sang
    Jun, Chi-Hyuck
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3336 - 3341
  • [30] AN APPROXIMATE SOLUTION TO NORMAL MIXTURE IDENTIFICATION WITH APPLICATION TO UNSUPERVISED PATTERN-CLASSIFICATION
    POSTAIRE, JG
    VASSEUR, CPA
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1981, 3 (02) : 163 - 179