Active Semi-Supervised Clustering Algorithm for Multi-Density Datasets

被引:0
作者
Atwa, Walid [1 ]
Almazroi, Abdulwahab Ali [1 ]
Aldhahr, Eman A. [2 ]
Janbi, Nourah Fahad [1 ]
机构
[1] Univ Jeddah, Coll Comp & Informat Technol Khulais, Dept Informat Technol, Jeddah, Saudi Arabia
[2] Univ Jeddah, Dept Comp Sci & Artificial Intelligence, Coll Comp Sci & Engn, Jeddah, Saudi Arabia
关键词
Semi-supervised clustering; pairwise constraints; multi-density data; active learning; CLASSIFICATION; DBSCAN;
D O I
10.14569/IJACSA.2024.0151052
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Semi-supervised clustering with pairwise constraints has been a hot topic among researchers and experts. However, the problem becomes quite difficult to manage using random constraints for clustering data when the clusters have different shapes, densities, and sizes. This research proposes an active semi-supervised density-based clustering algorithm, termed "ASS-DBSCAN," designed specifically for clustering multi-density data. By integrating active learning and semi- supervised techniques, ASS-DBSCAN enhances traditional clustering methods, allowing it to handle complex data distributions with varying densities more effectively. This research provides two major contributions. The first contribution of this research is to analyze how to link constraints (including that must be linked and ones that should not be linked) that will be utilized by the clustering algorithm. The second contribution made by this research is the ability to add multiple density levels to the dataset. We perform experiments over real datasets. The ASS-DBSCAN algorithm was evaluated against existing state-of-the-art system for various evaluation metrics in which it performed remarkably well.
引用
收藏
页码:493 / 500
页数:8
相关论文
共 50 条
  • [31] Research of semi-supervised spectral clustering algorithm based on pairwise constraints
    Shifei Ding
    Hongjie Jia
    Liwen Zhang
    Fengxiang Jin
    [J]. Neural Computing and Applications, 2014, 24 : 211 - 219
  • [32] Active Semi-supervised Affinity Propagation Clustering Algorithm based on Local Outlier Factor
    Qi, Lei
    Ting, Li
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9368 - 9373
  • [33] Constraint-based Clustering Algorithm for Multi-Density Data and Arbitrary Shapes
    Atwa, Walid
    Li, Kan
    [J]. ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, ICDM 2017, 2017, 10357 : 78 - 92
  • [34] Active Learning for Semi-Supervised K-Means Clustering
    Vu, Viet-Vu
    Labroche, Nicolas
    Bouchon-Meunier, Bernadette
    [J]. 22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 1, 2010,
  • [35] SDenPeak: Semi-Supervised Nonlinear Clustering based on Density and Distance
    Fan, Wen-Qi
    Wang, Chang-Dong
    Lai, Jian-Huang
    [J]. PROCEEDINGS 2016 IEEE SECOND INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2016), 2016, : 269 - 275
  • [36] Active Learning of Instance-level Constraints for Semi-supervised Document Clustering
    Zhao, Weizhong
    He, Qing
    Ma, Huifang
    Shi, Zhongzhi
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2009, : 264 - 268
  • [37] Active learning for semi-supervised clustering based on locally linear propagation reconstruction
    Chang, Chin-Chun
    Lin, Po-Yi
    [J]. NEURAL NETWORKS, 2015, 63 : 170 - 184
  • [38] The Recommendation System Based on Semi-Supervised PSO Clustering Algorithm
    Zhou Wen Min
    Pan Xiu Qin
    Li Rui Xiang
    Lu Yong
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL FORUM ON MECHANICAL, CONTROL AND AUTOMATION (IFMCA 2016), 2017, 113 : 63 - 71
  • [39] Improved Semi-supervised Clustering Algorithm Based on Affinity Propagation
    金冉
    刘瑞娟
    李晔锋
    寇春海
    [J]. JournalofDonghuaUniversity(EnglishEdition), 2015, 32 (01) : 125 - 131
  • [40] Semi-Supervised Kernel Clustering Algorithm based on Seed Set
    Li, Kunlun
    Zhang, Chao
    Cao, Zheng
    [J]. 2009 ASIA-PACIFIC CONFERENCE ON INFORMATION PROCESSING (APCIP 2009), VOL 1, PROCEEDINGS, 2009, : 169 - 172