Active Semi-Supervised Clustering Algorithm for Multi-Density Datasets

被引:0
作者
Atwa, Walid [1 ]
Almazroi, Abdulwahab Ali [1 ]
Aldhahr, Eman A. [2 ]
Janbi, Nourah Fahad [1 ]
机构
[1] Univ Jeddah, Coll Comp & Informat Technol Khulais, Dept Informat Technol, Jeddah, Saudi Arabia
[2] Univ Jeddah, Dept Comp Sci & Artificial Intelligence, Coll Comp Sci & Engn, Jeddah, Saudi Arabia
关键词
Semi-supervised clustering; pairwise constraints; multi-density data; active learning; CLASSIFICATION; DBSCAN;
D O I
10.14569/IJACSA.2024.0151052
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Semi-supervised clustering with pairwise constraints has been a hot topic among researchers and experts. However, the problem becomes quite difficult to manage using random constraints for clustering data when the clusters have different shapes, densities, and sizes. This research proposes an active semi-supervised density-based clustering algorithm, termed "ASS-DBSCAN," designed specifically for clustering multi-density data. By integrating active learning and semi- supervised techniques, ASS-DBSCAN enhances traditional clustering methods, allowing it to handle complex data distributions with varying densities more effectively. This research provides two major contributions. The first contribution of this research is to analyze how to link constraints (including that must be linked and ones that should not be linked) that will be utilized by the clustering algorithm. The second contribution made by this research is the ability to add multiple density levels to the dataset. We perform experiments over real datasets. The ASS-DBSCAN algorithm was evaluated against existing state-of-the-art system for various evaluation metrics in which it performed remarkably well.
引用
收藏
页码:493 / 500
页数:8
相关论文
共 50 条
  • [21] Active Semi-Supervised Classification based on Multiple Clustering Hierarchies
    Batista, Antonio J. L.
    Campello, Ricardo J. G. B.
    Sander, Jorg
    PROCEEDINGS OF 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, (DSAA 2016), 2016, : 11 - 20
  • [22] A batch-mode active learning SVM method based on semi-supervised clustering
    Fu, Chun-Jiang
    Yang, Yu-Pu
    INTELLIGENT DATA ANALYSIS, 2015, 19 (02) : 345 - 358
  • [23] A semi-supervised clustering algorithm for network intrusion detection
    Wei X.-T.
    Huang H.-K.
    Tian S.-F.
    Tiedao Xuebao/Journal of the China Railway Society, 2010, 32 (01): : 49 - 53
  • [24] Semi-supervised Clustering Framework Based on Active Learning for Real Data
    Odate, Ryosuke
    Shinjo, Hiroshi
    Suzuki, Yasufumi
    Motobayashi, Masahiro
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2018, 2018, 11004 : 184 - 193
  • [25] Research Progress on Semi-Supervised Clustering
    Yue Qin
    Shifei Ding
    Lijuan Wang
    Yanru Wang
    Cognitive Computation, 2019, 11 : 599 - 612
  • [26] An Efficient Semi-Supervised Clustering Algorithm with Sequential Constraints
    Yi, Jinfeng
    Zhang, Lijun
    Yang, Tianbao
    Liu, Wei
    Wang, Jun
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1405 - 1414
  • [27] Active learning of pair-wise constraints in semi-supervised clustering
    Jiang, Weijin, 1600, Editorial Board of Journal of Basic Science and (22): : 1248 - 1261
  • [28] A semi-supervised fuzzy clustering algorithm applied to gene expression data
    Maraziotis, Ioannis A.
    PATTERN RECOGNITION, 2012, 45 (01) : 637 - 648
  • [29] Active Semi-supervised Affinity Propagation Clustering Algorithm based on Local Outlier Factor
    Qi, Lei
    Ting, Li
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9368 - 9373
  • [30] Research of semi-supervised spectral clustering algorithm based on pairwise constraints
    Shifei Ding
    Hongjie Jia
    Liwen Zhang
    Fengxiang Jin
    Neural Computing and Applications, 2014, 24 : 211 - 219