Three-way clustering based on the graph of local density trend

被引:0
作者
Yang, Haifeng [1 ]
Wang, Weiqi [1 ]
Cai, Jianghui [1 ,2 ]
Wang, Jie [1 ]
Li, Yating [3 ]
Xun, Yaling [1 ]
Zhao, Xujun [1 ]
机构
[1] Taiyuan Univ Sci & Technol, Sch Comp Sci & Technol, Taiyuan 030024, Shanxi, Peoples R China
[2] North Univ China, Sch Comp Sci & Technol, Taiyuan 030051, Shanxi, Peoples R China
[3] Taiyuan Univ Sci & Technol, Sch Elect Informat Engn, Taiyuan 030024, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Three-way clustering; Three-way decision; The graph of local density trend; Isolation forest; FUZZY;
D O I
10.1016/j.ijar.2025.109422
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Three-way clustering demonstrates its unique advantages in dealing with the issues of information ambiguity and unclear boundaries present in real-world datasets. The core and boundary region in the data are identified as key features of cluster analysis. Typically, data is segmented into three regions based on a set of predetermined global thresholds, a common practice in three-way clustering. However, this method, which relies on global thresholds, often overlooks the intrinsic distribution patterns within the dataset and determining these thresholds a priori can be quite challenging. In this paper, we propose a three-way clustering method based on the graph of local density trend (3W-GLDT). Specifically, the algorithm first uses a density-decreasing strategy to build subgraphs and divide the core region data. Then, the unreasonable connection is corrected by using isolated forest, which increases the number of core points and enlarges the distribution range of core points. Next, a three-way allocation strategy is proposed, which fully considers the degree of local aggregation of subgraphs and the natural domain information of each data object to ensure the correct allocation. Finally, the proposed algorithm is compared with 8 different clustering methods on 8 synthetic datasets and 10 UCI real datasets. The experimental results show that the 3W-GLDT algorithm has good performance and clustering results.
引用
收藏
页数:24
相关论文
共 54 条
  • [1] Variance based three-way clustering approaches for handling overlapping clustering
    Afridi, Mohammad Khan
    Azam, Nouman
    Yao, JingTao
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 118 : 47 - 63
  • [2] A three-way clustering approach for handling missing data using GTRS
    Afridi, Mohammad Khan
    Azam, Nouman
    Yao, JingTao
    Alanazi, Eisa
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2018, 98 : 11 - 24
  • [3] A spatial filtering inspired three-way clustering approach with application to outlier detection
    Ali, Bahar
    Azam, Nouman
    Shah, Anwar
    Yao, JingTao
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2021, 130 : 1 - 21
  • [4] Border-Peeling Clustering
    Averbuch-Elor, Hadar
    Bar, Nadav
    Cohen-Or, Daniel
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (07) : 1791 - 1797
  • [5] FCM - THE FUZZY C-MEANS CLUSTERING-ALGORITHM
    BEZDEK, JC
    EHRLICH, R
    FULL, W
    [J]. COMPUTERS & GEOSCIENCES, 1984, 10 (2-3) : 191 - 203
  • [6] A novel graph-attention based multimodal fusion network for joint classification of hyperspectral image and LiDAR data
    Cai, Jianghui
    Zhang, Min
    Yang, Haifeng
    He, Yanting
    Yang, Yuqing
    Shi, Chenhui
    Zhao, Xujun
    Xun, Yaling
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [7] A review on semi-supervised clustering
    Cai, Jianghui
    Hao, Jing
    Yang, Haifeng
    Zhao, Xujun
    Yang, Yuqing
    [J]. INFORMATION SCIENCES, 2023, 632 : 164 - 200
  • [8] Further Results on Input-to-State Stability of Stochastic Cohen-Grossberg BAM Neural Networks with Probabilistic Time-Varying Delays
    Chandrasekar, A.
    Radhika, T.
    Zhu, Quanxin
    [J]. NEURAL PROCESSING LETTERS, 2022, 54 (01) : 613 - 635
  • [9] An axiomatic framework for three-way clustering
    Chen, Yingxiao
    Zhu, Ping
    Yao, Yiyu
    [J]. INFORMATION SCIENCES, 2024, 675
  • [10] Demsar J, 2006, J MACH LEARN RES, V7, P1