Three-way clustering based on the graph of local density trend

被引:0
作者
Yang, Haifeng [1 ]
Wang, Weiqi [1 ]
Cai, Jianghui [1 ,2 ]
Wang, Jie [1 ]
Li, Yating [3 ]
Xun, Yaling [1 ]
Zhao, Xujun [1 ]
机构
[1] Taiyuan Univ Sci & Technol, Sch Comp Sci & Technol, Taiyuan 030024, Shanxi, Peoples R China
[2] North Univ China, Sch Comp Sci & Technol, Taiyuan 030051, Shanxi, Peoples R China
[3] Taiyuan Univ Sci & Technol, Sch Elect Informat Engn, Taiyuan 030024, Shanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Three-way clustering; Three-way decision; The graph of local density trend; Isolation forest; FUZZY;
D O I
10.1016/j.ijar.2025.109422
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Three-way clustering demonstrates its unique advantages in dealing with the issues of information ambiguity and unclear boundaries present in real-world datasets. The core and boundary region in the data are identified as key features of cluster analysis. Typically, data is segmented into three regions based on a set of predetermined global thresholds, a common practice in three-way clustering. However, this method, which relies on global thresholds, often overlooks the intrinsic distribution patterns within the dataset and determining these thresholds a priori can be quite challenging. In this paper, we propose a three-way clustering method based on the graph of local density trend (3W-GLDT). Specifically, the algorithm first uses a density-decreasing strategy to build subgraphs and divide the core region data. Then, the unreasonable connection is corrected by using isolated forest, which increases the number of core points and enlarges the distribution range of core points. Next, a three-way allocation strategy is proposed, which fully considers the degree of local aggregation of subgraphs and the natural domain information of each data object to ensure the correct allocation. Finally, the proposed algorithm is compared with 8 different clustering methods on 8 synthetic datasets and 10 UCI real datasets. The experimental results show that the 3W-GLDT algorithm has good performance and clustering results.
引用
收藏
页数:24
相关论文
共 54 条
  • [21] Three-way evidence theory-based density peak clustering with the principle of justifiable granularity
    Ju, Hengrong
    Lu, Yang
    Ding, Weiping
    Cao, Jinxin
    Yang, Xibei
    [J]. APPLIED SOFT COMPUTING, 2024, 152
  • [22] Interval set clustering of web users with rough K-means
    Lingras, P
    West, C
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2004, 23 (01) : 5 - 16
  • [23] Rough clustering
    Lingras, Pawan
    Peters, Georg
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (01) : 64 - 72
  • [24] Anomaly Detection With Kernel Preserving Embedding
    Liu, Huawen
    Li, Enhui
    Liu, Xinwang
    Su, Kaile
    Zhang, Shichao
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (05)
  • [25] Semi-supervised concept factorization for document clustering
    Lu, Mei
    Zhao, Xiang-Jun
    Zhang, Li
    Li, Fan-Zhang
    [J]. INFORMATION SCIENCES, 2016, 331 : 86 - 98
  • [26] Mark Berg M.K., 2008, Computational Geometry: Algorithms and Applications, P95
  • [27] Comprehensive survey on hierarchical clustering algorithms and the recent developments
    Ran, Xingcheng
    Xi, Yue
    Lu, Yonggang
    Wang, Xiangwen
    Lu, Zhenyu
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (08) : 8219 - 8264
  • [28] Shah A, 2022, APPL INTELL, V52, P18131, DOI [10.1007/978-981-16-5547-0_1, 10.1007/s10489-021-03072-0]
  • [29] A three-way clustering approach for novelty detection
    Shah, Anwar
    Azam, Nouman
    Ali, Bahar
    Khan, Muhammad Taimoor
    Yao, JingTao
    [J]. INFORMATION SCIENCES, 2021, 569 : 650 - 668
  • [30] Three-Way Spectral Clustering
    Shi, Hong
    Liu, Qiang
    Wang, Pingxin
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2018), 2018, 11177 : 389 - 398