K-DGHC: A hierarchical clustering method based on K-dominance granularity

被引:7
|
作者
Yu, Bin [1 ,2 ]
Zheng, Zijian [1 ,2 ]
Dai, Jianhua [1 ,2 ]
机构
[1] Hunan Normal Univ, Coll Informat Sci & Engn, Changsha 410081, Hunan, Peoples R China
[2] Hunan Normal Univ, Hunan Prov Key Lab Intelligent Comp & Language Inf, Changsha 410081, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Dominance granularity; K-dominance granularity; Shared neighborhood; Hierarchical clustering; K-DGHC; ALGORITHM; FIND;
D O I
10.1016/j.ins.2023.03.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing hierarchical clustering (HC) algorithms generally depend on the Euclidean characteristic metric (Euclidean distance, Manhattan distance, Chebyshev distance, etc.) on Euclidean space to describe the similarity between objects, which makes the clustering process oriented to data sets with uniform and regular distribution in Euclidean space. Although such methods can visually distinguish the cluster distribution of data, it is not effective for the data sets which are densely distributed, interlaced and complex in Euclidean space. As a scalable, efficient and robust method, granular computing generally analyzes data from the perspective of similarity and proximity. In consideration of the advantages of granular computing in extracting data information from a multi-level perspective, in order to reduce the limitations of HC methods based on Euclidean features on non-Euclidean data, this paper proposes a novel HC method based on non-Euclidean feature structure. First, this paper constructs the similarity between objects based on K-dominance granularity and neighborhood search, and considers the environmental information of data points from both global and local perspectives. Secondly, a new HC method based on non-Euclidean feature structure is designed on the basis of the similarity measurement constructed in this paper. Finally, through comparative analysis, the experimental results prove that our method can more accurately identify the densely distributed and interlaced data sets in Euclidean space; it is significantly better than comparison algorithms using different Euclidean features to measure similarity; it has good robustness when additional Gaussian noise is added.
引用
收藏
页码:232 / 251
页数:20
相关论文
共 50 条
  • [1] An improved hierarchical clustering method based on the k-NN and density peak clustering
    Shi, Zhicheng
    Guo, Renzhong
    Zhao, Zhigang
    TRANSACTIONS IN GIS, 2023, 27 (08) : 2197 - 2212
  • [2] K-centroid link: a novel hierarchical clustering linkage method
    Dogan, Alican
    Birant, Derya
    APPLIED INTELLIGENCE, 2022, 52 (05) : 5537 - 5560
  • [3] K-centroid link: a novel hierarchical clustering linkage method
    Alican Dogan
    Derya Birant
    Applied Intelligence, 2022, 52 : 5537 - 5560
  • [4] Questions clustering using canopy-K-means and hierarchical-K-means clustering
    Alian M.
    Al-Naymat G.
    International Journal of Information Technology, 2022, 14 (7) : 3793 - 3802
  • [5] A novel clustering algorithm based on hierarchical and K-means clusteringz
    Li Wenchao
    Zhou Yong
    Xia Shixiong
    PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 4, 2007, : 605 - +
  • [6] Hierarchical K-Means Clustering Algorithm Based on Silhouette and Entropy
    Dong, Wuzhou
    Ren, JiaDong
    Zhang, Dongmei
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2011, 7002 : 339 - +
  • [7] Bayesian hierarchical K-means clustering
    Liu, Yue
    Li, Bufang
    INTELLIGENT DATA ANALYSIS, 2020, 24 (05) : 977 - 992
  • [8] A hierarchical k-means clustering based fingerprint quality classification
    Munir, Muhammad Umer
    Javed, Muhammad Younus
    Khan, Shoab Ahmad
    NEUROCOMPUTING, 2012, 85 : 62 - 67
  • [9] An Empirical comparison of Clustering using Hierarchical methods and K-means
    Praveen, P.
    Rama, B.
    PROCEEDINGS OF THE 2016 IEEE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL & ELECTRONICS, INFORMATION, COMMUNICATION & BIO INFORMATICS (IEEE AEEICB-2016), 2016, : 445 - 449
  • [10] K-means based method for overlapping document clustering
    Beltran, Beatriz
    Vilarino, Darnes
    Martinez-Trinidad, Jose Fco.
    Carrasco-Ochoa, J. A.
    Pinto, David
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 2127 - 2135