Generalization of clustering agreements and distances for overlapping clusters and network communities

被引:9
作者
Rabbany, Reihaneh [1 ]
Zaiane, Osmar R. [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
关键词
Clustering agreement; Cluster evaluation; Cluster validation; Network clusters; Community detection; Overlapping clusters;
D O I
10.1007/s10618-015-0426-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A measure of distance between two clusterings has important applications, including clustering validation and ensemble clustering. Generally, such distance measure provides navigation through the space of possible clusterings. Mostly used in cluster validation, a normalized clustering distance, a.k.a. agreement measure, compares a given clustering result against the ground-truth clustering. The two widely-used clustering agreement measures are adjusted rand index and normalized mutual information. In this paper, we present a generalized clustering distance from which these two measures can be derived. We then use this generalization to construct new measures specific for comparing (dis)agreement of clusterings in networks, a.k.a. communities. Further, we discuss the difficulty of extending the current, contingency based, formulations to overlapping cases, and present an alternative algebraic formulation for these (dis)agreement measures. Unlike the original measures, the new co-membership based formulation is easily extendable for different cases, including overlapping clusters and clusters of inter-related data. These two extensions are, in particular, important in the context of finding communities in complex networks.
引用
收藏
页码:1458 / 1485
页数:28
相关论文
共 27 条
  • [21] Identification of the clusters of social network communities for users with a specific characteristic
    Vakhitov, Galim
    Enikeeva, Zulfira
    Yangirova, Nadiya
    Shavalieva, Adel
    Ustin, Pavel
    12TH INTERNATIONAL CONFERENCE ON THE DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2019), 2019, : 140 - 146
  • [22] Detecting network communities using regularized spectral clustering algorithm
    Liang Huang
    Ruixuan Li
    Hong Chen
    Xiwu Gu
    Kunmei Wen
    Yuhua Li
    Artificial Intelligence Review, 2014, 41 : 579 - 594
  • [23] Impact of Similarity Measure on the Quality of Communities Detected in Social Network by Hierarchical Clustering
    Szyman, Pawel
    Barbucha, Dariusz
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 13501 : 29 - 42
  • [24] RETRACTED: Extract the network communities based on fuzzy clustering theory (Retracted Article)
    Lin, Zhenzhou
    Li, Huijia
    MODERN PHYSICS LETTERS B, 2021, 35 (18):
  • [25] Discovering Communities with SGNS Modelling-based Network connections and Text communications Clustering
    Mohotti, Wathsala Anupama
    Nayak, Richi
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1770 - 1777
  • [26] Dynamic clustering-based consensus model for large-scale group decision-making considering overlapping communities
    Hua, Zhen
    Gou, Xiangjie
    Martinez, Luis
    INFORMATION FUSION, 2025, 115
  • [27] Integrating Fuzzy Clustering and Graph Convolution Network to Accurately Identify Clusters From Attributed Graph
    Yang, Yue
    Li, Guodong
    Li, Dongxu
    Zhang, Jun
    Hu, Pengwei
    Hu, Lun
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2025, 12 (02): : 1112 - 1125