Generalization of clustering agreements and distances for overlapping clusters and network communities

Cited by: 9
Authors
Rabbany, Reihaneh [1]
Zaiane, Osmar R. [1]
Affiliations
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
Keywords
Clustering agreement; Cluster evaluation; Cluster validation; Network clusters; Community detection; Overlapping clusters
DOI
10.1007/s10618-015-0426-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
A measure of distance between two clusterings has important applications, including clustering validation and ensemble clustering. In general, such a distance measure provides a way to navigate the space of possible clusterings. A normalized clustering distance, also known as an agreement measure, is mostly used in cluster validation, where it compares a given clustering result against the ground-truth clustering. The two most widely used clustering agreement measures are the adjusted Rand index and normalized mutual information. In this paper, we present a generalized clustering distance from which these two measures can be derived. We then use this generalization to construct new measures specific to comparing the (dis)agreement of clusterings in networks, also known as communities. Further, we discuss the difficulty of extending the current contingency-based formulations to overlapping cases, and present an alternative algebraic formulation for these (dis)agreement measures. Unlike the original measures, the new co-membership-based formulation is easily extended to different cases, including overlapping clusters and clusters of inter-related data. These two extensions are particularly important in the context of finding communities in complex networks.
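For readers unfamiliar with the two agreement measures named in the abstract, the following is a minimal sketch of their standard contingency-based definitions for disjoint (non-overlapping) clusterings. It is not the generalized distance or the co-membership formulation proposed in the paper. The function names are hypothetical, and the NMI normalization by the geometric mean of the two entropies is an assumption, since several normalizations are in common use.

```python
from collections import Counter
from math import comb, log, sqrt


def contingency(u, v):
    """Count the items shared by each (cluster in u, cluster in v) pair."""
    return Counter(zip(u, v))


def adjusted_rand_index(u, v):
    """Standard ARI for two disjoint labelings of the same n >= 2 items."""
    n = len(u)
    pairs_joint = sum(comb(c, 2) for c in contingency(u, v).values())
    pairs_u = sum(comb(c, 2) for c in Counter(u).values())
    pairs_v = sum(comb(c, 2) for c in Counter(v).values())
    expected = pairs_u * pairs_v / comb(n, 2)   # chance-level agreement
    max_index = (pairs_u + pairs_v) / 2
    if max_index == expected:                   # degenerate: trivial clusterings
        return 1.0
    return (pairs_joint - expected) / (max_index - expected)


def entropy(labels):
    """Shannon entropy (natural log) of the cluster-size distribution."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def normalized_mutual_information(u, v):
    """NMI, normalized by sqrt(H(u) * H(v)) (an assumed convention)."""
    n = len(u)
    size_u, size_v = Counter(u), Counter(v)
    mi = sum(
        (c / n) * log(c * n / (size_u[a] * size_v[b]))
        for (a, b), c in contingency(u, v).items()
    )
    h_u, h_v = entropy(u), entropy(v)
    if h_u == 0 or h_v == 0:                    # one side is a single cluster
        return 1.0 if h_u == h_v else 0.0
    return mi / sqrt(h_u * h_v)


# Same partition under renamed labels: both measures report full agreement.
same = ([0, 0, 1, 1], [1, 1, 0, 0])
# Two partitions that cut the items in "orthogonal" ways.
crossed = ([0, 0, 1, 1], [0, 1, 0, 1])
print(adjusted_rand_index(*same), normalized_mutual_information(*same))
print(adjusted_rand_index(*crossed), normalized_mutual_information(*crossed))
```

Both functions read only the contingency counts, which presuppose that each item belongs to exactly one cluster per clustering. This is the restriction the abstract refers to when it discusses the difficulty of extending contingency-based formulations to overlapping clusters.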
Pages: 1458-1485
Number of pages: 28
Related papers
27 in total
  • [1] Generalization of clustering agreements and distances for overlapping clusters and network communities
    Reihaneh Rabbany
    Osmar R. Zaïane
    Data Mining and Knowledge Discovery, 2015, 29 : 1458 - 1485
  • [2] Optimal Fuzzy Clustering in Overlapping Clusters
    Ammor, Ouafa
    Lachkar, Abdelmonaime
    Slaoui, Khadija
    Rais, Noureddine
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2008, 5 (04) : 402 - 408
  • [3] Overlapping communities detection of social network based on hybrid C-means clustering algorithm
    Lei, Yu
    Zhou, Ying
    Shi, Jiao
    SUSTAINABLE CITIES AND SOCIETY, 2019, 47
  • [4] Detecting overlapping and hierarchical communities in complex network using interaction-based edge clustering
    Kim, Paul
    Kim, Sangwook
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2015, 417 : 46 - 56
  • [5] A spectral algorithm with additive clustering for the recovery of overlapping communities in networks
    Kaufmann, Emilie
    Bonald, Thomas
    Lelarge, Marc
    THEORETICAL COMPUTER SCIENCE, 2018, 742 : 3 - 26
  • [6] Finding overlapping communities based on Markov chain and link clustering
    Deng, Xiaoheng
    Li, Genghao
    Dong, Mianxiong
    Ota, Kaoru
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2017, 10 (02) : 411 - 420
  • [7] Finding overlapping communities based on Markov chain and link clustering
    Xiaoheng Deng
    Genghao Li
    Mianxiong Dong
    Kaoru Ota
    Peer-to-Peer Networking and Applications, 2017, 10 : 411 - 420
  • [8] A benchmarking tool for the generation of bipartite network models with overlapping communities
    Valejo, Alan
    Goes, Fabiana
    Romanetto, Luzia
    Ferreira de Oliveira, Maria Cristina
    Lopes, Alneu de Andrade
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (04) : 1641 - 1669
  • [9] A benchmarking tool for the generation of bipartite network models with overlapping communities
    Alan Valejo
    Fabiana Góes
    Luzia Romanetto
    Maria Cristina Ferreira de Oliveira
    Alneu de Andrade Lopes
    Knowledge and Information Systems, 2020, 62 : 1641 - 1669
  • [10] Detecting Hierarchical and Overlapping Network Communities Based on Opinion Dynamics
    Ren, Ren
    Shao, Jinliang
    Cheng, Yuhua
    Wang, Xiaofan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2696 - 2710