A survey of clustering large probabilistic graphs: Techniques, evaluations, and applications

Cited by: 1
Authors
Danesh, Malihe [1, 2]
Dorrigiv, Morteza [2 ]
Yaghmaee, Farzin [2 ]
Affiliations
[1] Univ Sci & Technol Mazandaran, Dept Comp Engn, Behshahr, Iran
[2] Semnan Univ, Fac Elect & Comp Engn, Semnan, Iran
Keywords
clustering; possible worlds-based methods; probabilistic graph; threshold-based methods; EFFICIENT; ALGORITHMS
DOI
10.1111/exsy.13248
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Given the growth of uncertainty in the real world, analysing probabilistic graphs is crucial. Clustering is one of the most fundamental methods for mining probabilistic graphs and discovering the hidden patterns in them. This survey presents an extensive, organized analysis of the techniques proposed in the literature for clustering large probabilistic graphs. First, probabilistic graphs are defined and their modelling is introduced. Second, the clustering of such graphs is described along with its challenges, such as the uncertainty of edges, high dimensionality, and the impossibility of applying certain graph clustering techniques directly. Then, a taxonomy of clustering approaches is discussed in two main categories: threshold-based and possible worlds-based methods. The techniques in each category are explained and examined. These methods are then evaluated on real datasets, and their performance is compared. Finally, the survey concludes by describing some applications of probabilistic graph clustering and future research directions.
Pages: 20