A Variational Bayesian Framework for Clustering with Multiple Graphs

被引:17
作者
Shiga, Motoki [1 ]
Mamitsuka, Hiroshi [1 ]
机构
[1] Kyoto Univ, Bioinformat Ctr, Inst Chem Res, Uji, Kyoto 6110011, Japan
基金
日本科学技术振兴机构;
关键词
Clustering; graphs; statistical machine learning; variational Bayesian learning; localized clusters; STOCHASTIC BLOCKMODELS;
D O I
10.1109/TKDE.2010.272
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining patterns in graphs has become an important issue in real applications, such as bioinformatics and web mining. We address a graph clustering problem where a cluster is a set of densely connected nodes, under a practical setting that 1) the input is multiple graphs which share a set of nodes but have different edges and 2) a true cluster cannot be found in all given graphs. For this problem, we propose a probabilistic generative model and a robust learning scheme based on variational Bayesian estimation. A key feature of our probabilistic framework is that not only nodes but also given graphs can be clustered at the same time, allowing our model to capture clusters found in only part of all given graphs. We empirically evaluated the effectiveness of the proposed framework on not only a variety of synthetic graphs but also real gene networks, demonstrating that our proposed approach can improve the clustering performance of competing methods in both synthetic and real data.
引用
收藏
页码:577 / 589
页数:13
相关论文
共 26 条
[1]  
Airoldi EM, 2008, J MACH LEARN RES, V9, P1981
[2]  
[Anonymous], 2007, Proceedings of the International Conference on Machine Learning, DOI DOI 10.1145/1273496.1273642
[3]   Geometry, Flows, and Graph-Partitioning Algorithms [J].
Arora, Sanjeev ;
Rao, Satish ;
Vazirani, Umesh .
COMMUNICATIONS OF THE ACM, 2008, 51 (10) :96-105
[4]  
Attias H, 2000, ADV NEUR IN, V12, P209
[5]  
Bishop C.M., 2006, SPRINGER, P461
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]   Genetic and physical maps of Saccharomyces cerevisiae [J].
Cherry, JM ;
Ball, C ;
Weng, S ;
Juvik, G ;
Schmidt, R ;
Adler, C ;
Dunn, B ;
Dwight, S ;
Riles, L ;
Mortimer, RK ;
Botstein, D .
NATURE, 1997, 387 (6632) :67-73
[8]   Frequent pattern mining: current status and future directions [J].
Han, Jiawei ;
Cheng, Hong ;
Xin, Dong ;
Yan, Xifeng .
DATA MINING AND KNOWLEDGE DISCOVERY, 2007, 15 (01) :55-86
[9]   Transcriptional regulatory code of a eukaryotic genome [J].
Harbison, CT ;
Gordon, DB ;
Lee, TI ;
Rinaldi, NJ ;
Macisaac, KD ;
Danford, TW ;
Hannett, NM ;
Tagne, JB ;
Reynolds, DB ;
Yoo, J ;
Jennings, EG ;
Zeitlinger, J ;
Pokholok, DK ;
Kellis, M ;
Rolfe, PA ;
Takusagawa, KT ;
Lander, ES ;
Gifford, DK ;
Fraenkel, E ;
Young, RA .
NATURE, 2004, 431 (7004) :99-104
[10]   Bayesian approach to network modularity [J].
Hofman, Jake M. ;
Wiggins, Chris H. .
PHYSICAL REVIEW LETTERS, 2008, 100 (25)