Spectral Clustering in Social Networks

被引:8
作者
Kurucz, Miklos [1 ]
Benczur, Andras A. [1 ]
Csalogany, Karoly [1 ]
Lukacs, Laszlo [1 ]
机构
[1] Hungarian Acad Sci, Comp & Automat Res Inst, Informat Lab, Data Min & Web Search Res Grp, H-1051 Budapest, Hungary
来源
ADVANCES IN WEB MINING AND WEB USAGE ANALYSIS | 2009年 / 5439卷
关键词
spectral clustering; telephone call graph; social network mining; Web graph; RANDOM GRAPHS; ALGORITHM;
D O I
10.1145/1348549.1348559
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We evaluate various heuristics for hierarchical spectral clustering in large telephone call and Web graphs. Spectral clustering without additional heuristics often produces very uneven cluster sizes or low quality clusters that may consist of several disconnected components, a fact that appears to be common for several data sources but, to our knowledge, no general solution provided so far. Divide-and-Merge, a recently described postfiltering procedure may be used to eliminate bad quality branches in a binary tree hierarchy. We propose in alternate solution that enables k-way cuts in each step by immediately filtering unbalanced or low quality clusters before splitting them further. Our experiments are performed on graphs with various weight and normalization built based on call detail records and Web crawls. We measure clustering quality both by modularity as well as by the geographic and topical homogeneity of the clusters. Compared to divide-and-merge, we give more homogeneous clusters with a more desirable distribution of the cluster sizes.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 47 条
[1]  
Aiello W., 2000, Proceedings of the Thirty Second Annual ACM Symposium on Theory of Computing, P171, DOI 10.1145/335305.335326
[2]  
ALPERT CJ, 1995, DES AUT CON, P195
[3]   MULTIWAY PARTITIONING VIA GEOMETRIC EMBEDDINGS, ORDERINGS, AND DYNAMIC-PROGRAMMING [J].
ALPERT, CJ ;
KAHNG, AB .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1995, 14 (11) :1342-1358
[4]   RECENT DIRECTIONS IN NETLIST PARTITIONING - A SURVEY [J].
ALPERT, CJ ;
KAHNG, AB .
INTEGRATION-THE VLSI JOURNAL, 1995, 19 (1-2) :1-81
[5]  
[Anonymous], CZECHOSLOVAK MATH J
[6]  
[Anonymous], MITLCSTR906
[7]   A novel evolutionary data mining algorithm with applications to churn prediction [J].
Au, WH ;
Chan, KCC ;
Yao, X .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2003, 7 (06) :532-545
[8]   AN ALGORITHM FOR PARTITIONING THE NODES OF A GRAPH [J].
BARNES, ER .
SIAM JOURNAL ON ALGEBRAIC AND DISCRETE METHODS, 1982, 3 (04) :541-550
[9]  
BENCZUR AA, 2006, NSF US HUNG WORKSH L
[10]  
Berry M.W.:., 1992, SVDPACK: A Fortran-77 software library for the sparse singular value decomposition