Krylov Subspace Approximation for Local Community Detection in Large Networks

被引:25
作者
He, Kun [1 ]
Shi, Pan [1 ]
Bindel, David [2 ]
Hopcroft, John E. [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Hubei, Peoples R China
[2] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
基金
中国国家自然科学基金;
关键词
Local community detection; spectral clustering; Krylov subspace; Rayleigh quotient; sparse linear coding; HEAT KERNEL; MODULARITY; PAGERANK;
D O I
10.1145/3340708
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Community detection is an important information mining task to uncover modular structures in large networks. For increasingly common large network datasets, global community detection is prohibitively expensive, and attention has shifted to methods that mine local communities, i.e., identifying all latent members of a particular community from a few labeled seed members. To address such semi-supervised mining task, we systematically develop a local spectral (LOSP) subspace-based community detection method, called LOSP. We define a family of LOSP subspaces based on Krylov subspaces, and seek a sparse indicator for the target community via an l(1) norm minimization over the Krylov subspace. Variants of LOSP depend on type of random walks with different diffusion speeds, type of random walks, dimension of the LOSP subspace, and step of diffusions. The effectiveness of the proposed LOSP approach is theoretically analyzed based on Rayleigh quotients, and it is experimentally verified on a wide variety of real-world networks across social, production, and biological domains, as well as on an extensive set of synthetic LFR benchmark datasets.
引用
收藏
页数:30
相关论文
共 54 条
  • [1] Abbe E., 2017, J. Mach. Learn. Res., V18, P6446
  • [2] A Separability Framework for Analyzing Community Structure
    Abrahao, Bruno
    Soundarajan, Sucheta
    Hopcroft, John
    Kleinberg, Robert
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2014, 8 (01) : 101 - 129
  • [3] Link communities reveal multiscale complexity in networks
    Ahn, Yong-Yeol
    Bagrow, James P.
    Lehmann, Sune
    [J]. NATURE, 2010, 466 (7307) : 761 - U11
  • [4] Ali HafizTiomoko., 2017, The Journal of Machine Learning Research, V18, P8344
  • [5] Andersen R, 2006, ANN IEEE SYMP FOUND, P475
  • [6] Scalable and Efficient Flow-Based Community Detection for Large-Scale Graph Analysis
    Bae, Seung-Hee
    Halperin, Daniel
    West, Jevin D.
    Rosvall, Martin
    Howe, Bill
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2017, 11 (03)
  • [7] Overlapping community detection using superior seed set selection in social networks
    Belfin, R., V
    Kanaga, Grace Mary E.
    Brodka, Piotr
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2018, 70 : 1074 - 1083
  • [8] Fast unfolding of communities in large networks
    Blondel, Vincent D.
    Guillaume, Jean-Loup
    Lambiotte, Renaud
    Lefebvre, Etienne
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
  • [9] THE FACTORIZATION OF A SQUARE MATRIX INTO 2 SYMMETRICAL-MATRICES
    BOSCH, AJ
    [J]. AMERICAN MATHEMATICAL MONTHLY, 1986, 93 (06) : 462 - 464
  • [10] Weighted modularity optimization for crisp and fuzzy community detection in large-scale networks
    Cao, Jie
    Bu, Zhan
    Gao, Guangliang
    Tao, Haicheng
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2016, 462 : 386 - 395