Kernel Spectral Clustering for Big Data Networks

被引:49
作者
Mall, Raghvendra [1 ]
Langone, Rocco [1 ]
Suykens, Johan A. K. [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT SCD SISTA, B-3001 Louvain, Belgium
关键词
kernel spectral clustering; out-of-sample extensions; sampling graphs; angular similarity; COMMUNITY STRUCTURE;
D O I
10.3390/e15051567
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
This paper shows the feasibility of utilizing the Kernel Spectral Clustering (KSC) method for the purpose of community detection in big data networks. KSC employs a primal-dual framework to construct a model. It results in a powerful property of effectively inferring the community affiliation for out-of-sample extensions. The original large kernel matrix cannot fitinto memory. Therefore, we select a smaller subgraph that preserves the overall community structure to construct the model. It makes use of the out-of-sample extension property for community membership of the unseen nodes. We provide a novel memory-and computationally efficient model selection procedure based on angular similarity in the eigenspace. We demonstrate the effectiveness of KSC on large scale synthetic networks and real world networks like the YouTube network, a road network of California and the Livejournal network. These networks contain millions of nodes and several million edges.
引用
收藏
页码:1567 / 1586
页数:20
相关论文
共 25 条
  • [1] Multiway Spectral Clustering with Out-of-Sample Extensions through Weighted Kernel PCA
    Alzate, Carlos
    Suykens, Johan A. K.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (02) : 335 - 347
  • [2] [Anonymous], 2010, Proceedings of the 19th International Conference on World Wide Web, DOI DOI 10.1145/1772690.1772762
  • [3] [Anonymous], 2005, Advances in Neural Information Processing Systems
  • [4] [Anonymous], 2006, KDD
  • [5] [Anonymous], 1997, SPECTRAL GRAPH THEOR
  • [6] Baylis J., 1988, ERROR CORRECTING COD
  • [7] Fast unfolding of communities in large networks
    Blondel, Vincent D.
    Guillaume, Jean-Loup
    Lambiotte, Renaud
    Lefebvre, Etienne
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
  • [8] Clauset A, 2004, PHYS REV E, V70, DOI 10.1103/PhysRevE.70.066111
  • [9] Danaon L., 2005, J STAT MECH-THEORY E, V9
  • [10] Community detection in graphs
    Fortunato, Santo
    [J]. PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2010, 486 (3-5): : 75 - 174