Combination of links and node contents for community discovery using a graph regularization approach

被引:19
作者
Cao, Jinxin [1 ,2 ]
Wang, Hongcui [1 ,3 ]
Jin, Di [1 ]
Dang, Jianwu [1 ,4 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300350, Peoples R China
[2] Nantong Univ, Sch Comp Sci & Technol, Nantong 226019, Jangsu, Peoples R China
[3] Zhejiang Univ, Water Resources & Elect Power, Hangzhou 310018, Zhejiang, Peoples R China
[4] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Ishikawa 9231292, Japan
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2019年 / 91卷
基金
国家重点研发计划;
关键词
Community detection; Nonnegative matrix factorization; Node contents; Graph regularization; Node popularities;
D O I
10.1016/j.future.2018.08.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the rapid growth of the networked data, the study of community detection is drawing increasing attention of researchers. A number of algorithms have been proposed and some of them have been well applied in many research fields, such as recommendation systems, information retrieval, etc. Traditionally, the community detection methods mainly use the knowledge of the topological structure which contains the most important clue for finding potential groups or communities. However, as we know, a wealth of content information exists on the nodes in real-world networks, and may help for community detection. Considering the above problem, we introduce a novel community detection method under the framework of nonnegative matrix factorization (NMF), and adopt the idea that two nodes with similar content will be most likely to belong to the same community to achieve the incorporation of links and node contents, i.e., we employ a graph regularization to penalize the dissimilarity of nodes denoted by community memberships. Besides, we introduce an intuitive manifold learning strategy to recover the intrinsic geometrical structure of the content information, i.e., K-near neighbor consistency. In addition, we found that, there are still drawbacks in this framework due to it does not consider the heterogeneous distribution of node degrees. This heterogeneous distribution can affect the function of graph regularization and isolates the original community memberships. We first proposed the node popularities satisfying the above interpretation and develop a new NMF-based model, named as Combination of Links and Node Contents for Community Discovery (CLNCCD). The experiments on both artificial and real-world networks compared with the state-of-the-art methods show that, the new model obtains significant improvement for community detection by incorporating node contents effectively. (C) 2018 Published by Elsevier B.V.
引用
收藏
页码:361 / 370
页数:10
相关论文
共 28 条
[1]  
[Anonymous], 2013, P 22 INT C WORLD WID, DOI DOI 10.1145/2488388.2488483
[2]  
[Anonymous], 2012, Using content and interactions for discovering communities in social networks
[3]  
[Anonymous], 2012, LEARNING DISCOVER SO
[4]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[5]   Graph Regularized Nonnegative Matrix Factorization for Data Representation [J].
Cai, Deng ;
He, Xiaofei ;
Han, Jiawei ;
Huang, Thomas S. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) :1548-1560
[6]   Comparing community structure identification -: art. no. P09008 [J].
Danon, L ;
Díaz-Guilera, A ;
Duch, J ;
Arenas, A .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2005, :219-228
[7]   Magnetic eigenmaps for community detection in directed networks [J].
Fanuel, Michael ;
Alaiz, Carlos M. ;
Suykens, Johan A. K. .
PHYSICAL REVIEW E, 2017, 95 (02)
[8]  
Fengli Zhang, 2015, MultiMedia Modeling. 21st International Conference, MMM 2015, January 5-7, 2015, Proceedings: LNCS 8935, P418, DOI 10.1007/978-3-319-14445-0_36
[9]   Community structure in social and biological networks [J].
Girvan, M ;
Newman, MEJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (12) :7821-7826
[10]   K-Clique Community Detection in Social Networks Based on Formal Concept Analysis [J].
Hao, Fei ;
Min, Geyong ;
Pei, Zheng ;
Park, Doo-Soon ;
Yang, Laurence T. .
IEEE SYSTEMS JOURNAL, 2017, 11 (01) :250-259