Effective and Efficient Community Search over Large Heterogeneous Information Networks

Cited by: 107
Authors
Fang, Yixiang [1]
Yang, Yixing [1]
Zhang, Wenjie [1]
Lin, Xuemin [1]
Cao, Xin [1]
Affiliation
[1] Univ New South Wales, Sydney, NSW, Australia
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2020 / Vol. 13 / No. 6
Keywords
DOI
10.14778/3380750.3380756
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Community search (CS) has recently attracted considerable attention. Given a query vertex, CS looks for a dense subgraph that contains it. Existing studies mainly focus on homogeneous graphs, in which all vertices are of the same type, and cannot be directly applied to heterogeneous information networks (HINs), which consist of multi-typed, interconnected objects, such as bibliographic networks and knowledge graphs. In this paper, we study the problem of community search over large HINs; that is, given a query vertex q, find a community in an HIN that contains q and in which all vertices have the same type as q and are closely related. To model the relationship between two vertices of the same type, we adopt the well-known concept of a meta-path, which is a sequence of relations defined between different types of vertices. We then measure the cohesiveness of the community by extending the classic minimum degree metric with a meta-path. We further propose efficient query algorithms for finding communities using these cohesiveness metrics. We have performed extensive experiments on five large real-world HINs, and the results show that the proposed solutions are effective for searching communities. Moreover, they are much faster than the baseline solutions.
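
The idea of combining a meta-path with a minimum degree requirement can be illustrated with a small Python sketch. This is a naive illustration under stated assumptions, not the algorithms proposed in the paper: it assumes an undirected HIN, a symmetric meta-path given as a list of vertex types (for example author-paper-author), and a reading of "minimum degree" in which every community member must have at least k neighbors reachable via the meta-path. The names `build_adj`, `path_neighbors`, and `community_search` are hypothetical.

```python
from collections import deque


def build_adj(edges):
    """Build an undirected adjacency map from a list of (u, v) edges."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def path_neighbors(adj, vtype, meta_path, v):
    """Vertices other than v reached by a walk from v whose vertex types
    follow meta_path (assumption: meta_path is a list of type labels)."""
    if vtype[v] != meta_path[0]:
        return set()
    frontier = {v}
    for t in meta_path[1:]:
        frontier = {u for w in frontier for u in adj[w] if vtype[u] == t}
    frontier.discard(v)
    return frontier


def community_search(adj, vtype, meta_path, q, k):
    """Return the vertices of q's type that are connected to q through
    meta-path neighbors, where each kept vertex has at least k such
    neighbors inside the result (naive peeling, assumed symmetric path)."""
    targets = [v for v in adj if vtype[v] == vtype[q]]
    nbrs = {v: path_neighbors(adj, vtype, meta_path, v) for v in targets}

    # Repeatedly remove vertices with fewer than k meta-path neighbors.
    queue = deque(v for v in targets if len(nbrs[v]) < k)
    removed = set(queue)
    while queue:
        v = queue.popleft()
        for u in nbrs[v]:
            if u in removed:
                continue
            nbrs[u].discard(v)
            if len(nbrs[u]) < k:
                removed.add(u)
                queue.append(u)

    if q in removed:
        return set()

    # Collect the surviving vertices connected to q.
    community = {q}
    queue = deque([q])
    while queue:
        v = queue.popleft()
        for u in nbrs[v]:
            if u not in removed and u not in community:
                community.add(u)
                queue.append(u)
    return community


# Toy bibliographic network: authors a1..a4, papers p1..p4 (made-up data).
edges = [("a1", "p1"), ("a2", "p1"), ("a2", "p2"), ("a3", "p2"),
         ("a1", "p3"), ("a3", "p3"), ("a3", "p4"), ("a4", "p4")]
adj = build_adj(edges)
vtype = {v: v[0].upper() for v in adj}  # "a1" -> "A", "p1" -> "P"
print(sorted(community_search(adj, vtype, ["A", "P", "A"], "a1", 2)))
```

In the toy network, a4 shares a paper only with a3, so it is peeled away when k = 2, while a1, a2, and a3 each keep two co-authors. Materializing all meta-path neighbors up front, as this sketch does, is presumably the kind of cost that efficient query algorithms aim to avoid on large HINs.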
Pages: 854-867
Page count: 14