Efficiently summarizing attributed diffusion networks

被引:3
作者
Amiri, Sorour E. [1 ]
Chen, Liangzhe [1 ]
Prakash, B. Aditya [1 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
基金
美国国家科学基金会;
关键词
Attributed graph; Summarization; Topic aware influence maximization;
D O I
10.1007/s10618-018-0572-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given a large attributed social network, can we find a compact, diffusion-equivalent representation while keeping the attribute properties? Diffusion networks with user attributes such as friendship, email communication, and people contact networks are increasingly common-place in the real-world. However, analyzing them is challenging due to their large size. In this paper, we first formally formulate a novel problem of summarizing an attributed diffusion graph to preserve its attributes and influence-based properties. Next, we propose ANeTS, an effective sub-quadratic parallelizable algorithm to solve this problem: it finds the best set of candidate nodes and merges them to construct a smaller network of 'super-nodes' preserving the desired properties. Extensive experiments on diverse real-world datasets show that ANeTS outperforms all state-of-the-art baselines (some of which do not even finish in 14 days). Finally, we show how ANeTS helps in multiple applications such as Topic-Aware viral marketing and sense-making of diverse graphs from different domains.
引用
收藏
页码:1251 / 1274
页数:24
相关论文
共 35 条
[21]  
Liu Y., 2016, ABS161204883 CORR
[22]   Focused Clustering and Outlier Detection in Large Attributed Graphs [J].
Perozzi, Bryan ;
Akoglu, Leman ;
Sanchez, Patricia Iglesias ;
Mueller, Emmanuel .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :1346-1355
[23]  
Prakash BA, 2011, THRESHOLD CONDITIONS
[24]   Fast Influence-based Coarsening for Large Networks [J].
Purohit, Manish ;
Prakash, B. Aditya ;
Kang, Chanhyun ;
Zhang, Yao ;
Subrahmanian, V. S. .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :1296-1305
[25]  
Qiang Qu, 2014, Machine Learning and Knowledge Discovery in Databases. European Conference, ECML PKDD 2014. Proceedings: LNCS 8725, P597, DOI 10.1007/978-3-662-44851-9_38
[26]   FUSE: a profit maximization approach for functional summarization of biological networks [J].
Seah, Boon-Siew ;
Bhowmick, Sourav S. ;
Dewey, C. Forbes, Jr. ;
Yu, Hanry .
BMC BIOINFORMATICS, 2012, 13 :S10
[27]   Collective Classification in Network Data [J].
Sen, Prithviraj ;
Namata, Galileo ;
Bilgic, Mustafa ;
Getoor, Lise ;
Gallagher, Brian ;
Eliassi-Rad, Tina .
AI MAGAZINE, 2008, 29 (03) :93-106
[28]   VEGAS: Visual influEnce GrAph Summarization on Citation Networks [J].
Shi, Lei ;
Tong, Hanghang ;
Tang, Jie ;
Lin, Chuang .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (12) :3417-3431
[29]  
Tian R. A., 2008, ACM SIGMOD INT C MAN, P567, DOI DOI 10.1145/1376616.1376675
[30]  
Toivonen H., 2011, P 17 ACM SIGKDD INT, P965