Distributed k-Core View Materialization and Maintenance for Large Dynamic Graphs

被引:31
作者
Aksu, Hidayet [1 ]
Canim, Mustafa [2 ]
Chang, Yuan-Chi [2 ]
Korpeoglu, Ibrahim [3 ]
Ulusoy, Ozgur [3 ]
机构
[1] Bilkent Univ, Dept Comp Engn, TR-06836 Ankara, Turkey
[2] IBM Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[3] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
关键词
k-core; graph theory; distributed computing; dynamic social networks; RANDOM HYPERGRAPHS;
D O I
10.1109/TKDE.2013.2297918
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In graph theory, k-core is a key metric used to identify subgraphs of high cohesion, also known as the 'dense' regions of a graph. As the real world graphs such as social network graphs grow in size, the contents get richer and the topologies change dynamically, we are challenged not only to materialize k-core subgraphs for one time but also to maintain them in order to keep up with continuous updates. Adding to the challenge is that real world data sets are outgrowing the capacity of a single server and its main memory. These challenges inspired us to propose a new set of distributed algorithms for k-core view construction and maintenance on a horizontally scaling storage and computing platform. Our algorithms execute against the partitioned graph data in parallel and take advantage of k-core properties to aggressively prune unnecessary computation. Experimental evaluation results demonstrated orders of magnitude speedup and advantages of maintaining k-core incrementally and in batch windows over complete reconstruction. Our algorithms thus enable practitioners to create and maintain many k-core views on different topics in rich social network content simultaneously.
引用
收藏
页码:2439 / 2452
页数:14
相关论文
共 41 条
  • [1] Altaf-Ul-Amine Md, 2003, Gen. Inf, V14, P498, DOI [10.11234/gi1990.14.498, DOI 10.11234/GI1990.14.498]
  • [2] [Anonymous], 2008, NETW HETEROG MEDIA
  • [3] [Anonymous], 2005, P NIPS, DOI DOI 10.5555/2976248.2976254
  • [4] [Anonymous], P 2010 ACM SIGMOD IN, DOI [DOI 10.1145/1807167.1807184, 10.1145/1807167.1807184]
  • [5] [Anonymous], 2007, P 5 ACM US INT MEAS
  • [6] [Anonymous], 2012, 161291 MICR RES
  • [7] [Anonymous], 2011, SIGMOD 11 P 2011 INT, DOI [DOI 10.1145/1989323.1989438, 10.1145/1989323.1989438]
  • [8] [Anonymous], P 8 ACM SIGCOMM C IN
  • [9] [Anonymous], 2003, CORR
  • [10] An automated method for finding molecular complexes in large protein interaction networks
    Bader, GD
    Hogue, CW
    [J]. BMC BIOINFORMATICS, 2003, 4 (1)