Learning and Scaling Directed Networks via Graph Embedding

被引:3
作者
Drobyshevskiy, Mikhail [1 ]
Korshunov, Anton [1 ]
Turdakov, Denis [1 ]
机构
[1] Russian Acad Sci, Inst Syst Programming, Moscow, Russia
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT I | 2017年 / 10534卷
关键词
Random graph generating; Graph embedding; Representation learning;
D O I
10.1007/978-3-319-71249-9_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reliable evaluation of network mining tools implies significance and scalability testing. This is usually achieved by picking several graphs of various size from different domains. However, graph properties and thus evaluation results could be dramatically different from one domain to another. Hence the necessity of aggregating results over a multitude of graphs within each domain. The paper introduces an approach to automatically learn features of a directed graph from any domain and generate similar graphs while scaling input graph size with a real-valued factor. Generating multiple graphs with similar size allows significance testing, while scaling graph size makes scalability evaluation possible. The proposed method relies on embedding an input graph into low-dimensional space, thus encoding graph features in a set of node vectors. Edge weights and node communities could be imitated as well in optional steps. We demonstrate that embedding-based approach ensures variability of synthetic graphs while keeping degree and subgraphs distributions close to the original graphs. Therefore, the method could make significance and scalability testing of network algorithms more reliable without the need to collect additional data. We also show that embedding-based approach preserves various features in generated graphs which can't be achieved by other generators imitating a given graph.
引用
收藏
页码:634 / 650
页数:17
相关论文
共 30 条
[1]   Statistical mechanics of complex networks [J].
Albert, R ;
Barabási, AL .
REVIEWS OF MODERN PHYSICS, 2002, 74 (01) :47-97
[2]  
[Anonymous], 2006, P 15 ACM INT C INF K
[3]  
[Anonymous], 2010, AISTATS
[4]  
[Anonymous], 2014, PROC 20 ACM SIGKDD, DOI DOI 10.1145/2623330.2623732
[5]   Mining large networks with subgraph counting [J].
Bordino, Ilaria ;
Donato, Debora ;
Gionis, Aristides ;
Leonardi, Stefano .
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :737-+
[6]  
Chakrabarti D, 2004, SIAM PROC S, P442
[7]  
Chykhradze K., 2014, Complex networks v, P199
[8]   Scale-free networks are ultrasmall [J].
Cohen, R ;
Havlin, S .
PHYSICAL REVIEW LETTERS, 2003, 90 (05) :4
[9]   Pseudofractal scale-free web [J].
Dorogovtsev, SN ;
Goltsev, AV ;
Mendes, JFF .
PHYSICAL REVIEW E, 2002, 65 (06) :1-066122
[10]  
[Дробышевский Михаил Drobyshevskiy Mikhail], 2016, [Труды Института системного программирования РАН, Trudy Instituta sistemnogo programmirovaniya RAN], V28, P153