DGCNN: A convolutional neural network over large-scale labeled graphs

被引:162
作者
Anh Viet Phan [1 ,2 ]
Minh Le Nguyen [1 ]
Yen Lam Hoang Nguyen [1 ]
Lam Thu Bui [2 ]
机构
[1] JAIST, Nomi City 9231211, Japan
[2] Le Quy Don Tech Univ, Res Grp Computat Intelligence, 236 Hoang Quoc Viet St, Hanoi, Vietnam
关键词
Labeled directed graphs; Convolutional neural networks (CNNs); Control flow graphs (CFGs); abstract syntax trees (ASTs); KERNELS;
D O I
10.1016/j.neunet.2018.09.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploiting graph-structured data has many real applications in domains including natural language semantics, programming language processing, and malware analysis. A variety of methods has been developed to deal with such data. However, learning graphs of large-scale, varying shapes and sizes is a big challenge for any method. In this paper, we propose a multi-view multi-layer convolutional neural network on labeled directed graphs (DGCNN), in which convolutional filters are designed flexibly to adapt to dynamic structures of local regions inside graphs. The advantages of DGCNN are that we do not need to align vertices between graphs, and that DGCNN can process large-scale dynamic graphs with hundred thousands of nodes. To verify the effectiveness of DGCNN, we conducted experiments on two tasks: malware analysis and software defect prediction. The results show that DGCNN outperforms the baselines, including several deep neural networks. (c) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:533 / 543
页数:11
相关论文
共 36 条
  • [1] Allen Frances E., 1970, ACM SIGPLAN NOTICES, V5, P1, DOI DOI 10.1145/390013.808479
  • [2] Graph-based malware detection using dynamic analysis
    Anderson, Blake
    Quist, Daniel
    Neil, Joshua
    Storlie, Curtis
    Lane, Terran
    [J]. JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2011, 7 (04): : 247 - 258
  • [3] [Anonymous], 2017, ARXIV170402901
  • [4] [Anonymous], 2016, US CURRAN
  • [5] [Anonymous], 2015, PROC INT S FOUND PRA
  • [6] [Anonymous], 2016, ARXIV160908965
  • [7] [Anonymous], 2016, ARXIV161103199
  • [8] [Anonymous], 2017, ARXIV170202181
  • [9] [Anonymous], 2011, Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '11
  • [10] [Anonymous], 2012, P 2012 SIAM INT C DA