Search Efficient Binary Network Embedding

被引:0
作者
Zhang, Daokun [1 ]
Yin, Jie [1 ]
Zhu, Xingquan [2 ]
Zhang, Chengqi [3 ]
机构
[1] Univ Sydney, Discipline Business Analyt, City Rd, Camperdown, NSW 2006, Australia
[2] Florida Atlantic Univ, Dept Comp & Elect Engn & Comp Sci, Glades Rd, Boca Raton, FL 33431 USA
[3] Univ Technol Sydney, Fac Engn & Informat Technol, Australian Artificial Intelligence Inst, Broadway, Ultimo, NSW 2007, Australia
基金
澳大利亚研究理事会; 美国国家科学基金会;
关键词
Network embedding; binary coding; similarity search; efficiency;
D O I
10.1145/3436892
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional network embedding primarily focuses on learning a continuous vector representation for each node, preserving network structure and/or node content information, such that off-the-shelf machine learning algorithms can be easily applied to the vector-format node representations for network analysis. However, the learned continuous vector representations are inefficient for large-scale similarity search, which often involves finding nearest neighbors measured by distance or similarity in a continuous vector space. In this article, we propose a search efficient binary network embedding algorithm called BinaryNE to learn a binary code for each node, by simultaneously modeling node context relations and node attribute relations through a three-layer neural network. BinaryNE learns binary node representations using a stochastic gradient descent-based online learning algorithm. The learned binary encoding not only reduces memory usage to represent each node, but also allows fast bit-wise comparisons to support faster node similarity search than using Euclidean or other distance measures. Extensive experiments and comparisons demonstrate that BinaryNE not only delivers more than 25 times faster search speed, but also provides comparable or better search quality than traditional continuous vector based network embedding methods. The binary codes learned by BinaryNE also render competitive performance on node classification and node clustering tasks. The source code of the BinaryNE algorithm is available at https://github.com/daokunzhang/BinaryNE.
引用
收藏
页数:27
相关论文
共 69 条
  • [1] Friends and neighbors on the Web
    Adamic, LA
    Adar, E
    [J]. SOCIAL NETWORKS, 2003, 25 (03) : 211 - 230
  • [2] Allgower E.L., 2012, Numerical Continuation Methods: an Introduction, V13
  • [3] [Anonymous], 2011, SIGKDD
  • [4] [Anonymous], 2010, 50 Years of Integer Programming 1958-2008, DOI DOI 10.1007/978-3-540-68279-0_15
  • [5] [Anonymous], 2016, ICLR
  • [6] [Anonymous], 2016, J Telecommun Inf Technol
  • [7] [Anonymous], 2017, ACM COMPUT SURV, DOI DOI 10.1145/3047307
  • [8] Assessing the relevance of node features for network structure
    Bianconi, Ginestra
    Pin, Paolo
    Marsili, Matteo
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (28) : 11433 - 11438
  • [9] Min-wise independent permutations
    Broder, AZ
    Charikar, M
    Frieze, AM
    Mitzenmacher, M
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2000, 60 (03) : 630 - 659
  • [10] Cao S., 2015, CIKM, P891