Scalable and Effective Bipartite Network Embedding

被引:4
|
作者
Yang, Renchi [1 ]
Shi, Jieming [2 ]
Huang, Keke [1 ]
Xiao, Xiaokui [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Hong Kong Polytech Univ, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22) | 2022年
关键词
Bipartite Graphs; Network Embedding; Poisson Distribution; GRAPHS;
D O I
10.1145/3514221.3517838
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a bipartite graph G consisting of inter-set weighted edges connecting the nodes in two disjoint sets U and V, bipartite network embedding (BNE) maps each node u(i) is an element of U and v(j) is an element of V to compact embedding vectors that capture the hidden topological features surrounding the nodes, to facilitate downstream tasks. Effective BNE should preserve not only the direct connections between nodes but also the multi-hop relationships formed alternately by the two types of nodes in G, which can incur prohibitive overheads, especially on massive bipartite graphs with millions of nodes and billions of edges. Existing solutions are hardly scalable to massive bipartite graphs, and often produce low-quality results. This paper proposes GEBE, a generic BNE framework achieving state-of-the-art performance on massive bipartite graphs, via four main algorithmic designs. First, we present two generic measures to capture the multi-hop similarity/proximity between homogeneous/heterogeneous nodes respectively, and the measures can be instantiated with three popular probability distributions, including Poisson, Geometric, and Uniform distributions. Second, GEBE formulates a novel and unified BNE objective to preserve the two measures of all possible node pairs. Third, GEBE includes several efficiency designs to get high-quality embeddings on massive graphs. Finally, we observe that GEBE achieves the best performance when instantiating MHS and MHP using a Poisson distribution, and thus, we further develop GEBE(p) based on Poisson-instantiated MHS and MHP, with non-trivial efficiency optimizations. Extensive experiments, comparing 15 competitors on 10 real datasets, demonstrate that our solutions, especially GEBE(p), obtain superior result utility than all competitors for top-N recommendation and link prediction, while being up to orders of magnitude faster.
引用
收藏
页码:1977 / 1991
页数:15
相关论文
共 50 条
  • [1] PANE: scalable and effective attributed network embedding
    Renchi Yang
    Jieming Shi
    Xiaokui Xiao
    Yin Yang
    Sourav S. Bhowmick
    Juncheng Liu
    The VLDB Journal, 2023, 32 : 1237 - 1262
  • [2] PANE: scalable and effective attributed network embedding
    Yang, Renchi
    Shi, Jieming
    Xiao, Xiaokui
    Yang, Yin
    Bhowmick, Sourav S.
    Liu, Juncheng
    VLDB JOURNAL, 2023, 32 (06): : 1237 - 1262
  • [3] Bipartite Network Embedding via Effective Integration of Explicit and Implicit Relations
    Wang, Yaping
    Jiao, Pengfei
    Wang, Wenjun
    Lu, Chunyu
    Liu, Hongtao
    Wang, Bo
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 435 - 451
  • [4] BiNE: Bipartite Network Embedding
    Gao, Ming
    Chen, Leihui
    He, Xiangnan
    Zhou, Aoying
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 715 - 724
  • [5] BiANE: Bipartite Attributed Network Embedding
    Huang, Wentao
    Li, Yuchen
    Fang, Yuan
    Fan, Ju
    Yang, Hongxia
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 149 - 158
  • [6] Scalable Multiplex Network Embedding
    Zhang, Hongming
    Qiu, Liwei
    Yi, Lingling
    Song, Yangqiu
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3082 - 3088
  • [7] Research on Bipartite Network Embedding with Auxiliary Information
    Ahmed, Hasnat
    Ali, Shahbaz
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 298 - 303
  • [8] Bipartite network embedding with Symmetric Neighborhood Convolution
    Zhou, Cangqi
    Zhang, Jing
    Gao, Kaisheng
    Li, Qianmu
    Hu, Dianming
    Sheng, Victor S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 198
  • [9] TLINE: Scalable Transductive Network Embedding
    Zhang, Xia
    Chen, Weizheng
    Yan, Hongfei
    INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2016, 2016, 9994 : 98 - 110
  • [10] SINE: Scalable Incomplete Network Embedding
    Zhang, Daokun
    Yin, Jie
    Zhu, Xingquan
    Zhang, Chengqi
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 737 - 746