CoarSAS2hvec: Heterogeneous Information Network Embedding with Balanced Network Sampling

被引:10
|
作者
Zhan, Ling [1 ]
Jia, Tao [1 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing 400715, Peoples R China
关键词
heterogeneous information networks; network embedding; context sampling; random walk; information entropy;
D O I
10.3390/e24020276
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Heterogeneous information network (HIN) embedding is an important tool for tasks such as node classification, community detection, and recommendation. It aims to find the representations of nodes that preserve the proximity between entities of different nature. A family of approaches that are widely adopted applies random walk to generate a sequence of heterogeneous contexts, from which, the embedding is learned. However, due to the multipartite graph structure of HIN, hub nodes tend to be over-represented to their context in the sampled sequence, giving rise to imbalanced samples of the network. Here, we propose a new embedding method: CoarSAS2hvec. The self-avoiding short sequence sampling with the HIN coarsening procedure (CoarSAS) is utilized to better collect the rich information in HIN. An optimized loss function is used to improve the performance of the HIN structure embedding. CoarSAS2hvec outperforms nine other methods in node classification and community detection on four real-world data sets. Using entropy as a measure of the amount of information, we confirm that CoarSAS catches richer information of the network compared with that through other methods. Hence, the traditional loss function applied to samples by CoarSAS can also yield improved results. Our work addresses a limitation of the random-walk-based HIN embedding that has not been emphasized before, which can shed light on a range of problems in HIN analyses.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Heterogeneous Information Network Embedding With Adversarial Disentangler
    Wang, Ruijia
    Shi, Chuan
    Zhao, Tianyu
    Wang, Xiao
    Ye, Yanfang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1581 - 1593
  • [2] HINE: Heterogeneous Information Network Embedding
    Chen, Yuxin
    Wang, Chenguang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT I, 2017, 10177 : 180 - 195
  • [3] Heterogeneous Information Network Embedding for Recommendation
    Shi, Chuan
    Hu, Binbin
    Zhao, Wayne Xin
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (02) : 357 - 370
  • [4] Proximity-aware heterogeneous information network embedding
    Zhang, Chen
    Wang, Guodong
    Yu, Bin
    Xie, Yu
    Pan, Ke
    KNOWLEDGE-BASED SYSTEMS, 2020, 193
  • [5] Collaborative Knowledge Distillation for Heterogeneous Information Network Embedding
    Wang, Can
    Zhou, Sheng
    Yu, Kang
    Chen, Defang
    Li, Bolang
    Feng, Yan
    Chen, Chun
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 1631 - 1639
  • [6] Network Sampling Using k-hop Random Walks for Heterogeneous Network Embedding
    Anil, Akash
    Singhal, Shubham
    Jain, Piyush
    Singh, Sanasam Ranbir
    Ladhar, Ajay
    Singh, Sandeep
    Chugh, Uppinder
    PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD, 2019, : 354 - 357
  • [7] HeteSpaceyWalk: A Heterogeneous Spacey Random Walk for Heterogeneous Information Network Embedding
    He, Yu
    Song, Yangqiu
    Li, Jianxin
    Ji, Cheng
    Peng, Jian
    Peng, Hao
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 639 - 648
  • [8] Embedding Heterogeneous Information Network in Hyperbolic Spaces
    Zhang, Yiding
    Wang, Xiao
    Liu, Nian
    Shi, Chuan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (02)
  • [9] AHINE: Adaptive Heterogeneous Information Network Embedding
    Lin, Yucheng
    Hong, Huiting
    Yang, Xiaoqing
    Gong, Pinghua
    Li, Zang
    Ye, Jieping
    11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 100 - 107
  • [10] Heterogeneous Information Network Embedding for Mention Recommendation
    Yi, Feng
    Jiang, Bo
    Wu, Jianjun
    IEEE ACCESS, 2020, 8 : 91394 - 91404