CoarSAS2hvec: Heterogeneous Information Network Embedding with Balanced Network Sampling

Cited by: 9
Authors
Zhan, Ling [1]
Jia, Tao [1]
Affiliations
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing 400715, Peoples R China
Keywords
heterogeneous information networks; network embedding; context sampling; random walk; information entropy
DOI
10.3390/e24020276
Chinese Library Classification (CLC) number
O4 [Physics]
Discipline classification code
0702
Abstract
Heterogeneous information network (HIN) embedding is an important tool for tasks such as node classification, community detection, and recommendation. It aims to find node representations that preserve the proximity between entities of different types. A widely adopted family of approaches applies random walks to generate sequences of heterogeneous contexts, from which the embedding is learned. However, due to the multipartite graph structure of HINs, hub nodes tend to be over-represented relative to their context in the sampled sequences, giving rise to imbalanced samples of the network. Here, we propose a new embedding method: CoarSAS2hvec. Self-avoiding short sequence sampling with an HIN coarsening procedure (CoarSAS) is used to better collect the rich information in the HIN. An optimized loss function is used to improve the performance of the HIN structure embedding. CoarSAS2hvec outperforms nine other methods in node classification and community detection on four real-world data sets. Using entropy as a measure of the amount of information, we confirm that CoarSAS captures richer information about the network than other sampling methods do. Hence, the traditional loss function applied to samples generated by CoarSAS can also yield improved results. Our work addresses a limitation of random-walk-based HIN embedding that has not been emphasized before, which can shed light on a range of problems in HIN analyses.
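As a rough illustration of two ideas named in the abstract (self-avoiding short sequence sampling, and entropy as a measure of how balanced the samples are), the sketch below runs on a toy bipartite graph. It is not the authors' implementation: the HIN coarsening step and the optimized loss are omitted, the entropy definition used in the paper may differ, and the graph, function names, and parameters (`adj`, `self_avoiding_walk`, `shannon_entropy`, `max_len`) are assumptions made for this example.

```python
import math
import random
from collections import Counter


def self_avoiding_walk(adj, start, max_len, rng):
    """Random walk that never revisits a node; stops early at a dead end."""
    walk, visited = [start], {start}
    while len(walk) < max_len:
        # Only neighbours not yet on this walk are eligible next steps.
        candidates = [n for n in adj[walk[-1]] if n not in visited]
        if not candidates:
            break
        nxt = rng.choice(candidates)
        walk.append(nxt)
        visited.add(nxt)
    return walk


def shannon_entropy(samples):
    """Shannon entropy (in bits) of the empirical distribution of samples."""
    counts = Counter(samples)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Hypothetical toy HIN: papers (p*) linked to authors (a*); "p0" acts as a hub.
adj = {
    "p0": ["a0", "a1", "a2", "a3"],
    "p1": ["a0"],
    "a0": ["p0", "p1"],
    "a1": ["p0"],
    "a2": ["p0"],
    "a3": ["p0"],
}

rng = random.Random(0)
# Sample 20 short self-avoiding sequences from every node.
walks = [
    self_avoiding_walk(adj, start, max_len=4, rng=rng)
    for start in adj
    for _ in range(20)
]
visits = [node for walk in walks for node in walk]
print("entropy of visited-node distribution (bits):", shannon_entropy(visits))
```

A higher entropy of the visited-node distribution indicates that the samples are spread more evenly across nodes; comparing this value against that of an ordinary random walk on the same graph is one way to probe the hub over-representation the abstract describes.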
Pages: 18
Related papers
50 in total
  • [21] Sampling informative context nodes for network embedding
    Zhu, Danhao
    Dai, Xin-Yu
    Chen, Jiajun
    Yin, Jie
    SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (11)
  • [22] SINE: Side Information Network Embedding
    Chen, Zitai
    Cai, Tongzhao
    Chen, Chuan
    Zheng, Zibin
    Ling, Guohui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 692 - 708
  • [23] Heterogeneous Information Network Embedding with Meta-path Based Graph Attention Networks
    Cao, Meng
    Ma, Xiying
    Xu, Ming
    Wang, Chongjun
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 622 - 634
  • [24] MINE: A Method of Multi-Interaction Heterogeneous Information Network Embedding
    Zhu, Dongjie
    Sun, Yundong
    Li, Xiaofang
    Du, Haiwen
    Qu, Rongning
    Yu, Pingping
    Piao, Xuefeng
    Higgs, Russell
    Cao, Ning
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (03) : 1343 - 1356
  • [25] Dual-View Fusion of Heterogeneous Information Network Embedding for Recommendation
    Ma, Jinlong
    Wang, Runfeng
    IEEE LATIN AMERICA TRANSACTIONS, 2024, 22 (07) : 557 - 565
  • [26] Semantic Based Heterogeneous Information Network Embedding for Patent Citation Recommendation
    Zhang, Yanping
    Li, Shuang
    Chen, Xi
    Qian, Fulan
    Zhao, Shu
    Zhu, Shuwei
    Wang, Yulu
    2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING (ICAICE 2020), 2020 : 518 - 527
  • [27] User behavior prediction via heterogeneous information preserving network embedding
    Yuan, Weiwei
    He, Kangya
    Han, Guangjie
    Guan, Donghai
    Khattak, Asad Masood
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 92 : 52 - 58
  • [28] Attention-Based Knowledge Tracing with Heterogeneous Information Network Embedding
    Zhang, Nan
    Du, Ye
    Deng, Ke
    Li, Li
    Shen, Jun
    Sun, Geng
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT I, 2020, 12274 : 95 - 103
  • [29] MINE: A method of multi-interaction heterogeneous information network embedding
    Zhu D.
    Sun Y.
    Li X.
    Du H.
    Qu R.
    Yu P.
    Piao X.
    Higgs R.
    Cao N.
    Yu, Pingping (yppflx@hotmail.com), 2020, Tech Science Press (63) : 1343 - 1356
  • [30] RHINE: Relation Structure-Aware Heterogeneous Information Network Embedding
    Shi, Chuan
    Lu, Yuanfu
    Hu, Linmei
    Liu, Zhiyuan
    Ma, Huadong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (01) : 433 - 447