Learning Heterogeneous Network Embedding From Text and Links

被引:2
作者
Long, Yunfei [1 ]
Xiang, Rong [1 ]
Lu, Qin [1 ]
Xiong, Dan [1 ]
Huang, Chu-Ren [2 ]
Bi, Chenglin [3 ]
Li, Mingle [4 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Chinese & Bilingual Studies, Hong Kong, Hong Kong, Peoples R China
[3] Adv Micro Devices Shanghai, Shanghai 201203, Peoples R China
[4] Huawei Technol Co Ltd, Shenzhen 518100, Peoples R China
来源
IEEE ACCESS | 2018年 / 6卷
关键词
Network embedding; heterogeneous network; attention mechanism; text processing; INTERNET;
D O I
10.1109/ACCESS.2018.2873044
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finding methods to represent multiple types of nodes in heterogeneous networks is both challenging and rewarding, as there is much less work in this area compared with that of homogeneous networks. In this paper, we propose a novel approach to learn node embedding for heterogeneous networks through a joint learning framework of both network links and text associated with nodes. A novel attention mechanism is also used to make good use of text extended through links to obtain much larger network context. Link embedding is first learned through a random-walk-based method to process multiple types of links. Text embedding is separately learned at both sentence level and document level to capture salient semantic information more comprehensively. Then, both types of embeddings are jointly fed into a hierarchical neural network model to learn node representation through mutual enhancement. The attention mechanism follows linked edges to obtain context of adjacent nodes to extend context for node representation. The evaluation on a link prediction task in a heterogeneous network data set shows that our method outperforms the current state-of-the-art method by 2.5%-5.0% in AUC values with p-value less than 10(-9), indicating very significant improvement.
引用
收藏
页码:55850 / 55860
页数:11
相关论文
共 48 条
  • [1] Airoldi EM, 2008, J MACH LEARN RES, V9, P1981
  • [2] Network analysis of protein structures identifies functional residues
    Amitai, G
    Shemesh, A
    Sitbon, E
    Shklar, M
    Netanely, D
    Venger, I
    Pietrokovski, S
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2004, 344 (04) : 1135 - 1146
  • [3] [Anonymous], 2000, Journal of Social Structure
  • [4] [Anonymous], 2017, Graph embedding techniques, applications, and performance: A survey
  • [5] [Anonymous], 2016, INT JOINT C ARTIFICI
  • [6] [Anonymous], 2011, Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '11
  • [7] [Anonymous], 2016, Network, DOI DOI 10.1145/2623330.2623732
  • [8] [Anonymous], 2013, 22 INT WORLD WID WEB, DOI DOI 10.1145/2488388.2488393
  • [9] Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics
    Asgari, Ehsaneddin
    Mofrad, Mohammad R. K.
    [J]. PLOS ONE, 2015, 10 (11):
  • [10] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
    BENGIO, Y
    SIMARD, P
    FRASCONI, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166