Embedding of Embedding (EOE) : Joint Embedding for Coupled Heterogeneous Networks

被引:126
作者
Xu, Linchuan [1 ]
Wei, Xiaokai [2 ]
Cao, Jiannong [1 ]
Yu, Philip S. [2 ]
机构
[1] Hong Kong Polytech Univ, Hong Kong, Peoples R China
[2] Univ Illinois, Chicago, IL USA
来源
WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING | 2017年
关键词
Network Embedding; Coupled Heterogeneous Networks; Data Mining; PREDICTION;
D O I
10.1145/3018661.3018723
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Network embedding is increasingly employed to assist network analysis as it is effective to learn latent features that encode linkage information. Various network embedding methods have been proposed, but they are only designed for a single network scenario. In the era of big data, different types of related information can be fused together to form a coupled heterogeneous network, which consists of two different but related sub-networks connected by inter-network edges. In this scenario, the inter-network edges can act as complementary information in the presence of intra-network ones. This complementary information is important because it can make latent features more comprehensive and accurate. And it is more important when the intra-network edges are absent, which can be referred to as the cold-start problem. In this paper, we thus propose a method named embedding of embedding (EOE) for coupled heterogeneous networks. In the EOE, latent features encode not only intra-network edges, but also inter-network ones. To tackle the challenge of heterogeneities of two networks, the EOE incorporates a harmonious embedding matrix to further embed the embeddings that only encode intra-network edges. Empirical experiments on a variety of real-world datasets demonstrate the EOE outperforms consistently single network embedding methods in applications including visualization, link prediction multi-class classification, and multi-label classification.
引用
收藏
页码:741 / 749
页数:9
相关论文
共 26 条
[1]  
Ahmed A., 2013, 22 INT WORLD WID WEB, P37, DOI [10.1145/2488388.2488393, DOI 10.1145/2488388.2488393]
[2]  
[Anonymous], 2016, IEEE INT C BIG DAT
[3]  
[Anonymous], 2014, P 20 ACM SIGKDD INT, DOI [DOI 10.1145/2623330.2623732, 10 . 1145 / 2623330 . 2623732. arXiv: 1403.6652]
[5]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[6]  
Bezdek J. C., 2002, Advances in Soft Computing - AFSS 2002. 2002 AFSS International Conference on Fuzzy Systems. Proceedings (Lecture Notes in Artificial Intelligence Vol.2275), P288
[7]   Collective Prediction of Multiple Types of Links in Heterogeneous Information Networks [J].
Cao, Bokai ;
Kong, Xiangnan ;
Yu, Philip S. .
2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, :50-59
[8]  
Carpenter Bob., 2008, Lazy sparse stochastic gradient descent for regularized multinomial logistic regression, P1
[9]   Assessing Drug Target Association Using Semantic Linked Data [J].
Chen, Bin ;
Ding, Ying ;
Wild, David J. .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (07)
[10]  
Cox T.F., 2000, Multidimensional Scaling, V2nd ed.