AENEA: A novel autoencoder-based network embedding algorithm

Cited by: 0
Authors
Xiaolong Xu
Haoyan Xu
Yang Wang
Jing Zhang
Affiliations
[1] Nanjing University of Posts and Telecommunications, School of Computer Science
[2] Nanjing University of Posts and Telecommunications, Jiangsu Key Laboratory of Big Data Security & Intelligent Processing
[3] Shanghai Business School, Faculty of Business Information
Source
Peer-to-Peer Networking and Applications | 2021 / Vol. 14
Keywords
Network embedding; Deep learning; Semi-supervised learning; Autoencoder
DOI
Not available
Abstract
Network embedding aims to represent the vertices of a network as low-dimensional, dense, real-valued vectors, so that the resulting vertex representations support representation and inference in vector space. As complex networks grow in scale, how to represent a high-dimensional network in a low-dimensional vector space through network embedding has become an important issue. Typical autoencoder-based network embedding methods include DNGR and SDNE. DNGR trains on the Positive Pointwise Mutual Information (PPMI) matrix with a Stacked Denoising Autoencoder (SDAE), which lacks depth and therefore attains a less satisfactory representation of the network. SDNE uses a semi-supervised autoencoder to embed the adjacency matrix, whose sparsity may increase the cost of the learning process. To solve these problems, we propose a novel Autoencoder-based Network Embedding Algorithm (AENEA). AENEA consists of three main steps. First, the random surfing model is applied to the original network to obtain the Probabilistic Co-occurrence (PCO) matrix between the nodes. Second, the PCO matrix is processed to generate the corresponding PPMI matrix. Finally, a semi-supervised autoencoder learns the representation of the vertices in the network from the PPMI matrix. We conducted a series of experiments to compare the performance of AENEA with DNGR, SDNE, and other algorithms on the standardized datasets 20-NewsGroup and Wine. The experimental results show that AENEA is clearly superior to the existing algorithms in clustering, classification, and visualization tasks.
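The first two steps of the pipeline described in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give the formulas, so this assumes the random surfing update used in the DNGR line of work (a k-step walk with restart probability `1 - alpha`) and the standard PPMI definition. The function names `random_surfing_pco` and `ppmi`, and the parameters `num_steps` and `alpha`, are hypothetical choices for illustration and are not taken from the paper. The third step (the semi-supervised autoencoder) is omitted because the abstract does not describe its architecture or loss.

```python
import numpy as np


def random_surfing_pco(adj, num_steps=3, alpha=0.98):
    """Accumulate k-step visiting probabilities into a co-occurrence matrix.

    Assumed update (DNGR-style random surfing, not confirmed by the abstract):
        p_k = alpha * p_{k-1} @ T + (1 - alpha) * p_0
    where T is the row-normalized adjacency matrix and p_0 is the identity.
    """
    adj = np.asarray(adj, dtype=float)
    n = adj.shape[0]
    row_sums = adj.sum(axis=1, keepdims=True)
    row_sums[row_sums == 0] = 1.0  # avoid division by zero for isolated vertices
    transition = adj / row_sums

    p0 = np.eye(n)
    p = p0.copy()
    pco = np.zeros((n, n))
    for _ in range(num_steps):
        p = alpha * (p @ transition) + (1.0 - alpha) * p0
        pco += p
    return pco


def ppmi(matrix):
    """Positive pointwise mutual information of a non-negative matrix."""
    matrix = np.asarray(matrix, dtype=float)
    total = matrix.sum()
    row = matrix.sum(axis=1, keepdims=True)
    col = matrix.sum(axis=0, keepdims=True)
    expected = row @ col  # outer product of row and column marginals
    with np.errstate(divide="ignore", invalid="ignore"):
        pmi = np.log(matrix * total / expected)
    pmi[~np.isfinite(pmi)] = 0.0  # zero co-occurrence or zero marginals
    return np.maximum(pmi, 0.0)


if __name__ == "__main__":
    # Toy path graph 0-1-2-3, used only to exercise the two functions.
    adj = np.array([[0, 1, 0, 0],
                    [1, 0, 1, 0],
                    [0, 1, 0, 1],
                    [0, 0, 1, 0]])
    pco = random_surfing_pco(adj)
    print(ppmi(pco).round(3))
```

Under these assumptions, the resulting PPMI matrix is denser than the raw adjacency matrix, which is the property the abstract contrasts with SDNE's sparse input; it would then serve as the input to the autoencoder in the third step.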
Pages: 1829-1840
Number of pages: 11