AENEA: A novel autoencoder-based network embedding algorithm

被引:0
作者
Xiaolong Xu
Haoyan Xu
Yang Wang
Jing Zhang
机构
[1] Nanjing University of Posts and Telecommunications,School of Computer Science
[2] Nanjing University of Posts and Telecommunications,Jiangsu Key Laboratory of Big Data Security & Intelligent Processing
[3] Shanghai Business School,Faculty of Business Information
来源
Peer-to-Peer Networking and Applications | 2021年 / 14卷
关键词
Network embedding; Deep learning; Semi-supervised learning; Autoencoder;
D O I
暂无
中图分类号
学科分类号
摘要
Network embedding aims to represent vertices in the network with low-dimensional dense real number vectors, so that the attained vertices can acquire the ability of representation and inference in vector space. With the expansion of the scale of complex networks, how to make the high-dimensional network represented in low-dimensional vector space through network becomes an important issue. The typical algorithms of current autoencoder-based network embedding methods include DNGR and SDNE. DNGR method trains the Positive Pointwise Mutual Information (PPMI) matrix with the Stacked Denosing Autoencoder (SDAE), which is lacking in depth thereby attaining less satisfactory representation of network. Besides, SDNE used a semi-supervised autoencoder for embedding the adjacency matrix, whose sparsity may generate more cost in the learning process. In order to solve these problems, we propose a novel Autoencoder-based Network Embedding Algorithm (AENEA). AENEA is mainly divided into three steps. First, the random surfing model is used to process the original network to obtain the Probabilistic Co-occurrence (PCO) matrix between the nodes. Secondly, the Probabilistic Co-occurrence (PCO) matrix is processed to generate the corresponding Positive Pointwise Mutual Information (PPMI) matrix. Finally, the PPMI matrix is used to learn the representation of vertices in the network by using a semi-supervised autoencoder. We implemented a series of experiments to test the performance of AENEA, DNGR, SDNE and so on, on the standardized datasets 20-NewsGroup and Wine. The experimental results show that the performance of AENEA is obviously superior to the existing algorithms in clustering, classification and visualization tasks.
引用
收藏
页码:1829 / 1840
页数:11
相关论文
共 36 条
[1]  
Daokun Z(2020)Network representation learning: a survey IEEE Trans Big Data 6 3-28
[2]  
Jie Y(2020)Mining consuming behaviors with temporal evolution for personalized recommendation in mobile marketing apps Mob Netw Appl 25 1233-1248
[3]  
Xingquan Z(2019)An iot-based task scheduling optimization scheme considering the deadline and cost-aware scientific workflow for cloud computing EURASIP J Wirel Commun Netw 2019 249-732
[4]  
Chengqi Z(2016)Neural networks for the prediction of organic chemistry reactions ACS Central Sci 2 725-1065
[5]  
Gao H(2018)A unified framework for community detection and network representation learning IEEE Trans Knowl Data Eng 31 1051-2326
[6]  
Kuang L(2000)Nonlinear dimensionality reduction by locally linear embedding Science 290 2323-591
[7]  
Yin Y(2001)Laplacian eigenmaps and spectral techniques for embedding and clustering Adv Neural Inf Proces Syst 14 585-29
[8]  
Guo B(1990)Word association norms, mutual information, and lexicography Comput Linguist 16 22-526
[9]  
Dou K(2007)Extracting semantic representations from word co-occurrence statistics: a computational study Behav Res Methods 39 510-617
[10]  
Ma X(2002)Cluster ensembles|a knowledgereuse framework for combining multiple partitions J Mach Learn Res 3 583-2605