AENEA: A novel autoencoder-based network embedding algorithm

被引:3
作者
Xu, Xiaolong [1 ]
Xu, Haoyan [2 ]
Wang, Yang [2 ]
Zhang, Jing [3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing, Jiangsu, Peoples R China
[3] Shanghai Business Sch, Fac Business Informat, Shanghai, Peoples R China
关键词
Network embedding; Deep learning; Semi-supervised learning; Autoencoder;
D O I
10.1007/s12083-020-01043-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Network embedding aims to represent vertices in the network with low-dimensional dense real number vectors, so that the attained vertices can acquire the ability of representation and inference in vector space. With the expansion of the scale of complex networks, how to make the high-dimensional network represented in low-dimensional vector space through network becomes an important issue. The typical algorithms of current autoencoder-based network embedding methods include DNGR and SDNE. DNGR method trains the Positive Pointwise Mutual Information (PPMI) matrix with the Stacked Denosing Autoencoder (SDAE), which is lacking in depth thereby attaining less satisfactory representation of network. Besides, SDNE used a semi-supervised autoencoder for embedding the adjacency matrix, whose sparsity may generate more cost in the learning process. In order to solve these problems, we propose a novel Autoencoder-based Network Embedding Algorithm (AENEA). AENEA is mainly divided into three steps. First, the random surfing model is used to process the original network to obtain the Probabilistic Co-occurrence (PCO) matrix between the nodes. Secondly, the Probabilistic Co-occurrence (PCO) matrix is processed to generate the corresponding Positive Pointwise Mutual Information (PPMI) matrix. Finally, the PPMI matrix is used to learn the representation of vertices in the network by using a semi-supervised autoencoder. We implemented a series of experiments to test the performance of AENEA, DNGR, SDNE and so on, on the standardized datasets 20-NewsGroup and Wine. The experimental results show that the performance of AENEA is obviously superior to the existing algorithms in clustering, classification and visualization tasks.
引用
收藏
页码:1829 / 1840
页数:12
相关论文
共 36 条
[1]  
Ahmed A., 2013, WWW 2013, P37, DOI [10.1145/2488388.2488393, DOI 10.1145/2488388.2488393]
[2]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[3]  
Asuncion A., 2007, UCI Machine Learning Repository
[4]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[5]   Extracting semantic representations from word co-occurrence statistics: A computational study [J].
Bullinaria, John A. ;
Levy, Joseph P. .
BEHAVIOR RESEARCH METHODS, 2007, 39 (03) :510-526
[6]  
Cao SS, 2016, AAAI CONF ARTIF INTE, P1145
[7]   PME: Projected Metric Embedding on Heterogeneous Networks for Link Prediction [J].
Chen, Hongxu ;
Yin, Hongzhi ;
Wang, Weiqing ;
Wang, Hao ;
Quoc Viet Hung Nguyen ;
Li, Xue .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :1177-1186
[8]   How Do the Open Source Communities Address Usability and UX Issues? An Exploratory Study [J].
Cheng, Jinghui ;
Guo, Jin L. C. .
CHI 2018: EXTENDED ABSTRACTS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2018,
[9]  
CHURCH KW, 1990, 27TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, P76
[10]   V2VR: Reliable Hybrid-Network-Oriented V2V Data Transmission and Routing Considering RSUs and Connectivity Probability [J].
Gao, Honghao ;
Liu, Can ;
Li, Youhuizi ;
Yang, Xiaoxian .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) :3533-3546