Outlier Resistant Unsupervised Deep Architectures for Attributed Network Embedding

被引:88
作者
Bandyopadhyay, Sambaran [1 ,2 ]
Lokesh, N. [2 ]
Vivek, Saley Vishal [2 ]
Murty, M. N. [2 ]
机构
[1] IBM Res, Bangalore, Karnataka, India
[2] Indian Inst Sci, Bangalore, Karnataka, India
来源
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20) | 2020年
关键词
network representation learning; community outliers; adversarial learning; deep autoencoder; graph mining; social networks;
D O I
10.1145/3336191.3371788
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Attributed network embedding is the task to learn a lower dimensional vector representation of the nodes of an attributed network, which can be used further for downstream network mining tasks. Nodes in a network exhibit community structure and most of the network embedding algorithms work well when the nodes, along with their attributes, adhere to the community structure of the network. But real life networks come with community outlier nodes, which deviate significantly in terms of their link structure or attribute similarities from the other nodes of the community they belong to. These outlier nodes, if not processed carefully, can even affect the embeddings of the other nodes in the network. Thus, a node embedding framework for dealing with both the link structure and attributes in the presence of outliers in an unsupervised setting is practically important. In this work, we propose a deep unsupervised autoencoders based solution which minimizes the effect of outlier nodes while generating the network embedding. We use both stochastic gradient descent and closed form updates for faster optimization of the network parameters. We further explore the role of adversarial learning for this task, and propose a second unsupervised deep model which learns by discriminating the structure and the attribute based embeddings of the network and minimizes the effect of outliers in a coupled way. Our experiments show the merit of these deep models to detect outliers and also the superiority of the generated network embeddings for different downstream mining tasks. To the best of our knowledge, these are the first unsupervised non linear approaches that reduce the effect of the outlier nodes while generating Network Embedding.
引用
收藏
页码:25 / 33
页数:9
相关论文
共 32 条
[1]  
[Anonymous], 30 AAAI C ART INT
[2]  
[Anonymous], 2018, ARXIV180405313
[3]  
[Anonymous], 2017, P 31 INT C NEUR INF
[4]  
[Anonymous], 2018, P 27 INT JOINT C ART
[5]  
[Anonymous], 2014, PROC 20 ACM SIGKDD, DOI DOI 10.1145/2623330.2623732
[6]  
Bandyopadhyay S, 2019, AAAI CONF ARTIF INTE, P12
[7]  
Ding Kaize, 2019, SDM
[8]  
Gao HC, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3364
[9]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[10]   node2vec: Scalable Feature Learning for Networks [J].
Grover, Aditya ;
Leskovec, Jure .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :855-864