TOPIC SEQUENCE EMBEDDING FOR USER IDENTITY LINKAGE FROM HETEROGENEOUS BEHAVIOR DATA

被引:2
作者
Yang, Jinzhu [1 ,2 ]
Zhou, Wei [1 ,2 ]
Qian, Wanhui [1 ,2 ]
Han, Jizhong [1 ]
Hu, Songlin [1 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
User Identity Linkage; Information Security; Social Network; User Privacy;
D O I
10.1109/ICASSP39728.2021.9415111
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In social media, user identity linkage is a vital information security issue of identifying users' private information across multiple online social networks. With the popularity of behavior-rich social services, existing methods attempt to align users through encoding behaviors. However, most of the efforts suffer from the high variety and heterogeneity of behavior data across social networks, resulting in a limitation of modeling user intrinsic characteristics. To address the above issues, we focus on keyword-based topics to formulate user's variety behaviors for user identity linkage. In this paper, a novel Topic Sequence Embedding (TSeqE) method is proposed to embed contextual information of topics to represent users' intrinsic characteristics for identity linkage. Furthermore, we introduce a domain-adversarial training strategy to tackle the behavior heterogeneity problem. Our experiments on two real-world datasets demonstrate that TSeqE produces a significant improvement compared with several strong baselines.
引用
收藏
页码:2590 / 2594
页数:5
相关论文
共 18 条
[1]  
[Anonymous], 2013, P 2013 KDD CUP 2013
[2]  
Duchi J, 2011, J MACH LEARN RES, V12, P2121
[3]   DPLink: User Identity Linkage via Deep Neural Network From Heterogeneous Mobility Data [J].
Feng, Jie ;
Zhang, Mingyang ;
Wang, Huandong ;
Yang, Zeyu ;
Zhang, Chao ;
Li, Yong ;
Jin, Depeng .
WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, :459-469
[4]  
Ganin Y, 2016, J MACH LEARN RES, V17
[5]   On the Reliability of Profile Matching Across Large Online Social Networks [J].
Goga, Oana ;
Loiseau, Patrick ;
Sommer, Robin ;
Teixeira, Renata ;
Gummadi, Krishna P. .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :1799-1808
[6]  
Liu J., 2013, P 6 ACM INT C WEB SE, P495
[7]   HYDRA: Large-scale Social Identity Linkage via Heterogeneous Behavior Modeling [J].
Liu, Siyuan ;
Wang, Shuhui ;
Zhu, Feida ;
Zhang, Jinbo ;
Krishnan, Ramayya .
SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, :51-62
[8]  
Man T., 2016, INT JOINT C ART INT, P1823
[9]   User Identity Linkage by Latent User Space Modelling [J].
Mu, Xin ;
Zhu, Feida ;
Lim, Ee-Peng ;
Xiao, Jing ;
Wang, Jianzong ;
Zhou, Zhi-Hua .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :1775-1784
[10]   Where You Are Is Who You Are: User Identification by Matching Statistics [J].
Naini, Farid M. ;
Unnikrishnan, Jayakrishnan ;
Thiran, Patrick ;
Vetterli, Martin .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2016, 11 (02) :358-372