Online Social Network User Home Location Inference Based on Heterogeneous Networks

被引:0
作者
Fei, Gaolei [1 ]
Liu, Yang [1 ]
Hu, Guangmin [1 ]
Wen, Sheng [2 ]
Xiang, Yang
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
[2] Swinburne Univ Technol, Sch Software & Elect Engn, Hawthorn, Vic 3122, Australia
基金
中国国家自然科学基金;
关键词
Social networking (online); Heterogeneous networks; Training; Blogs; Data models; Urban areas; Inference algorithms; Online social network; location inference; heterogeneous network; information fusion; TWITTER;
D O I
10.1109/TDSC.2024.3376372
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Inferring the home locations of online social network (OSN) users from their corresponding account data is an important process for many applications, such as personal privacy protection and business advertising applications. The existing methods typically use a supervised learning method to infer a user's home location according to a single or partial aspect of their OSN information. However, the home location of a user may be represented in a biased way if only a single or partial aspect of the information is used, and the performances of the supervised learning-based methods are also very sensitive to the quality of the training set utilized. To address these problems, this article presents a novel unsupervised method for inferring the home locations of the OSN users. The method first builds a heterogeneous network model to comprehensively represent the complex location information in the OSN data and then recursively infers users' home locations by fusing the direct and indirect location information of the users. Experiments that compared our method with five existing typical Twitter user home location inference methods on a Twitter dataset demonstrate that the proposed method can significantly improve the accuracy and reliability of user home location inference.
引用
收藏
页码:5509 / 5525
页数:17
相关论文
共 45 条
[1]  
Akbik A, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P724
[2]  
[Anonymous], 2014, P C EMP METH NAT LAN
[3]  
[Anonymous], 2014, Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
[4]  
Arlia D., 2001, Euro-Par 2001 Parallel Processing. 7th International Euro-Par Conference. Proceedings (Lecture Notes in Computer Science Vol.2150), P326
[5]   Representation Learning for Attributed Multiplex Heterogeneous Network [J].
Cen, Yukuo ;
Zou, Xu ;
Zhang, Jianwei ;
Yang, Hongxia ;
Zhou, Jingren ;
Tang, Jie .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :1358-1368
[6]  
Chandra S., 2011, Proceedings of the 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and IEEE Third International Conference on Social Computing (PASSAT/SocialCom 2011), P838, DOI 10.1109/PASSAT/SocialCom.2011.120
[7]   Statistical Features-Based Real-Time Detection of Drifted Twitter Spam [J].
Chen, Chao ;
Wang, Yu ;
Zhang, Jun ;
Xiang, Yang ;
Zhou, Wanlei ;
Min, Geyong .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (04) :914-925
[8]  
Compton R, 2014, IEEE INT CONF BIG DA, P393, DOI 10.1109/BigData.2014.7004256
[9]  
Culotta Aron., 2004, P HLTNAACL 04, P109
[10]   Inferring the Location of Twitter Messages Based on User Relationships [J].
Davis, Clodoveu A., Jr. ;
Pappa, Gisele L. ;
Rocha de Oliveira, Diogo Renno ;
Arcanjo, Filipe de L. .
TRANSACTIONS IN GIS, 2011, 15 (06) :735-751