MetaGeo: A General Framework for Social User Geolocation Identification With Few-Shot Learning

被引:8
作者
Zhou, Fan [1 ]
Qi, Xiuxiu [1 ]
Zhang, Kunpeng [2 ]
Trajcevski, Goce [3 ]
Zhong, Ting [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 610054, Peoples R China
[2] Univ Maryland, Dept Decis Operat & Informat Technol, College Pk, MD 20742 USA
[3] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50011 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Geology; Social networking (online); Task analysis; Predictive models; Training; Blogs; Adaptation models; Bayesian learning; few-shot learning; geolocation; meta-learning; semisupervised learning;
D O I
10.1109/TNNLS.2022.3154204
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identifying the geolocation of social media users is an important problem in a wide range of applications, spanning from disease outbreaks, emergency detection, local event recommendation, to fake news localization, online marketing planning, and even crime control and prevention. Researchers have attempted to propose various models by combining different sources of information, including text, social relation, and contextual data, which indeed has achieved promising results. However, existing approaches still suffer from certain constraints, such as: 1) a very few samples are available and 2) prediction models are not easy to be generalized for users from new regions--which are challenges that motivate our study. In this article, we propose a general framework for identifying user geolocation--MetaGeo, which is a meta-learning-based approach, learning the prior distribution of the geolocation task in order to quickly adapt the prediction toward users from new locations. Different from typical meta-learning settings that only learn a new concept from few-shot samples, MetaGeo improves the geolocation prediction with conventional settings by ensembling numerous mini-tasks. In addition, MetaGeo incorporates probabilistic inference to alleviate two issues inherent in training with few samples: location uncertainty and task ambiguity. To demonstrate the effectiveness of MetaGeo, we conduct extensive experimental evaluations on three real-world datasets and compare the performance with several state-of-the-art benchmark models. The results demonstrate the superiority of MetaGeo in both the settings where the predicted locations/regions are known or have not been seen during training.
引用
收藏
页码:8950 / 8964
页数:15
相关论文
共 79 条
[1]  
Ahmed A., 2013, P WORLD WIDE WEB C, P25
[2]  
Amit R, 2018, PR MACH LEARN RES, V80
[3]  
Amitay E., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P273, DOI 10.1145/1008992.1009040
[4]  
Antoniou A., 2019, INT C LEARN REPR, DOI DOI 10.1145/3351556.3351574
[5]  
Backstrom L., 2010, Proceedings of the 19th international conference on World wide web, P61
[6]   Twitter Geolocation: A Hybrid Approach [J].
Bakerman, Jordan ;
Pazdernik, Karl ;
Wilson, Alyson ;
Fairchild, Geoffrey ;
Bahran, Rian .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (03)
[7]  
Bauer M., 2017, ABS170600326 CORR, P1
[8]  
Bertinetto L, 2016, ADV NEUR IN, V29
[9]  
Bishop Christopher M., 2006, Pattern recognition and machine learning, DOI [10.1007/978-0-387-45528-0, DOI 10.1007/978-0-387-45528-0]
[10]  
Blundell C, 2015, PR MACH LEARN RES, V37, P1613