GLDM: Geo-location prediction of twitter users with deep learning methods

被引:2
作者
Al-Jamaan, Rawabe [1 ]
Yklef, Mourad [1 ]
Alothaim, Abdulrahman [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
关键词
Convolutional neural network; location estimation; machine learning; natural language processing; Twitter;
D O I
10.3233/JIFS-230518
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social networks like Twitter are extremely popular and widely used, which has increased interest in studying the information posted there. One such analytical application is extracting location information of users for real-time monitoring of the objects and events of interest, such as political and social events, disease surveillance, natural calamities, and crime prevention. Identifying geographic location is a nontrivial task, as user profiles contain outdated and inaccurate location information. Furthermore, extracting geographical information from Arabic tweets is challenging since they contain many nonstandard data (dialects), complex structures, abbreviations, grammatical and spelling mistakes, etc. This study focuses on the localization of Saudi Arabian users who tweet in Arabic. This study proposes a convolutional neural network-based deep learning model to predict a Twitter user's region-level location using user profiles, text texts, place attachments, and historical tweets. The model was evaluated empirically on a dataset of 95,739 tweets written in Arabic and produced by 4,331 users from Saudi Arabia cities. Regarding classification accuracy, the proposed CNN model outperformed machine learning classifiers such as NB, LR, and SVM with a 60% accuracy on the test set. This study is the first of its kind, aimed at localizing Saudi users based on their tweets.
引用
收藏
页码:2723 / 2734
页数:12
相关论文
共 40 条
  • [1] Agarap A. F., 2018, ARXIV
  • [2] A survey of location inference techniques on Twitter
    Ajao, Oluwaseun
    Hong, Jun
    Liu, Weiru
    [J]. JOURNAL OF INFORMATION SCIENCE, 2015, 41 (06) : 855 - 864
  • [3] Social Media in Disaster Risk Reduction and Crisis Management
    Alexander, David E.
    [J]. SCIENCE AND ENGINEERING ETHICS, 2014, 20 (03) : 717 - 733
  • [4] Alkhatnai M, 2020, COMPUT SIST, V24, P1607, DOI [10.13053/cys-24-4-3878, 10.13053/CyS-24-4-3878]
  • [5] Allan J., 2022, FINANCES ONLINE
  • [6] Amajd M., 2017, INT C ACTUAL PROBLEM, P364
  • [7] Amitay E., 2004, Proceedings of Sheffield SIGIR 2004. The Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P273, DOI 10.1145/1008992.1009040
  • [8] Amjad M., 2021, Acta Polytechnica Hungarica, P1785
  • [9] Analysis of COVID-19 Pandemic Using Artificial Intelligence
    Amjad, Maaz
    Rodriguez Chavez, Yuriria
    Nayab, Zaryyab
    Zhila, Alisa
    Sidorov, Grigori
    Gelbukh, Alexander
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2020, PT II, 2020, 12469 : 65 - 73
  • [10] [Anonymous], 2009, PROC 18 INT C WORLD, DOI DOI 10.1145/1526709.1526899