Location Inference for Non-Geotagged Tweets in User Timelines

被引:24
作者
Li, Pengfei [1 ]
Lu, Hua [2 ]
Kanhabua, Nattiya [3 ]
Zhao, Sha [1 ]
Pan, Gang [1 ,4 ]
机构
[1] Zhejiang Univ, Dept Comp Sci, Hangzhou 310027, Zhejiang, Peoples R China
[2] Aalborg Univ, Dept Comp Sci, DK-9220 Aalborg, Denmark
[3] NTENT, Barcelona 08018, Spain
[4] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Zhejiang, Peoples R China
关键词
Twitter; location inference; bayes; LSTM; EFFICIENT; TWITTER;
D O I
10.1109/TKDE.2018.2852764
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media like Twitter have become globally popular in the past decade. Thanks to the high penetration of smartphones, social media users are increasingly going mobile. This trend has contributed to foster various location based services deployed on social media, the success of which heavily depends on the availability and accuracy of users' location information. However, only a very small fraction of tweets in Twitter are geo-tagged. Therefore, it is necessary to infer locations for tweets in order to attain the purpose of those location based services. In this paper, we tackle this problem by scrutinizing Twitter user timelines in a novel fashion. First of all, we split each user's tweet timeline temporally into a number of clusters, each tending to imply a distinct location. Subsequently, we adapt two machine learning models to our setting and design classifiers that classify each tweet cluster into one of the pre-defined location classes at the city level. The Bayes based model focuses on the information gain of words with location implications in the user-generated contents. The convolutional LSTM model treats user-generated contents and their associated locations as sequences and employs bidirectional LSTM and convolution operation to make location inferences. The two models are evaluated on a large set of real Twitter data. The experimental results suggest that our models are effective at inferring locations for non-geotagged tweets and the models outperform the state-of-the-art and alternative approaches significantly in terms of inference accuracy.
引用
收藏
页码:1150 / 1165
页数:16
相关论文
共 10 条
  • [1] Twitter Event Photo Detection Using both Geotagged Tweets and Non-geotagged Photo Tweets
    Takamu, Kaneko
    Hang, Nga Do
    Yanai, Keiji
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2015, PT II, 2015, 9315 : 128 - 138
  • [2] Fine-Grained Geolocalisation of Non-Geotagged Tweets
    Paraskevopoulos, Pavlos
    Palpanas, Themis
    PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, : 105 - 112
  • [3] Collecting Non-Geotagged Local Tweets via Bandit Algorithms
    Ueda, Saki
    Yamaguchi, Yuto
    Kitagawa, Hiroyuki
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2331 - 2334
  • [4] Where has this tweet come from? Fast and fine-grained geolocalization of non-geotagged tweets
    Paraskevopoulos, Pavlos
    Palpanas, Themis
    SOCIAL NETWORK ANALYSIS AND MINING, 2016, 6 (01)
  • [5] Location estimation of non-geo-tagged tweets
    Samuel, Avinash
    Sharma, Dilip Kumar
    EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 205 - 216
  • [6] Location estimation of non-geo-tagged tweets
    Avinash Samuel
    Dilip Kumar Sharma
    Evolutionary Intelligence, 2021, 14 : 205 - 216
  • [7] Online Social Network User Home Location Inference Based on Heterogeneous Networks
    Fei, Gaolei
    Liu, Yang
    Hu, Guangmin
    Wen, Sheng
    Xiang, Yang
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2024, 21 (06) : 5509 - 5525
  • [8] Tweets from Justin Bieber's Heart: The Dynamics of the "Location" Field in User Profiles
    Hecht, Brent
    Hong, Lichan
    Suh, Bongwon
    Chi, Ed H.
    29TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2011, : 237 - 246
  • [9] Twitter User Location Inference Based on Representation Learning and Label Propagation
    Tian, Hechan
    Zhang, Meng
    Luo, Xiangyang
    Liu, Fenlin
    Qiao, Yaqiong
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2648 - 2654
  • [10] A Location Inference Algorithm Based-on Smart Phone User Data Modelling
    Kim, Sang-il
    Jung, Wan
    Kim, Hwa-sung
    2014 16TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2014, : 1232 - 1236