Location Prediction for Tweets

被引:8
作者
Huang, Chieh-Yang [1 ]
Tong, Hanghang [1 ]
He, Jingrui [1 ]
Maciejewski, Ross [1 ]
机构
[1] Arizona State Univ, CIDSE, Tempe, AZ 85281 USA
来源
FRONTIERS IN BIG DATA | 2019年 / 2卷
关键词
data mining; location prediction; multi-head self-attention mechanism; joint training; deep learning; tweets; EVENT DETECTION;
D O I
10.3389/fdata.2019.00005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Geographic information provides an important insight into many data mining and social media systems. However, users are reluctant to provide such information due to various concerns, such as inconvenience, privacy, etc. In this paper, we aim to develop a deep learning based solution to predict geographic information for tweets. The current approaches bear two major limitations, including (a) hard to model the long term information and (b) hard to explain to the end users what the model learns. To address these issues, our proposed model embraces three key ideas. First, we introduce a multi-head self-attention model for text representation. Second, to further improve the result on informal language, we treat subword as a feature in our model. Lastly, the model is trained jointly with the city and country to incorporate the information coming from different labels. The experiment performed on W-NUT 2016 Geo-tagging shared task shows our proposed model is competitive with the state-of-the-art systems when using accuracy measurement, and in the meanwhile, leading to a better distance measure over the existing approaches.
引用
收藏
页数:12
相关论文
共 43 条
  • [1] [Anonymous], 2016, PROC 2 WORKSHOP NOIS
  • [2] [Anonymous], ADV LOCATION BASED S
  • [3] [Anonymous], 2015, Character-level convolutional networks for text classification
  • [4] [Anonymous], 2012, P ACM GIS, DOI DOI 10.1145/2424321.2424348
  • [5] [Anonymous], 2013, 24 ACM C HYP SOC MED, DOI [DOI 10.1145/2481492.2481494, 10.1145/2481492.2481494]
  • [6] Backstrom L., 2010, Proceedings of the 19th international conference on World wide web, P61, DOI [DOI 10.1145/1772690.1772698, 10.1145/1772690.1772698]
  • [7] A neural probabilistic language model
    Bengio, Y
    Ducharme, R
    Vincent, P
    Jauvin, C
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) : 1137 - 1155
  • [8] Bojanowski Piotr, 2017, Trans. Assoc. Comput. Linguist., V5, P135, DOI DOI 10.1162/TACL_A_00051
  • [9] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [10] Chandra S., 2011, Proceedings of the 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and IEEE Third International Conference on Social Computing (PASSAT/SocialCom 2011), P838, DOI 10.1109/PASSAT/SocialCom.2011.120