Where's @Waldo?: Finding Users on Twitter

被引:10
作者
Clarkson, Kyle [1 ]
Srivastava, Gautam [2 ,5 ]
Meawad, Fatma [4 ]
Dwivedi, Ashutosh Dhar [2 ,3 ]
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC, Canada
[2] Brandon Univ, Dept Math & Comp Sci, Brandon, MB, Canada
[3] Polish Acad Sci, Inst Comp Sci, Warsaw, Poland
[4] Singapore Inst Technol, Informat & Commun Technol, Singapore, Singapore
[5] China Med Univ, Res Ctr Interneural Comp, Taichung, Taiwan
来源
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2019, PT II | 2019年 / 11509卷
关键词
Data mining; Geolocation inference; Natural language processing; Twitter; Eclipse; Statistical probability; LOCATION;
D O I
10.1007/978-3-030-20915-5_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In today's social media world we are provided with an impressive amount of data about users and their societal interactions. This offers computer scientists among others many new opportunities for research exploration. Arguably, one of the most interesting areas of work is that of predicting events and developments based on social media data and trends. We have recently seen this happen in many areas including politics, finance, entertainment, market demands, health, and many others. Furthermore, there has been a lot of attention garnered on being able to predict a user's location based on their online activity taking into account that large amount of social interaction online is done behind usernames and anonymous titles. This area of research is well-known as geolocation inference. In this paper, we propose a novel model for geolocation inference of social media users using the aid of a discrete event: the Solar Eclipse of 2017. Being able to use the path pf the eclipse and timing of its path of travel to infer a user's location is a unique model seen only in this paper. We apply this unique model to Twitter data gathered from users during the Solar Eclipse of 2017 and attempt to determine if certain features of the data itself are indicative of users viewing the eclipse or of similar events. Taking advantage of Stanford's natural language processing software, we also consider the proportions and existences of many words, part-of-speech tags, and relations between users both found in our sample data, in an attempt to find key features of users who are viewing the eclipse. We discuss our results using our unique model and conclude by discussing the strengths and weaknesses of the model with the resulting potential future work.
引用
收藏
页码:338 / 349
页数:12
相关论文
共 26 条
[1]  
[Anonymous], 2008, TECHNICAL REPORT
[2]  
[Anonymous], 2013, 24 ACM C HYP SOC MED, DOI [DOI 10.1145/2481492.2481494, 10.1145/2481492.2481494]
[3]  
[Anonymous], 2011, P 20 ACM INT C INF K, DOI DOI 10.1145/2063576.2063959
[4]  
Backstrom L., 2010, Proceedings of the 19th international conference on World wide web, P61, DOI [DOI 10.1145/1772690.1772698, 10.1145/1772690.1772698]
[5]  
Bifet A, 2010, LECT NOTES ARTIF INT, V6332, P1, DOI 10.1007/978-3-642-16184-1_1
[6]  
Caverlee J., 2013, IEEE Data Eng. Bull, V36, P33
[7]  
Cheng R, 2006, LECT NOTES COMPUT SC, V4258, P393
[8]  
Cheng Z., 2010, CIKM 10, P759, DOI DOI 10.1145/1871437.1871535
[9]  
Conover WJ., 1999, WILEY SERIES PROBABI, V3rd
[10]   Inferring the Location of Twitter Messages Based on User Relationships [J].
Davis, Clodoveu A., Jr. ;
Pappa, Gisele L. ;
Rocha de Oliveira, Diogo Renno ;
Arcanjo, Filipe de L. .
TRANSACTIONS IN GIS, 2011, 15 (06) :735-751