Twitter location (sometimes) matters: Exploring the relationship between georeferenced tweet content and nearby feature classes

Cited by: 27
Authors
Hahmann, Stefan [1]
Purves, Ross S. [2]
Burghardt, Dirk [1]
Affiliations
[1] Tech Univ Dresden, Inst Cartog, Dresden, Germany
[2] Univ Zurich Irchel, Dept Geog, Zurich, Switzerland
Keywords
correlation between location and content; mobile microblogging; natural language processing; data mining; Twitter; OpenStreetMap
DOI
10.5311/JOSIS.2014.9.185
Chinese Library Classification
P9 [Physical Geography]; K9 [Geography]
Discipline classification codes
0705; 070501
Abstract
In this paper, we investigate whether microblogging texts (tweets) produced on mobile devices are related to the geographical locations where they were posted. For this purpose, we correlate tweet topics to areas. In doing so, classified points of interest from OpenStreetMap serve as validation points. We adopted the classification and geolocation of these points to correlate with tweet content by means of manual, supervised, and unsupervised machine learning approaches. Evaluation showed the manual classification approach to be of the highest quality, followed by the supervised method, and that the unsupervised classification was of low quality. We found that the degree to which tweet content is related to nearby points of interest depends upon topic (that is, upon the OpenStreetMap category). A more general synthesis with prior research leads to the conclusion that the strength of the relationship of tweets and their geographic origin also depends upon geographic scale (where smaller scale correlations are more significant than those of larger scale).
Pages: 1-36
Page count: 36