A Statistical Approach for Studying the Spatio-Temporal Distribution of Geolocated Tweets in Urban Environments

被引:7
作者
Santa, Fernando [1 ]
Henriques, Roberto [1 ]
Torres-Sospedra, Joaquin [2 ]
Pebesma, Edzer [3 ]
机构
[1] Univ Nova Lisboa, Nova Informat Management Sch NOVA IMS, P-1070312 Lisbon, Portugal
[2] Univ Jaume 1, Inst New Imaging Technol, Castellon de La Plana 12071, Spain
[3] Univ Munster, Inst Geoinformat, D-48149 Munster, Germany
基金
欧盟地平线“2020”;
关键词
human activity; spatio-temporal statistics; negative binomial regression; functional principal component analysis; multitype spatial point patterns; SOCIAL MEDIA; LAND-USE; TWITTER; MOBILITY; DYNAMICS; BEHAVIOR; SENSORS; MODELS; CITY;
D O I
10.3390/su11030595
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
An in-depth descriptive approach to the dynamics of the urban population is fundamental as a first step towards promoting effective planning and designing processes in cities. Understanding the behavioral aspects of human activities can contribute to their effective management and control. We present a framework, based on statistical methods, for studying the spatio-temporal distribution of geolocated tweets as a proxy for where and when people carry out their activities. We have evaluated our proposal by analyzing the distribution of collected geolocated tweets over a two-week period in the summer of 2017 in Lisbon, London, and Manhattan. Our proposal considers a negative binomial regression analysis for the time series of counts of tweets as a first step. We further estimate a functional principal component analysis of second-order summary statistics of the hourly spatial point patterns formed by the locations of the tweets. Finally, we find groups of hours with a similar spatial arrangement of places where humans develop their activities through hierarchical clustering over the principal scores. Social media events are found to show strong temporal trends such as seasonal variation due to the hour of the day and the day of the week in addition to autoregressive schemas. We have also identified spatio-temporal patterns of clustering, i.e., groups of hours of the day that present a similar spatial distribution of human activities.
引用
收藏
页数:29
相关论文
共 81 条
[31]   Understanding human activity patterns based on space-time-semantics [J].
Huang, Wei ;
Li, Songnian .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 121 :1-10
[32]   Spatial-Temporal Event Detection from Geo-Tagged Tweets [J].
Huang, Yuqian ;
Li, Yue ;
Shan, Jie .
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2018, 7 (04)
[33]  
Husson F., 2017, EXPLORATORY MULTIVAR, V2nd, DOI [10.1201/b21874, DOI 10.1201/B21874]
[34]  
Illian J., 2006, Case Studies in Spatial Point Process Modeling, P135
[35]   A TOOLBOX FOR FITTING COMPLEX SPATIAL POINT PROCESS MODELS USING INTEGRATED NESTED LAPLACE APPROXIMATION (INLA) [J].
Illian, Janine B. ;
Sorbye, Sigrunn H. ;
Rue, Havard .
ANNALS OF APPLIED STATISTICS, 2012, 6 (04) :1499-1530
[37]   Clustering daily patterns of human activities in the city [J].
Jiang, Shan ;
Ferreira, Joseph ;
Gonzalez, Marta C. .
DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 25 (03) :478-510
[38]   Users of the world, unite! The challenges and opportunities of Social Media [J].
Kaplan, Andreas M. ;
Haenlein, Michael .
BUSINESS HORIZONS, 2010, 53 (01) :59-68
[39]   Short term effects of air pollution on health: A European approach using epidemiologic time series data: The APHEA protocol [J].
Katsouyanni, K ;
Schwartz, J ;
Spix, C ;
Touloumi, G ;
Zmirou, D ;
Zanobetti, A ;
Wojtyniak, B ;
Vonk, JM ;
Tobias, A ;
Ponka, A ;
Medina, S ;
Bacharova, L ;
Anderson, HR .
JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 1996, 50 :S12-S18
[40]   Human Activity Recognition and Pattern Discovery [J].
Kim, Eunju ;
Helal, Sumi ;
Cook, Diane .
IEEE PERVASIVE COMPUTING, 2010, 9 (01) :48-53