Detection of hierarchical crowd activity structures in geographic point data

被引:0
作者
Salazar J.M. [1 ]
López-Ramírez P. [1 ]
Siordia O.S. [2 ]
机构
[1] Center for Research in Geospatial Information Sciences (Centrogeo), Tlalpan, Mexico City
[2] National Geointeligence Laboratory, Yucatan, Merida
关键词
Clustering; Crowd activity; Gis; Hierarchical scales; Point pattern analysis;
D O I
10.7717/PEERJ-CS.978
中图分类号
学科分类号
摘要
The pervasive adoption of GPS-enabled sensors has lead to an explosion on the amount of geolocated data that captures a wide range of social interactions. Part of this data can be conceptualized as event data, characterized by a single point signal at a given location and time. Event data has been used for several purposes such as anomaly detection and land use extraction, among others. To unlock the potential offered by the granularity of this new sources of data it is necessary to develop new analytical tools stemming from the intersection of computational science and geographical analysis. Our approach is to link the geographical concept of hierarchical scale structures with density based clustering in databases with noise to establish a common framework for the detection of crowd activity hierarchical structures in geographic point data. Our contribution is threefold: first, we develop a tool to generate synthetic data according to a distribution commonly found on geographic event data sets; second, we propose an improvement of the available methods for automatic parameter selection in densitybased spatial clustering of applications with noise (DBSCAN) algorithm that allows its iterative application to uncover hierarchical scale structures on event databases and, lastly, we propose a framework for the evaluation of different algorithms to extract hierarchical scale structures. Our results show that our approach is successful both as a general framework for the comparison of crowd activity detection algorithms and, in the case of our automatic DBSCAN parameter selection algorithm, as a novel approach to uncover hierarchical structures in geographic point data sets. © Copyright 2022 Salazar et al.
引用
收藏
相关论文
共 45 条
[31]  
Openshaw S., The modifiable areal unit problem, (1984)
[32]  
O'Sullivan D., Big Data: why (Oh Why?) This computational social science?, Geography and the Geoweb: rethinking Research in the Advent of Big Data, (2017)
[33]  
O'Sullivan D, Manson SM., Do physicists have geography envy? And what can geographers learn from it?, Annals of the Association of American Geographers, 105, 4, pp. 704-722, (2015)
[34]  
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E., Scikit-learn: machine Learning in Python, Journal of Machine Learning Research, 12, pp. 2825-2830, (2011)
[35]  
Robertson C, Feick R., Inference and analysis across spatial supports in the big data era: uncertain point observations and geographic contexts, Transactions in GIS, 22, 2, pp. 455-476, (2018)
[36]  
Salazar JM, Lopez-Ramrez P, Sanchez-Siordia O., HierarchicalGeoClustering, (2021)
[37]  
Satopaa V, Albrecht J, Irwin D, Raghavan B., Finding a ‘‘Kneedle’’ in a Haystack: detecting knee points in system behavior, 2011 31st international conference on distributed computing systems workshops, pp. 166-171, (2011)
[38]  
Singleton A, Arribas-Bel D., Geographic data science, Geographical Analysis, 53, 1, pp. 61-75, (2021)
[39]  
Starczewski A, Goetzen P, Er MJ., A new method for automatic determining of the DBSCAN parameters, Journal of Artificial Intelligence and Soft Computing Research, 10, 3, pp. 209-221, (2020)
[40]  
Steiger E, Resch B, Zipf A., Exploration of spatiotemporal and semantic clusters of twitter data using unsupervised neural networks, International Journal of Geographical Information Science, 30, 9, pp. 1694-1716, (2016)