Extracting Human Activity Areas from Large-Scale Spatial Data with Varying Densities

Cited by: 1
Authors
Shen, Xiaoqi [1]
Shi, Wenzhong [2]
Liu, Zhewei [2]
Zhang, Anshu [2]
Wang, Lukang [1]
Zeng, Fanxin [2]
Affiliations
[1] China Univ Min & Technol, Sch Environm Sci & Spatial Informat, Xuzhou 221116, Jiangsu, Peoples R China
[2] Hong Kong Polytech Univ, Otto Poon Charitable Fdn Smart City Res Inst, Hong Kong 999077, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
human activity; area extraction; large-scale spatial data; varying density; clustering algorithm; HOTSPOT DETECTION; BIG DATA; FOOTPRINTS; PATTERNS; MOBILITY; GPS;
DOI
10.3390/ijgi11070397
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Human activity area extraction, a popular research topic, refers to mining meaningful location clusters from raw activity data. However, varying densities of large-scale spatial data create a challenge for existing extraction methods. This research proposes a novel area extraction framework (ELV) aimed at tackling the challenge by using clustering with an adaptive distance parameter and a re-segmentation strategy with noise recovery. Firstly, a distance parameter was adaptively calculated to cluster high-density points, which can reduce the uncertainty introduced by human subjective factors. Secondly, the remaining points were assigned according to the spatial characteristics of the clustered points for a more reasonable judgment of noise points. Then, to face the varying density problem, a re-segmentation strategy was designed to segment the appropriate clusters into low- and high-density clusters. Lastly, the noise points produced in the re-segmentation step were recovered to reduce unnecessary noise. Compared with other algorithms, ELV showed better performance on real-life datasets and reached 0.42 on the Silhouette coefficient (SC) indicator, with an improvement of more than 16.67%. ELV ensures reliable clustering results, especially when the density differences of the activity points are large, and can be valuable in some applications, such as location prediction and recommendation.
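The abstract does not give the ELV formulas, so the following is only a minimal sketch of the general idea behind its first step: deriving a clustering distance from the data rather than picking it by hand, then evaluating the result with the Silhouette coefficient. The median k-th-neighbor heuristic, the plain DBSCAN-style clustering, and all function names here are assumptions chosen for illustration; they are not the authors' method, and the re-segmentation and noise-recovery steps are not reproduced.

```python
import math
from statistics import mean, median


def kth_neighbor_distances(points, k):
    """Distance from each point to its k-th nearest neighbor (brute force, O(n^2))."""
    result = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        result.append(dists[k - 1])
    return result


def adaptive_eps(points, k=3):
    """Assumed heuristic (NOT the ELV formula): median k-th neighbor distance."""
    return median(kth_neighbor_distances(points, k))


def dbscan(points, eps, min_pts=3):
    """Plain DBSCAN-style clustering; returns one label per point, -1 meaning noise."""
    labels = [None] * len(points)

    def neighbors(i):
        # Includes the point itself, as in the usual DBSCAN definition.
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisional noise; may later become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reachable from a core point: border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reach = neighbors(j)
            if len(reach) >= min_pts:  # j is a core point, so keep expanding
                queue.extend(reach)
    return labels


def silhouette(points, labels):
    """Mean Silhouette coefficient over clustered points (noise points are skipped)."""
    clusters = sorted({c for c in labels if c != -1})
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")
    scores = []
    for i, p in enumerate(points):
        if labels[i] == -1:
            continue
        own = [math.dist(p, points[j]) for j in range(len(points))
               if labels[j] == labels[i] and j != i]
        if not own:  # singleton cluster scores 0 by convention
            scores.append(0.0)
            continue
        a = mean(own)  # mean distance to the point's own cluster
        b = min(mean(math.dist(p, points[j]) for j in range(len(points))
                     if labels[j] == c)
                for c in clusters if c != labels[i])  # nearest other cluster
        scores.append((b - a) / max(a, b))
    return mean(scores)
```

A call such as `labels = dbscan(points, adaptive_eps(points))` followed by `silhouette(points, labels)` would chain the two steps. Because this sketch uses one global distance for all points, it does not address the varying-density problem that the abstract's re-segmentation strategy targets.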
Pages: 35