Extracting Human Activity Areas from Large-Scale Spatial Data with Varying Densities

被引:1
作者
Shen, Xiaoqi [1 ]
Shi, Wenzhong [2 ]
Liu, Zhewei [2 ]
Zhang, Anshu [2 ]
Wang, Lukang [1 ]
Zeng, Fanxin [2 ]
机构
[1] China Univ Min & Technol, Sch Environm Sci & Spatial Informat, Xuzhou 221116, Jiangsu, Peoples R China
[2] Hong Kong Polytech Univ, Otto Poon Charitable Fdn Smart City Res Inst, Hong Kong 999077, Peoples R China
基金
国家重点研发计划;
关键词
human activity; area extraction; large-scale spatial data; varying density; clustering algorithm; HOTSPOT DETECTION; BIG DATA; FOOTPRINTS; PATTERNS; MOBILITY; GPS;
D O I
10.3390/ijgi11070397
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human activity area extraction, a popular research topic, refers to mining meaningful location clusters from raw activity data. However, varying densities of large-scale spatial data create a challenge for existing extraction methods. This research proposes a novel area extraction framework (ELV) aimed at tackling the challenge by using clustering with an adaptive distance parameter and a re-segmentation strategy with noise recovery. Firstly, a distance parameter was adaptively calculated to cluster high-density points, which can reduce the uncertainty introduced by human subjective factors. Secondly, the remaining points were assigned according to the spatial characteristics of the clustered points for a more reasonable judgment of noise points. Then, to face the varying density problem, a re-segmentation strategy was designed to segment the appropriate clusters into low- and high-density clusters. Lastly, the noise points produced in the re-segmentation step were recovered to reduce unnecessary noise. Compared with other algorithms, ELV showed better performance on real-life datasets and reached 0.42 on the Silhouette coefficient (SC) indicator, with an improvement of more than 16.67%. ELV ensures reliable clustering results, especially when the density differences of the activity points are large, and can be valuable in some applications, such as location prediction and recommendation.
引用
收藏
页数:35
相关论文
共 67 条
[1]  
Aggarwal C.C., 2018, DATA CLUSTERING CHAP, V231, P258
[2]   Survey of State-of-the-Art Mixed Data Clustering Algorithms [J].
Ahmad, Amir ;
Khan, Shehroz S. .
IEEE ACCESS, 2019, 7 :31883-31902
[3]  
[Anonymous], 2010, SIGSPATIAL
[4]  
[Anonymous], ArcGIS Pro
[5]   Using GPS to learn significant locations and predict movement across multiple users [J].
Ashbrook, Daniel ;
Starner, Thad .
PERSONAL AND UBIQUITOUS COMPUTING, 2003, 7 (05) :275-286
[6]   Learning Tableau: A data visualization tool [J].
Batt, Steven ;
Grealis, Tara ;
Harmon, Oskar ;
Tomolonis, Paul .
JOURNAL OF ECONOMIC EDUCATION, 2020, 51 (3-4) :317-328
[7]   A survey of clustering algorithms for an industrial context [J].
Benabdellah, Abla Chaouni ;
Benghabrit, Asmaa ;
Bouhaddou, Imane .
SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2018), 2019, 148 :291-302
[8]   A Novel Clustering Algorithm Based on DPC and PSO [J].
Cai, Jianghui ;
Wei, Huiling ;
Yang, Haifeng ;
Zhao, Xujun .
IEEE ACCESS, 2020, 8 :88200-88214
[9]   Design and Application of an Attractiveness Index for Urban Hotspots Based on GPS Trajectory Data [J].
Cai, Li ;
Jiang, Fang ;
Zhou, Wei ;
Li, Keqin .
IEEE ACCESS, 2018, 6 :55976-55985
[10]  
Calinski T., 1974, Communications in Statistics, V3, P1, DOI [DOI 10.1080/03610927408827101, 10.1080/03610927408827101]