An Improved Clustering Approach for Identifying Significant Locations from Spatio-temporal Data

被引:0
作者
Rigzin Angmo
Naveen Aggarwal
Veenu Mangat
Anurag Lal
Simarpreet Kaur
机构
[1] UIET Panjab University,Department of Computer Science and Engineering
[2] UIET Panjab University,Department of Information Technology
[3] Deloitte,Deloitte USI
[4] Infosys,undefined
来源
Wireless Personal Communications | 2021年 / 121卷
关键词
Privacy; Clustering; GPS trajectories; Location Based Services;
D O I
暂无
中图分类号
学科分类号
摘要
The rapid development and adoption of Internet-of-Things (IoT) and sensors such as Global Positioning System (GPS) in our daily life allows gathering of wealth of information. This tremendous amount of sensor data can be clustered to infer information about the whereabouts of users. Researchers have proposed various algorithms for analyzing spatio-temporal data and clustering such datasets. However, a big challenge is to discover clusters with large density variation and to solve this problem most of the existing clustering algorithms manually set the input parameters. In the Density-based spatial clustering (DBSCAN) clustering algorithm, the epsilon and minimum point parameters have to be set manually for further computation. In this paper, we propose an improved DBSCAN algorithm to adaptively set minimum point parameter for dynamic length of data, based on merge sort and silhouette analysis. Validation of the proposed algorithm has been done on GeoLife GPS dataset. Experimental results show that the proposed algorithm is competitive with state-of-the-art methods for identifying users’ significant locations and whereabouts with the density variation. The paper also aims to highlight the issues pertaining to the invasion of user privacy by the potential application of the clustering algorithm on the aforementioned data.
引用
收藏
页码:985 / 1009
页数:24
相关论文
共 94 条
[1]  
Zheng Y(2010)Geolife: A collaborative social networking service among user, location and trajectory IEEE Data Eng. Bull. 33 32-39
[2]  
Xie X(2018)Identifying tourists and analyzing spatial patterns of their destinations from location-based social media data Transportation Research Part C: Emerging Technologies 96 38-54
[3]  
Ma WY(1979)AK-means clustering algorithm Journal of the Royal Statistical Society: Series C (Applied Statistics) 28 100-108
[4]  
Hasnat MM(2011)Density-based clustering Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 1 231-240
[5]  
Hasan S(2014)T-DBSCAN: A spatiotemporal density clustering for GPS trajectory segmentation International Journal of Online Engineering (iJOE) 10 19-24
[6]  
Hartigan JA(2010)EIDBSCAN: An Extended Improving DBSCAN algorithm with sampling techniques International Journal of Business Intelligence and Data Mining 5 94-111
[7]  
Wong MA(1999)OPTICS: Ordering points to identify the clustering structure SIGMOD Record (ACM Special Interest Group on Management of Data) 93 221-237
[8]  
Kriegel HP(2021)Monitoring of Ball Bearing Based on Improved Real-Time OPTICS Clustering Journal of Signal Processing Systems 10 1-51
[9]  
Kröger P(2015)Hierarchical density estimates for data clustering visualization, and outlier detection ACM Transactions on Knowledge Discovery from Data 8 82-49
[10]  
Sander J(2019)A varied density-based clustering approach for event detection from heterogeneous twitter data ISPRS International Journal of Geo-Information 8 218-43720