Clustering Algorithms for Spatial Big Data

被引:5
|
作者
Schoier, Gabriella [1 ]
Gregorio, Caterina [1 ]
机构
[1] Univ Trieste, Dept Econ Business Math & Stat Sci Bruno de Finet, DEAMS, Tigor 22, I-34100 Trieste, Italy
来源
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2017, PT IV | 2017年 / 10407卷
关键词
Spatial data mining; Clustering algorithms; DBSCAN; FSDP; K-Means; Arbitrary shape of clusters; Handling noise; Image analysis;
D O I
10.1007/978-3-319-62401-3_41
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In our time people and devices constantly generate data. User activity generates data about needs and preferences as well as the quality of their experiences in different ways: i. e. streaming a video, looking at the news, searching for a restaurant or a an hotel, playing a game with others, making purchases, driving a car. Even when people put their devices in their pockets, the network is generating location and other data that keeps services running and ready to use. This rapid developments in the availability and access to data and in particular spatially referenced data in a different areas, has induced the need for better analysis techniques to understand the various phenomena. Spatial clustering algorithms, which groups similar spatial objects into classes, can be used for the identification of areas sharing common characteristics. The aim of this paper is to analyze the performance of three different clustering algorithms i. e. the Density-Based Spatial Clustering of Applications with Noise algorithm (DBSCAN), the Fast Search by Density Peak (FSDP) algorithm and the classic K-means algorithm (K-Means) as regards the analysis of spatial big data. We propose a modification of the FSDP algorithm in order to improve its efficiency in large databases. The applications concern both synthetic data sets and satellite images.
引用
收藏
页码:571 / 583
页数:13
相关论文
共 50 条
  • [31] Review and compare clustering algorithms for navigation data analysis tasks
    Ponomareva, A. V.
    Meyta, R. V.
    PROCEEDINGS OF THE 2016 CONFERENCE ON INFORMATION TECHNOLOGIES IN SCIENCE, MANAGEMENT, SOCIAL SPHERE AND MEDICINE (ITSMSSM), 2016, 51 : 270 - 273
  • [32] Application and visualization of typical clustering algorithms in seismic data analysis
    Fan, Z.
    Xu, X.
    10TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2019) / THE 2ND INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40 2019) / AFFILIATED WORKSHOPS, 2019, 151 : 171 - 178
  • [33] The Same Size Distribution of Data Based on Unsupervised Clustering Algorithms
    Rashidov A.
    Akhatov A.
    Nazarov F.
    Lecture Notes on Data Engineering and Communications Technologies, 2023, 180 : 437 - 447
  • [34] Spatial Data Mining in the Context of Big Data
    Wang, Shuliang
    Yuan, Hanning
    2013 19TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2013), 2013, : 486 - 491
  • [35] Using clustering algorithms and GPM data to identify spatial precipitation patterns over southeastern Brazil
    Miranda, Bruno Guerreiro
    Negri, Rogerio Galante
    Pampuch, Luana Albertani
    ATMOSFERA, 2023, 37 : 365 - 381
  • [36] Similarity Measures for Spatial Clustering
    Hamdad, Leila
    Benatchba, Karima
    Ifrez, Soraya
    Mohguen, Yasmine
    COMPUTATIONAL INTELLIGENCE AND ITS APPLICATIONS, 2018, 522 : 25 - 36
  • [37] Fast and effective Big Data exploration by clustering
    Ianni, Michele
    Masciari, Elio
    Mazzeo, Giuseppe M.
    Mezzanzanica, Mario
    Zaniolo, Carlo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 102 : 84 - 94
  • [38] Approximate Clustering Ensemble Method for Big Data
    Mahmud, Mohammad Sultan
    Huang, Joshua Zhexue
    Ruby, Rukhsana
    Ngueilbaye, Alladoumbaye
    Wu, Kaishun
    IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (04) : 1142 - 1155
  • [39] Big Data Clustering based on Summary Statistics
    Fu, Junsong
    Liu, Yun
    Zhang, Zhenjiang
    Xiong, Fei
    2015 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE THEORY, SYSTEMS AND APPLICATIONS (CCITSA 2015), 2015, : 87 - 91
  • [40] A Novel Clustering Technique for Efficient Clustering of Big Data in Hadoop Ecosystem
    Sunil Kumar
    Maninder Singh
    Big Data Mining and Analytics, 2019, 2 (04) : 240 - 247