Clustering Algorithms for Spatial Big Data

被引:5
|
作者
Schoier, Gabriella [1 ]
Gregorio, Caterina [1 ]
机构
[1] Univ Trieste, Dept Econ Business Math & Stat Sci Bruno de Finet, DEAMS, Tigor 22, I-34100 Trieste, Italy
来源
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2017, PT IV | 2017年 / 10407卷
关键词
Spatial data mining; Clustering algorithms; DBSCAN; FSDP; K-Means; Arbitrary shape of clusters; Handling noise; Image analysis;
D O I
10.1007/978-3-319-62401-3_41
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In our time people and devices constantly generate data. User activity generates data about needs and preferences as well as the quality of their experiences in different ways: i. e. streaming a video, looking at the news, searching for a restaurant or a an hotel, playing a game with others, making purchases, driving a car. Even when people put their devices in their pockets, the network is generating location and other data that keeps services running and ready to use. This rapid developments in the availability and access to data and in particular spatially referenced data in a different areas, has induced the need for better analysis techniques to understand the various phenomena. Spatial clustering algorithms, which groups similar spatial objects into classes, can be used for the identification of areas sharing common characteristics. The aim of this paper is to analyze the performance of three different clustering algorithms i. e. the Density-Based Spatial Clustering of Applications with Noise algorithm (DBSCAN), the Fast Search by Density Peak (FSDP) algorithm and the classic K-means algorithm (K-Means) as regards the analysis of spatial big data. We propose a modification of the FSDP algorithm in order to improve its efficiency in large databases. The applications concern both synthetic data sets and satellite images.
引用
收藏
页码:571 / 583
页数:13
相关论文
共 50 条
  • [1] On the Problem of Clustering Spatial Big Data
    Schoier, Gabriella
    Borruso, Giuseppe
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2015, PT III, 2015, 9157 : 688 - 697
  • [2] Big Data and Clustering Algorithms
    Ajin, V. W.
    Kumar, Lekshmy D.
    2016 INTERNATIONAL CONFERENCE ON RESEARCH ADVANCES IN INTEGRATED NAVIGATION SYSTEMS (RAINS), 2016,
  • [3] A survey on parallel clustering algorithms for Big Data
    Dafir, Zineb
    Lamari, Yasmine
    Slaoui, Said Chah
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (04) : 2411 - 2443
  • [4] Scalable Clustering Algorithms for Big Data: A Review
    Mahdi, Mahmoud A.
    Hosny, Khalid M.
    Elhenawy, Ibrahim
    IEEE ACCESS, 2021, 9 : 80015 - 80027
  • [5] A survey on parallel clustering algorithms for Big Data
    Zineb Dafir
    Yasmine Lamari
    Said Chah Slaoui
    Artificial Intelligence Review, 2021, 54 : 2411 - 2443
  • [6] Spatial Clustering Based on Analysis of Big Data in Digital Marketing
    Ivaschenko, Anton
    Stolbova, Anastasia
    Golovnin, Oleg
    ARTIFICIAL INTELLIGENCE: (RCAI 2019), 2019, 1093 : 335 - 347
  • [7] Parallel and distributed clustering framework for big spatial data mining
    Bendechache, Malika
    Tari, A-Kamel
    Kechadi, M-Tahar
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2019, 34 (06) : 671 - 689
  • [8] The Modeling and Simulation of Data Clustering Algorithms in Data Mining with Big Data
    Chen, Weiru
    Oliverio, Jared
    Kim, Jin Ho
    Shen, Jiayue
    JOURNAL OF INDUSTRIAL INTEGRATION AND MANAGEMENT-INNOVATION AND ENTREPRENEURSHIP, 2019, 4 (01):
  • [9] Differential identifiability clustering algorithms for big data analysis
    Shang, Tao
    Zhao, Zheng
    Ren, Xujie
    Liu, Jianwei
    SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (05)
  • [10] A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis
    Fahad, Adil
    Alshatri, Najlaa
    Tari, Zahir
    Alamri, Abdullah
    Khalil, Ibrahim
    Zomaya, Albert Y.
    Foufou, Sebti
    Bouras, Abdelaziz
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (03) : 267 - 279