GeoZ: a Region-Based Visualization of Clustering Algorithms

被引:9
作者
ElHaj, Khalid [1 ,2 ]
Alshamsi, Dalal [1 ,2 ]
Aldahan, Ala [1 ]
机构
[1] United Arab Emirates Univ, Dept Geosci, POB 15551, Al Ain, U Arab Emirates
[2] United Arab Emirates Univ, Natl Water & Energy Ctr, POB 15551, Al Ain, U Arab Emirates
关键词
California; Clustering; Geographic coordinate system (GCS); Groundwater (GW); Machine learning (ML); Support vector machines (SVM);
D O I
10.1007/s41651-023-00146-0
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The spatial display of clustered data using machine learning (ML) as regions (bordered areas) is currently unfeasible. This problem is commonly encountered in various research fields that utilize clustering algorithms in their workflow. We present in this study an approach utilizing ML algorithm models that can be trained to any specific dataset to produce decision boundaries. These boundaries are overlaid onto the geographic coordinate system (GCS) to generate geographic clustering regions. The proposed approach is implemented in the Python Package Index (PyPI) as a geovisualization library called geographic decision zones (GeoZ). The efficiency of GeoZ was tested using a dataset of groundwater wells in the State of California. We experimented with 13 different ML models to determine the best model that predicts the existing regional distribution (subbasins). The support vector machine (SVM) algorithm produced a relatively high accuracy score and fulfilled the required criteria better than the other models. Consequently, the tested SVM model with optimized parameters was implemented in the GeoZ open-source library. However, it is important to note that limitations in the application of GeoZ may arise from the nature of the SVM algorithm, as well as the volume, discontinuity, and distribution of the data. We have attempted to address these limitations through various suggestions and solutions.
引用
收藏
页数:14
相关论文
共 27 条
  • [1] The Quickhull algorithm for convex hulls
    Barber, CB
    Dobkin, DP
    Huhdanpaa, H
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1996, 22 (04): : 469 - 483
  • [2] California Department of Water Resources (DWR), 2021, CAL GROUNDW UPD 2020, V485
  • [3] California Natural Resources Agency, 2021, PER GROUNDW LEV MEAS
  • [4] Carle David., 2015, INTRO WATER CALIFORN
  • [5] De Marchi S, 2020, BIT, V60, P441, DOI 10.1007/s10543-019-00786-z
  • [6] ElHaj K, 2023, GITHUB REPOSITORY
  • [7] ESRI, 2013, MAP SERV WORLD TOP M
  • [8] Gillies Sean, 2023, Zenodo
  • [9] Impact of Parameter Tuning for Optimizing Deep Neural Network Models for Predicting Software Faults
    Gupta, Mansi
    Rajnish, Kumar
    Bhattacharjee, Vandana
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [10] Matplotlib: A 2D graphics environment
    Hunter, John D.
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2007, 9 (03) : 90 - 95