Uncertainty and Overfitting in Fluvial Landform Classification Using Laser Scanned Data and Machine Learning: A Comparison of Pixel and Object-Based Approaches

被引:12
作者
Szabo, Zsuzsanna Csatarine [1 ]
Mikita, Tomas [2 ]
Negyesi, Gabor [3 ]
Varga, Orsolya Gyongyi [4 ]
Burai, Peter [5 ]
Takacs-Szilagyi, Laszlo [4 ]
Szabo, Szilard [3 ]
机构
[1] Univ Debrecen, Doctoral Sch Earth Sci, Dept Phys Geog & Geoinformat, Egyet Ter 1, H-4032 Debrecen, Hungary
[2] Mendel Univ Brno, Dept Forest Management & Appl Geoinformat, Zemedelska 3, Brno 61300, Czech Republic
[3] Univ Debrecen, Dept Phys Geog & Geoinformat, Egyet Ter 1, H-4032 Debrecen, Hungary
[4] Envirosense Hungary Ltd, H-4281 Letavertes, Hungary
[5] Univ Debrecen, Remote Sensing Ctr, Boszormenyi Ut 138, H-4023 Debrecen, Hungary
关键词
geomorphometry; terrain analysis; floodplain; random forest; F1; recursive feature elimination; DIGITAL ELEVATION MODELS; AIRBORNE LIDAR; RANDOM FORESTS; TISZA RIVER; POINT-BAR; GEOMORPHOMETRY; HABITAT; ENVIRONMENTS; DEPOSITION; DIVERSITY;
D O I
10.3390/rs12213652
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Floodplains are valuable scenes of water management and nature conservation. A better understanding of their geomorphological characteristic helps to understand the main processes involved. We performed a classification of floodplain forms in a naturally developed area in Hungary using a Digital Terrain Model (DTM) of aerial laser scanning. We derived 60 geomorphometric variables from the DTM and prepared a geomorphological map of 265 forms (crevasse channels, point bars, swales, levees). Random Forest classification was conducted with Recursive Feature Elimination (RFE) on the objects (mean pixel values by forms) and on the pixels of the variables. We also evaluated the classification probabilities (CP), the spatial uncertainties (SU), and the overfitting in the function of the number of the variables. We found that the object-based method had a better performance (95%) than the pixel-based method (78%). RFE helped to identify the most important 13-20 variables, maintaining the high model performance and reducing the overfitting. However, CP and SU were not efficient measures of classification accuracy as they were not in accordance with the class level accuracy metric. Our results help to understand classification results and the specific limits of laser scanned DTMs. This methodology can be useful in geomorphologic mapping.
引用
收藏
页码:1 / 29
页数:29
相关论文
共 111 条
[1]   Comparison of Machine Learning Methods for Mapping the Stand Characteristics of Temperate Forests Using Multi-Spectral Sentinel-2 Data [J].
Ahmadi, Kourosh ;
Kalantar, Bahareh ;
Saeidi, Vahideh ;
Harandi, Elaheh K. G. ;
Janizadeh, Saeid ;
Ueda, Naonori .
REMOTE SENSING, 2020, 12 (18)
[2]   When power analyses based on pilot data are biased: Inaccurate effect size estimators and follow-up bias [J].
Albers, Casper ;
Lakens, Daniel .
JOURNAL OF EXPERIMENTAL SOCIAL PSYCHOLOGY, 2018, 74 :187-195
[3]  
Alho P., 2009, Nord. J. Surv. Real Estate Res, V6, p21 39
[4]  
Allen J.R.L., 1965, SEDIMENTOLOGY, V5, P89, DOI 10.1111/j.1365-3091.1965.tb01561.x
[5]  
[Anonymous], 2001, P POSTER PRESENTATIO
[6]  
[Anonymous], 2019, SUSTAINABILITY-BASEL, DOI DOI 10.3390/su11185140
[7]  
[Anonymous], 2018, Discovering Statistics using IBM SPSS Statistics
[8]   DEM-based depth in sink as an environmental estimator [J].
Antonic, O ;
Hatic, D ;
Pernar, R .
ECOLOGICAL MODELLING, 2001, 138 (1-3) :247-254
[9]   Seasonal evaporation cycle in oxbow lakes formed along the Tisza River in Hungary for flood control [J].
Babka, Beata ;
Futo, Istvan ;
Szabo, Szilard .
HYDROLOGICAL PROCESSES, 2018, 32 (13) :2009-2019
[10]  
Barrile Vincenzo, 2018, WSEAS Transactions on Environment and Development, V14, P45