Isolation Forests to Evaluate Class Separability and the Representativeness of Training and Validation Areas in Land Cover Classification

被引:19
作者
Alonso-Sarria, Francisco [1 ]
Valdivieso-Ros, Carmen [1 ]
Gomariz-Castillo, Francisco [1 ,2 ]
机构
[1] Univ Murcia, Inst Univ Agua & Medio Ambiente, Edificio D,Campus Espinardo S-N, E-30001 Murcia, Spain
[2] Inst Euromediterraneo Agua, Campus Espinardo S-N, Murcia 30001, Spain
关键词
training area representativeness; class separability; Landsat-8; random tree ensembles; random forest; SAMPLE SELECTION; ACCURACY; INDEX;
D O I
10.3390/rs11243000
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Supervised land cover classification from remote sensing imagery is based on gathering a set of training areas to characterise each of the classes and to train a predictive model that is then used to predict land cover in the rest of the image. This procedure relies mainly on the assumptions of statistical separability of the classes and the representativeness of the training areas. This paper uses isolation forests, a type of random tree ensembles, to analyse both assumptions and to easily correct lack of representativeness by digitising new training areas where needed to improve the classification of a Landsat-8 set of images with Random Forest. The results show that the improved set of training areas after the isolation forest analysis is more representative of the whole image and increases classification accuracy. Besides, the distribution of isolation values can be useful to estimate class separability. A class separability parameter that summarises such distributions is proposed. This parameter is more correlated to omission and commission errors than other separability measures such as the Jeffries-Matusita distance.
引用
收藏
页数:21
相关论文
共 41 条
[1]   Land use/cover classification of arid and semi-arid Mediterranean landscapes using Landsat ETM [J].
Alrababah, M. A. ;
Alhamad, M. N. .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2006, 27 (13) :2703-2718
[2]  
[Anonymous], 2009, Kernel methods for remote sensing data analysis
[4]  
Berk R.A., 2016, STAT LEARNING REGRES
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Modification of the random forest algorithm to avoid statistical dependence problems when classifying remote sensing imagery [J].
Canovas-Garcia, Fulgencio ;
Alonso-Sarria, Francisco ;
Gomariz-Castillo, Francisco ;
Onate-Valdivieso, Fernando .
COMPUTERS & GEOSCIENCES, 2017, 103 :1-11
[7]  
Chavez PS, 1996, PHOTOGRAMM ENG REM S, V62, P1025
[8]   Remote sensing image-based analysis of the relationship between urban heat island and land use/cover changes [J].
Chen, Xiao-Ling ;
Zhao, Hong-Mei ;
Li, Ping-Xiang ;
Yin, Zhi-Yong .
REMOTE SENSING OF ENVIRONMENT, 2006, 104 (02) :133-146
[9]  
Chuvieco E., 2010, Teledeteccion ambiental: La observacion de la tierra desde el espacio
[10]  
Congalton R., 2008, ASSESSING ACCURACY R, P335