Selection of Optimal Object Features in Object-Based Image Analysis Using Filter-Based Algorithms

Cited by: 14
Authors
Colkesen, Ismail [1]
Kavzoglu, Taskin [1]
Affiliations
[1] Gebze Tech Univ, Dept Geomat Engn, TR-41400 Gebze, Kocaeli, Turkey
Keywords
Object-based classification; Feature selection; Support vector machine; Random forest; Chi square; Information gain; MACHINE-LEARNING ALGORITHMS; LAND-COVER; CLASSIFICATION; MULTIRESOLUTION; SCALE;
DOI
10.1007/s12524-018-0807-x
Chinese Library Classification number
X [Environmental science, safety science];
Discipline classification code
08; 0830;
Abstract
With the increase in spatial resolution of recent sensors, object-based image analysis (OBIA) has gained importance for producing detailed land use maps. One of the main advantages of OBIA is that a variety of spectral, spatial and textural features can be extracted for the segmented image objects and later utilized in classification. However, using a large number of features not only increases the required computational time, but also requires a large number of ground samples, which are unavailable in most cases. For these reasons, feature selection (FS) has become an important research topic for OBIA-based classification studies. In this study, three filter-based FS algorithms, namely chi-square, information gain and ReliefF, were applied to determine the most effective object features that ensure high separability among landscape features. For this purpose, the importance degrees (i.e. ranks) of 110 input object features were first estimated by the algorithms, and a correlation-based merit function was then applied to determine the optimum feature subset size. A multi-resolution segmentation algorithm was applied to segment a WorldView-2 image. Support vector machine, random forest and nearest neighbour classifiers were utilized to classify the segmented image objects using the selected object features. Results revealed that the FS algorithms were effective for selecting the most relevant features. The classifiers produced their highest performances with 24 out of 110 features selected by the information gain (IG) algorithm. In particular, the support vector machine classifier produced the highest overall accuracy (92.00%) with the 24 features determined by the IG algorithm. An improvement of about 4% was achieved by applying the FS procedures, and it was found to be statistically significant according to the Wilcoxon signed-ranks test.
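The two-step procedure described in the abstract (rank the features with a filter, then pick the subset size with a correlation-based merit function) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes discrete feature values, uses information gain as the ranking filter, and assumes the merit function has the common correlation-based feature selection form k * r_cf / sqrt(k + k(k-1) * r_ff), with symmetrical uncertainty standing in for correlation. The toy data and all names (`features`, `labels`, `cfs_merit`, and so on) are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def information_gain(feature, labels):
    """IG = H(labels) - H(labels | feature) for a discrete feature."""
    n = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def symmetrical_uncertainty(a, b):
    """Normalised information gain in [0, 1], used here as a correlation measure."""
    denom = entropy(a) + entropy(b)
    return 0.0 if denom == 0 else 2.0 * information_gain(a, b) / denom


def cfs_merit(subset, features, labels):
    """Assumed merit form: k * r_cf / sqrt(k + k * (k - 1) * r_ff)."""
    k = len(subset)
    # Mean feature-class correlation over the subset.
    r_cf = sum(symmetrical_uncertainty(features[f], labels) for f in subset) / k
    # Mean feature-feature correlation over all pairs in the subset.
    pairs = list(combinations(subset, 2))
    r_ff = 0.0
    if pairs:
        r_ff = sum(
            symmetrical_uncertainty(features[a], features[b]) for a, b in pairs
        ) / len(pairs)
    return k * r_cf / math.sqrt(k + k * (k - 1) * r_ff)


# Hypothetical toy data: 8 image objects, 2 classes, 3 discretised features.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
features = {
    "f_informative": [0, 0, 0, 0, 1, 1, 1, 1],
    "f_partial": [0, 0, 0, 1, 0, 1, 1, 1],
    "f_noise": [0, 1, 0, 1, 0, 1, 0, 1],
}

# Step 1: rank features by information gain, highest first.
ranking = sorted(
    features, key=lambda f: information_gain(features[f], labels), reverse=True
)

# Step 2: evaluate the merit of the top-k ranked features and keep the best k.
best_k = max(
    range(1, len(ranking) + 1),
    key=lambda k: cfs_merit(ranking[:k], features, labels),
)

print("ranking:", ranking)
print("selected subset:", ranking[:best_k])
```

In this sketch the merit rewards subsets whose features correlate with the class and penalises redundancy among the features, which is why adding weakly informative or redundant features lowers the score. Continuous object features, such as those in the study, would need to be discretised first or handled with a different correlation measure.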
Pages: 1233-1242
Number of pages: 10