Transferability of Recursive Feature Elimination (RFE)-Derived Feature Sets for Support Vector Machine Land Cover Classification

Cited by: 23
Authors
Ramezan, Christopher A. A. [1]
Affiliations
[1] West Virginia Univ, Dept Management Informat Syst, Morgantown, WV 26506 USA
Keywords
feature selection; machine learning; recursive feature elimination; feature set transferability; Sentinel-2A; multispectral imagery; sample size; FEATURE-SELECTION; IMAGE-ANALYSIS; VEGETATION; TEXTURE; SUBSET; GEOBIA
DOI
10.3390/rs14246218
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Remote sensing analyses frequently use feature selection methods to remove non-beneficial feature variables from the input data, which often improves classification accuracy and reduces the computational complexity of the classification. Many remote sensing analyses report the results of the feature selection process to provide insights on important feature variables for future analyses. Are these feature selection results generalizable to other classification models, or are they specific to the input dataset and classification model they were derived from? To investigate this, a series of radial basis function (RBF) support vector machine (SVM) supervised land cover classifications of Sentinel-2A Multispectral Instrument (MSI) imagery was conducted to assess the transferability of recursive feature elimination (RFE)-derived feature sets between classification models trained on different training sets acquired from the same remotely sensed image, and to classification models of other, similar remotely sensed imagery. Feature selection results for training sets acquired from the same image and from different images varied widely on small training sets (n = 108). Variability in feature selection results between training sets acquired from different images was reduced as training set size increased; however, each RFE-derived feature set was unique, even when training sample size was increased over 10-fold (n = 1895). An RFE-derived feature set transferred from a high-performing classification model was, on average, slightly more accurate in comparison to other classification models of the same image, but provided, on average, slightly lower accuracies when generalized to classification models of other, similar remotely sensed imagery. However, the effects of feature set transferability on classification accuracy were inconsistent and varied per classification model.
Specific feature selection results from other classification models or remote sensing analyses, while useful for providing general insights on feature variables, may not always generalize to provide comparable accuracies for other classification models of the same dataset, or of other, similar remotely sensed datasets. Thus, feature selection should be conducted individually for each training set within an analysis to determine the optimal feature set for the classification model.
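The abstract's observation that each RFE-derived feature set was unique can be made concrete by measuring how much two selected feature sets overlap. The sketch below is only an illustration and is not taken from the paper: the use of Jaccard similarity, the helper names (jaccard, pairwise_overlap), the training set labels, and the Sentinel-2 band names in the example data are all assumptions chosen for demonstration.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two feature sets: |A intersect B| / |A union B|."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # two empty sets are treated as identical
    return len(a & b) / len(a | b)


def pairwise_overlap(feature_sets):
    """Return {(name_i, name_j): similarity} for every pair of named feature sets."""
    return {
        (m, n): jaccard(feature_sets[m], feature_sets[n])
        for m, n in combinations(sorted(feature_sets), 2)
    }


# Hypothetical feature sets, standing in for RFE results from three
# different training sets (illustrative values, not results from the paper).
selected = {
    "train_a": {"B02", "B03", "B04", "B08"},
    "train_b": {"B03", "B04", "B08", "B11"},
    "train_c": {"B02", "B08", "B12"},
}

overlap = pairwise_overlap(selected)
for pair, score in overlap.items():
    print(pair, round(score, 2))
```

A similarity of 1.0 would mean two training sets produced identical feature sets; values well below 1.0 correspond to the kind of variability the abstract reports for small training sets.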
Pages: 25