The Sensitivity of Mapping Methods to Reference Data Quality: Training Supervised Image Classifications with Imperfect Reference Data

被引:75
作者
Foody, Giles M. [1 ]
Pal, Mahesh [2 ]
Rocchini, Duccio [3 ]
Garzon-Lopez, Carol X. [4 ]
Bastin, Lucy [5 ]
机构
[1] Univ Nottingham, Sch Geog, Nottingham NG7 2RD, England
[2] Natl Inst Technol, Dept Civil Engn, Kurukshetra 136119, Haryana, India
[3] Fdn Edmund Mach, Res & Innovat Ctr, Dept Biodivers & Mol Ecol, Via E Mach 1, I-38010 San Michele All Adige, TN, Italy
[4] Univ Picardy Jules Verne, FRE CNRS 3498, Ecol & Dynam Human Influenced Syst Res Unit EDYSA, 1 Rue Louvels, FR-80037 Amiens 1, France
[5] Aston Univ, Sch Engn & Appl Sci, Birmingham B4 7ET, W Midlands, England
关键词
classification; training; error; accuracy; remote sensing; land cover; LAND-COVER CLASSIFICATION; SUPPORT VECTOR MACHINES; ACCURACY ASSESSMENT; SPECIES MISIDENTIFICATION; DISCRIMINANT-ANALYSIS; ECOSYSTEM SERVICES; NEURAL-NETWORK; ERROR; MAPS; SVM;
D O I
10.3390/ijgi5110199
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The accuracy of a map is dependent on the reference dataset used in its construction. Classification analyses used in thematic mapping can, for example, be sensitive to a range of sampling and data quality concerns. With particular focus on the latter, the effects of reference data quality on land cover classifications from airborne thematic mapper data are explored. Variations in sampling intensity and effort are highlighted in a dataset that is widely used in mapping and modelling studies; these may need accounting for in analyses. The quality of the labelling in the reference dataset was also a key variable influencing mapping accuracy. Accuracy varied with the amount and nature of mislabelled training cases with the nature of the effects varying between classifiers. The largest impacts on accuracy occurred when mislabelling involved confusion between similar classes. Accuracy was also typically negatively related to the magnitude of mislabelled cases and the support vector machine (SVM), which has been claimed to be relatively insensitive to training data error, was the most sensitive of the set of classifiers investigated, with overall classification accuracy declining by 8% (significant at 95% level of confidence) with the use of a training set containing 20% mislabelled cases.
引用
收藏
页数:20
相关论文
共 51 条
[41]  
PAL M, 2005, INT J REMOTE SENS
[42]   Evaluation of SVM, RVM and SMLR for Accurate Image Classification With Limited Ground Data [J].
Pal, Mahesh ;
Foody, Giles M. .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2012, 5 (05) :1344-1355
[43]   Feature Selection for Classification of Hyperspectral Data by SVM [J].
Pal, Mahesh ;
Foody, Giles M. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2010, 48 (05) :2297-2307
[44]   Sources of error in accuracy assessment of thematic land-cover maps in the Brazilian Amazon [J].
Powell, RL ;
Matzke, N ;
de Souza, C ;
Clark, M ;
Numata, I ;
Hess, LL ;
Roberts, DA ;
Clark, M ;
Numata, I ;
Hess, LL ;
Roberts, DA .
REMOTE SENSING OF ENVIRONMENT, 2004, 90 (02) :221-234
[45]   Automated Training Sample Extraction for Global Land Cover Mapping [J].
Radoux, Julien ;
Lamarche, Celine ;
Van Bogaert, Eric ;
Bontemps, Sophie ;
Brockmann, Carsten ;
Defourny, Pierre .
REMOTE SENSING, 2014, 6 (05) :3965-3987
[46]   Assessing species misidentification rates through quality assurance of vegetation monitoring [J].
Scott, WA ;
Hallam, CJ .
PLANT ECOLOGY, 2003, 165 (01) :101-115
[47]   Sparse Bayesian learning and the relevance vector machine [J].
Tipping, ME .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (03) :211-244
[48]  
TOM CH, 1984, PHOTOGRAMM ENG REM S, V50, P193
[49]   Global characterization and monitoring of forest cover using Landsat data: opportunities and challenges [J].
Townshend, John R. ;
Masek, Jeffrey G. ;
Huang, Chengquan ;
Vermote, Eric. F. ;
Gao, Feng ;
Channan, Saurabh ;
Sexton, Joseph O. ;
Feng, Min ;
Narasimhan, Raghuram ;
Kim, Dohyung ;
Song, Kuan ;
Song, Danxia ;
Song, Xiao-Peng ;
Noojipady, Praveen ;
Tan, Bin ;
Hansen, Matthew C. ;
Li, Mengxue ;
Wolfe, Robert E. .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2012, 5 (05) :373-397
[50]  
Vapnik V., 1999, The nature of statistical learning theory