Impact of Training Set Size and Lead Time on Early Tomato Crop Mapping Accuracy

被引:6
作者
Croci, Michele [1 ,2 ]
Impollonia, Giorgio [1 ,2 ]
Blandinieres, Henri [1 ]
Colauzzi, Michele [1 ]
Amaducci, Stefano [1 ,2 ]
机构
[1] Univ Cattolica Sacro Cuore, Dept Sustainable Crop Prod, I-29122 Piacenza, Italy
[2] Univ Cattolica Sacro Cuore, Remote Sensing & Spatial Anal Res Ctr CRAST, I-29122 Piacenza, Italy
关键词
lead time; Sentinel-2; training set size; RSM; machine learning; agrifood supply chains; LAND-COVER CLASSIFICATION; IMAGE CLASSIFICATION; SATELLITE DATA; VEGETATION; SELECTION; INDEX; VALIDATION; SENTINEL-2; MACHINE; SERIES;
D O I
10.3390/rs14184540
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Estimating key crop parameters (e.g., phenology, yield prediction) is a prerequisite for optimizing agrifood supply chains through the use of satellite imagery, but requires timely and accurate crop mapping. The moment in the season and the number of training sites used are two main drivers of crop classification performance. The combined effect of these two parameters was analysed for tomato crop classification, through 125 experiments, using the three main machine learning (ML) classifiers (neural network, random forest, and support vector machine) using a response surface methodology (RSM). Crop classification performance between minority (tomato) and majority ('other crops') classes was assessed through two evaluation metrics: Overall Accuracy (OA) and G-Mean (GM), which were calculated on large independent test sets (over 400,000 fields). RSM results demonstrated that lead time and the interaction between the number of majority and minority classes were the two most important drivers for crop classification performance for all three ML classifiers. The results demonstrate the feasibility of preharvest classification of tomato with high performance, and that an RSM-based approach enables the identification of simultaneous effects of several factors on classification performance. SVM achieved the best grading performances across the three ML classifiers, according to both evaluation metrics. SVM reached highest accuracy (0.95 of OA and 0.97 of GM) earlier in the season (low lead time) and with less training sites than the other two classifiers, permitting a reduction in cost and time for ground truth collection through field campaigns.
引用
收藏
页数:17
相关论文
共 58 条
[1]  
[Anonymous], GDAL DOCUMENTATION
[2]  
[Anonymous], 2001, Geocarto Int, DOI [DOI 10.1080/10106040108542184, 10.1080/10106040108542184]
[3]   A survey of cross-validation procedures for model selection [J].
Arlot, Sylvain ;
Celisse, Alain .
STATISTICS SURVEYS, 2010, 4 :40-79
[4]   Assessing in-season crop classification performance using satellite data: a test case in Northern Italy [J].
Azar, Ramin ;
Villa, Paolo ;
Stroppiana, Daniela ;
Crema, Alberto ;
Boschetti, Mirco ;
Brivio, Pietro Alessandro .
EUROPEAN JOURNAL OF REMOTE SENSING, 2016, 49 :361-380
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
Congalton R.G., 2008, Assessing the Accuracy of Remotely Sensed Data: Principles and Practices, V2nd ed.
[7]  
Craig M., 2013, A literature review of crop area estimation
[8]  
Dean A, 2017, SPRINGER TEXTS STAT, P565, DOI 10.1007/978-3-319-52250-0_16
[9]   Sentinel-2: ESA's Optical High-Resolution Mission for GMES Operational Services [J].
Drusch, M. ;
Del Bello, U. ;
Carlier, S. ;
Colin, O. ;
Fernandez, V. ;
Gascon, F. ;
Hoersch, B. ;
Isola, C. ;
Laberinti, P. ;
Martimort, P. ;
Meygret, A. ;
Spoto, F. ;
Sy, O. ;
Marchese, F. ;
Bargellini, P. .
REMOTE SENSING OF ENVIRONMENT, 2012, 120 :25-36
[10]  
FAO IFAD IMF OECD UNCTAD WFP the World Bank the WTO IFPRI the UN HLTF, 2011, Price Volatility in Food and Agricultural Markets: Policy Responses Policy Report for the G-20