Training sample selection for robust multi-year within-season crop classification using machine learning

被引:18
作者
Gao, Zitian [1 ,2 ]
Guo, Danlu [1 ,3 ]
Ryu, Dongryeol [1 ]
Western, Andrew W. [1 ]
机构
[1] Univ Melbourne, Dept Infrastructure Engn, Parkville, Vic 3010, Australia
[2] Commonwealth Sci & Ind Res Org, Environm, Black Mt, ACT 2601, Australia
[3] Australian Natl Univ, ANU Coll Engn Comp & Cybernet, Sch Engn, Canberra, ACT, Australia
基金
澳大利亚研究理事会;
关键词
Within -season crop classification; Random forest; Support vector machine; Training sample selection; Multi; -year; Landsat; 8; RANDOM FOREST; ACCURACY; CORN; SET;
D O I
10.1016/j.compag.2023.107927
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Within-season crop classification using multispectral imagery is an effective way to generate timely crop maps that can support water and crop management; however, developing such models is challenging due to limited satellite imagery and ground truth data available during the season. This study investigated ways to optimize the use of multi-year samples in a within-season crop classification model, aiming to enable accurate within-season crop mapping across years. Our study focused on classifying field-scale corn/maize, cotton, and rice in south-eastern Australia from 2013 to 2019. The crop classification model was based on the random forest and sup-port vector machine algorithms applied to Landsat 8 multispectral bands. We designed four experiments to understand the influences of training sample selection on model accuracy. Specifically, we analyzed how the within-season classification accuracies are affected by 1) training sample size; 2) proportions of classification classes; 3) the inclusion of a non-crop class (e.g., fallow land) in the training sample, and 4) training samples collected from different years. We found that 1) the training sample size should be sufficiently large to ensure within-season classification accuracy; 2) using training samples for each crop type in proportion to their occurrence within the landscape results in more accurate multi-year classification; 3) the inclusion of the non -crop class can reduce the accuracy with which crop types are distinguished, so the proportion of the non-crop class should be maintained at a relatively low level, and 4) predicting the current year with training samples from previous years can lead to a minor decline in accuracy compared to using samples only from the current year. These training sample settings were adopted to develop a final model. We found that the model accuracy continues to improve as more input imagery is added as the cropping season progresses, with a rapid rate of initial improvement which then slows. December, the third month of the summer growing season, is the earliest time that reliable maps were generated, with an overall accuracy of 86 % and user's accuracies for all crops exceeding 80 %. Our proposed experiments are robust and transferable to other regions and seasons to assist the development of within-season crop maps, and can thus be valuable tools to support agricultural management.
引用
收藏
页数:16
相关论文
共 41 条
[1]   Assessing in-season crop classification performance using satellite data: a test case in Northern Italy [J].
Azar, Ramin ;
Villa, Paolo ;
Stroppiana, Daniela ;
Crema, Alberto ;
Boschetti, Mirco ;
Brivio, Pietro Alessandro .
EUROPEAN JOURNAL OF REMOTE SENSING, 2016, 49 :361-380
[2]  
BoM, 2021, EVAPOTRANSPIRATION C
[3]  
BOM, 2022, Climate data online
[4]   A high-performance and in-season classification system of field-level crop types using time-series Landsat data and a machine learning approach [J].
Cai, Yaping ;
Guan, Kaiyu ;
Peng, Jian ;
Wang, Shaowen ;
Seifert, Christopher ;
Wardlow, Brian ;
Li, Zhan .
REMOTE SENSING OF ENVIRONMENT, 2018, 210 :35-47
[5]   Assessing the Accuracy of Multiple Classification Algorithms for Crop Classification Using Landsat-8 and Sentinel-2 Data [J].
Chakhar, Amal ;
Ortega-Terol, Damian ;
Hernandez-Lopez, David ;
Ballesteros, Rocio ;
Ortega, Jose E. ;
Moreno, Miguel A. .
REMOTE SENSING, 2020, 12 (11)
[6]  
CICL, 2021, BRIEF OV CICL
[7]  
CICL, 2019, ANN COMPL REP
[8]   A REVIEW OF ASSESSING THE ACCURACY OF CLASSIFICATIONS OF REMOTELY SENSED DATA [J].
CONGALTON, RG .
REMOTE SENSING OF ENVIRONMENT, 1991, 37 (01) :35-46
[9]   In-Season Mapping of Irrigated Crops Using Landsat 8 and Sentinel-1 Time Series [J].
Demarez, Valerie ;
Helen, Florian ;
Marais-Sicre, Claire ;
Baup, Frederic .
REMOTE SENSING, 2019, 11 (02)
[10]   Training set size requirements for the classification of a specific class [J].
Foody, Giles M. ;
Mathur, Ajay ;
Sanchez-Hernandez, Carolina ;
Boyd, Doreen S. .
REMOTE SENSING OF ENVIRONMENT, 2006, 104 (01) :1-14