Sugarcane Yield Prediction Through Data Mining and Crop Simulation Models

被引:0
作者
Ralph G. Hammer
Paulo C. Sentelhas
Jean C. Q. Mariano
机构
[1] ESALQ,Department of Biosystems Engineering
[2] University of São Paulo,undefined
[3] Independent Data Science Specialist,undefined
来源
Sugar Tech | 2020年 / 22卷
关键词
Yield estimation; Random forest; Boosting; Support vector machines; Crop model;
D O I
暂无
中图分类号
学科分类号
摘要
The understanding of the hierarchical importance of the factors which influence sugarcane yield can subsidize its modeling, thus contributing to the optimization of agricultural planning and crop yield estimates. The objectives of this study were to identify and ordinate the main variables that condition sugarcane yield, according to their relative importance, as well as to develop mathematical models for predicting sugarcane yield by using data mining (DM) techniques. For this, three DM techniques were applied in the analyses of databases of several sugar mills in the state of São Paulo, Brazil. Meteorological and crop management variables were analyzed through the following DM techniques: random forest; boosting; and support vector machine, and the resulting models were tested through the comparison with an independent data set. Finally, the predictive performances of these models were compared with the performance of a simple agrometeorological model, applied in the same data set. The results allowed to conclude that, within all the variables assessed, the number of cuts was the most important factor considered by all DM techniques. The comparison between the observed yields and those estimated by the DM models resulted in a root mean square error (RMSE) ranging between 19.70 and 20.03 t ha−1, which was much better than the performance of the Agroecological Zone Model, which presented RMSE ≈ 34 t ha−1.
引用
收藏
页码:216 / 225
页数:9
相关论文
共 63 条
[1]  
Baquero JE(2012)Soil physical properties and sugarcane root growth in a red Oxisol Revista Brasileira de Ciência do Solo 36 63-70
[2]  
Ralisch R(1997)Avaliação do desempenho de diferentes métodos de estimativa de evapotranspiração potencial no Estado de são Paulo, Brasil Revista Brasileira de Meteorologia 5 89-97
[3]  
Medina CC(2017)Evaluation of three sugarcane simulation models and their ensemble for yield estimation in commercially managed fields Field Crops Research 213 174-185
[4]  
Tavares Filho J(2016)Accurate prediction of sugarcane yield using a random forest algorithm Agronomy for Sustainable Development 27 1-9
[5]  
Guimarães MF(2009)An analysis of the factors that influence sugarcane yield in Northern Argentina using classification and regression trees Field Crops Research 112 149-157
[6]  
Camargo AP(1999)Modeling sugarcane production system I. Development and performance of the sugarcane module Field Crops Research 61 253-271
[7]  
Sentelhas PC(2017)Different types of data mining techniques used in agriculture—a survey International Journal of Advanced Engineering Research and Science 4 17-23
[8]  
Dias HB(2008)Building predictive models in R using the caret package Journal of Statistical Software 28 1-26
[9]  
Sentelhas PC(2002)Factors affecting cane yield and commercial cane sugar in the Tully district Australian Journal of Experimental Agriculture 42 473-480
[10]  
Everingham YL(2011)Parametrization and evaluation of predictions of CSM-CANEGRO for Brazilian sugarcane Agrononmy Journal 103 100-110