Accurate prediction of sugarcane yield using a random forest algorithm

被引:230
作者
Everingham, Yvette [1 ,2 ]
Sexton, Justin [1 ,2 ]
Skocaj, Danielle [2 ,3 ]
Inman-Bamber, Geoff [2 ,4 ]
机构
[1] James Cook Univ, Ctr Trop Environm & Sustainabil Sci, Townsville, Qld 4811, Australia
[2] James Cook Univ, Coll Sci Technol & Engn, James Cook Dr, Townsville, Qld 4811, Australia
[3] Sugar Res Australia, Tully, Qld 4068, Australia
[4] Crop Sci Consulting, Townsville, Qld 4811, Australia
关键词
APSIM; Agriculture; Nitrogen; Fertilizer; Value chain; Random forest; SELECTION; CITIES;
D O I
10.1007/s13593-016-0364-z
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
Foreknowledge about sugarcane crop size can help industry members make more informed decisions. There exists many different combinations of climate variables, seasonal climate prediction indices, and crop model outputs that could prove useful in explaining sugarcane crop size. A data mining method like random forests can cope with generating a prediction model when the search space of predictor variables is large. Research that has investigated the accuracy of random forests to explain annual variation in sugarcane productivity and the suitability of predictor variables generated from crop models coupled with observed climate and seasonal climate prediction indices is limited. Simulated biomass from the APSIM (Agricultural Production Systems sIMulator) sugarcane crop model, seasonal climate prediction indices and observed rainfall, maximum and minimum temperature, and radiation were supplied as inputs to a random forest classifier and a random forest regression model to explain annual variation in regional sugarcane yields at Tully, in northeastern Australia. Prediction models were generated on 1 September in the year before harvest, and then on 1 January and 1 March in the year of harvest, which typically runs from June to November. Our results indicated that in 86.36 % of years, it was possible to determine as early as September in the year before harvest if production would be above the median. This accuracy improved to 95.45 % by January in the year of harvest. The R-squared of the random forest regression model gradually improved from 66.76 to 79.21 % from September in the year before harvest through to March in the same year of harvest. All three sets of variables-(i) simulated biomass indices, (ii) observed climate, and (iii) seasonal climate prediction indices-were typically featured in the models at various stages. Better crop predictions allows farmers to improve their nitrogen management to meet the demands of the new crop, mill managers could better plan the mill's labor requirements and maintenance scheduling activities, and marketers can more confidently manage the forward sale and storage of the crop. Hence, accurate yield forecasts can improve industry sustainability by delivering better environmental and economic outcomes.
引用
收藏
页数:9
相关论文
共 36 条
[1]   Random forest regression and spectral band selection for estimating sugarcane leaf nitrogen concentration using EO-1 Hyperion hyperspectral data [J].
Abdel-Rahman, Elfatih M. ;
Ahmed, Fethi B. ;
Ismail, Riyad .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2013, 34 (02) :712-728
[2]   Creating Smart-er Cities: An Overview [J].
Allwinkle, Sam ;
Cruickshank, Peter .
JOURNAL OF URBAN TECHNOLOGY, 2011, 18 (02) :1-16
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]  
Cao S., 2014, Journal of International & Interdisciplinary Business Research, V1, P23
[5]   Smart Cities in Europe [J].
Caragliu, Andrea ;
Del Bo, Chiara ;
Nijkamp, Peter .
JOURNAL OF URBAN TECHNOLOGY, 2011, 18 (02) :65-82
[6]  
CHEN JF, 2012, MATH PROBL ENG, DOI DOI 10.1155/2012/915053
[7]  
Craig E., 2009, Intelligent data analysis: Devloping new methodologies through pattern discovery and recovery, P65, DOI [10.4018/978-1-59904-982-3.ch004, DOI 10.4018/978-1-59904-982-3.CH004]
[8]  
De'ath G, 2000, ECOLOGY, V81, P3178, DOI 10.2307/177409
[9]  
Everingham Y., 2015, Agricultural Sciences, V6, P870
[10]  
Everingham Y., 2015, Proceedings of the 37th Annual Conference of the Australian Society of Sugar Cane Technologists, V37, P8