Random forest machine learning for maize yield and agronomic efficiency prediction in Ghana

被引:8
作者
Asamoah, Eric [1 ,2 ,3 ,4 ]
Heuvelink, Gerard B. M. [1 ,4 ]
Chairi, Ikram [5 ]
Bindraban, Prem S. [6 ]
Logah, Vincent [7 ]
机构
[1] Wageningen Univ & Res, Soil Geog & Landscape Grp, POB 47, NL-6700 AA Wageningen, Netherlands
[2] Mohammed VI Polytech Univ, Agr Innovat & Technol Transfer Ctr, Lot 660,Hay Moulay Rachid, Benguerir 43150, Morocco
[3] CSIR, Soil Res Inst, Kumasi, Ghana
[4] ISRIC World Soil Informat, POB 353, NL-6700 AJ Wageningen, Netherlands
[5] Mohammed VI Polytech Univ, Modelling Simulat & Data Anal, Lot 660,Hay Moulay Rachid, Benguerir 43150, Morocco
[6] Int Fertilizer Dev Ctr, Muscle Shoals, AL 35662 USA
[7] Kwame Nkrumah Univ Sci & Technol, Dept Crop & Soil Sci, Kumasi, Ghana
关键词
Agronomic efficiency; Maize yield; Modelling; Random forest algorithm; Uncertainty assessment; SUB-SAHARAN AFRICA; SOIL; MANAGEMENT; MODEL;
D O I
10.1016/j.heliyon.2024.e37065
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Maize ( Zea mays) ) is an important staple crop for food security in Sub-Saharan Africa. However, there is need to increase production to feed a growing population. In Ghana, this is mainly done by increasing acreage with adverse environmental consequences, rather than yield increment per unit area. Accurate prediction of maize yields and nutrient use efficiency in production is critical to making informed decisions toward economic and ecological sustainability. We trained the random forest machine learning algorithm to predict maize yield and agronomic efficiency in Ghana using soil, climate, environment, and management factors, including fertilizer application. We calibrated and evaluated the performance of the random forest machine learning algorithm using a 5 x 10-fold nested cross-validation approach. Data from 482 maize field trials consisting of 3136 georeferenced treatment plots conducted in Ghana from 1991 to 2020 were used to train the algorithm, identify important predictor variables, and quantify the uncertainties associated with the random forest predictions. The mean error, root mean squared error, model efficiency coefficient and 90 % prediction interval coverage probability were calculated. The results obtained on test data demonstrate good prediction performance for yield (MEC = 0.81) and moderate performance for agronomic efficiency (MEC = 0.63, 0.55 and 0.54 for AE-N, AE-P and AE-K, respectively). We found that climatic variables were less important predictors than soil variables for yield prediction, but temperature was of key importance to yield prediction and rainfall to agronomic efficiency. The developed random forest models provided a better understanding of the drivers of maize yield and agronomic efficiency in a tropical climate and an insight towards improving fertilizer recommendations for sustainable maize production and food security in SubSaharan Africa.
引用
收藏
页数:20
相关论文
共 82 条
[21]  
Departamento de Asuntos Economicos y Sociales de las Naciones Unidas, 2019, Dep. Econ. Soc. Aff. World Popul. Prospect., V2019, P2
[22]   Nested leave-two-out cross-validation for the optimal crop yield model selection [J].
Dinh, Thi Lan Anh ;
Aires, Filipe .
GEOSCIENTIFIC MODEL DEVELOPMENT, 2022, 15 (09) :3519-3535
[23]  
Dobermann A., 2007, PROC INT FERTIL IND, P22
[24]   The impact of agricultural management on soil aggregation and carbon storage is regulated by climatic thresholds across a 3000 km European gradient [J].
Edlinger, Anna ;
Garland, Gina ;
Banerjee, Samiran ;
Degrune, Florine ;
Garcia-Palacios, Pablo ;
Herzog, Chantal ;
Pescador, David Sanchez ;
Romdhane, Sana ;
Ryo, Masahiro ;
Saghai, Aurelien ;
Hallin, Sara ;
Maestre, Fernando T. T. ;
Philippot, Laurent ;
Rillig, Matthias C. C. ;
van Der Heijden, Marcel G. A. .
GLOBAL CHANGE BIOLOGY, 2023, 29 (11) :3177-3192
[25]   Forecasting yield by integrating agrarian factors and machine learning models: A survey [J].
Elavarasan, Dhivya ;
Vincent, Durai Raj ;
Sharma, Vishal ;
Zomaya, Albert Y. ;
Srinivasan, Kathiravan .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 155 :257-282
[26]   Accurate prediction of sugarcane yield using a random forest algorithm [J].
Everingham, Yvette ;
Sexton, Justin ;
Skocaj, Danielle ;
Inman-Bamber, Geoff .
AGRONOMY FOR SUSTAINABLE DEVELOPMENT, 2016, 36 (02)
[27]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[28]   Random Forests modelling for the estimation of mango (Mangifera indica L. cv. Chok Anan) fruit yields under different irrigation regimes [J].
Fukuda, Shinji ;
Spreer, Wolfram ;
Yasunaga, Eriko ;
Yuge, Kozue ;
Sardsud, Vicha ;
Mueller, Joachim .
AGRICULTURAL WATER MANAGEMENT, 2013, 116 :142-150
[29]  
Ghana Statistical Service, 2021, Population and Housing Census: Preliminary Report
[30]   Geostatistical modelling of uncertainty in soil science [J].
Goovaerts, P .
GEODERMA, 2001, 103 (1-2) :3-26