Random forest machine learning for maize yield and agronomic efficiency prediction in Ghana

被引:8
作者
Asamoah, Eric [1 ,2 ,3 ,4 ]
Heuvelink, Gerard B. M. [1 ,4 ]
Chairi, Ikram [5 ]
Bindraban, Prem S. [6 ]
Logah, Vincent [7 ]
机构
[1] Wageningen Univ & Res, Soil Geog & Landscape Grp, POB 47, NL-6700 AA Wageningen, Netherlands
[2] Mohammed VI Polytech Univ, Agr Innovat & Technol Transfer Ctr, Lot 660,Hay Moulay Rachid, Benguerir 43150, Morocco
[3] CSIR, Soil Res Inst, Kumasi, Ghana
[4] ISRIC World Soil Informat, POB 353, NL-6700 AJ Wageningen, Netherlands
[5] Mohammed VI Polytech Univ, Modelling Simulat & Data Anal, Lot 660,Hay Moulay Rachid, Benguerir 43150, Morocco
[6] Int Fertilizer Dev Ctr, Muscle Shoals, AL 35662 USA
[7] Kwame Nkrumah Univ Sci & Technol, Dept Crop & Soil Sci, Kumasi, Ghana
关键词
Agronomic efficiency; Maize yield; Modelling; Random forest algorithm; Uncertainty assessment; SUB-SAHARAN AFRICA; SOIL; MANAGEMENT; MODEL;
D O I
10.1016/j.heliyon.2024.e37065
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Maize ( Zea mays) ) is an important staple crop for food security in Sub-Saharan Africa. However, there is need to increase production to feed a growing population. In Ghana, this is mainly done by increasing acreage with adverse environmental consequences, rather than yield increment per unit area. Accurate prediction of maize yields and nutrient use efficiency in production is critical to making informed decisions toward economic and ecological sustainability. We trained the random forest machine learning algorithm to predict maize yield and agronomic efficiency in Ghana using soil, climate, environment, and management factors, including fertilizer application. We calibrated and evaluated the performance of the random forest machine learning algorithm using a 5 x 10-fold nested cross-validation approach. Data from 482 maize field trials consisting of 3136 georeferenced treatment plots conducted in Ghana from 1991 to 2020 were used to train the algorithm, identify important predictor variables, and quantify the uncertainties associated with the random forest predictions. The mean error, root mean squared error, model efficiency coefficient and 90 % prediction interval coverage probability were calculated. The results obtained on test data demonstrate good prediction performance for yield (MEC = 0.81) and moderate performance for agronomic efficiency (MEC = 0.63, 0.55 and 0.54 for AE-N, AE-P and AE-K, respectively). We found that climatic variables were less important predictors than soil variables for yield prediction, but temperature was of key importance to yield prediction and rainfall to agronomic efficiency. The developed random forest models provided a better understanding of the drivers of maize yield and agronomic efficiency in a tropical climate and an insight towards improving fertilizer recommendations for sustainable maize production and food security in SubSaharan Africa.
引用
收藏
页数:20
相关论文
共 82 条
[61]  
Ragasa C., 2014, GSSP Policy Note 5
[62]   EarthEnv-DEM90: A nearly-global, void-free, multi-scale smoothed, 90m digital elevation model from fused ASTER and SRTM data [J].
Robinson, Natalie ;
Regetz, James ;
Guralnick, Robert P. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 87 :57-67
[63]   Effects of soil texture and rates of K input on potassium balance in tropical soil [J].
Rosolem, C. A. ;
Steiner, F. .
EUROPEAN JOURNAL OF SOIL SCIENCE, 2017, 68 (05) :658-666
[64]  
Ryu Choonghyun, 2024, CRAN, DOI 10.32614/CRAN.package.dlookr
[65]   Agronomic gain: Definition, approach, and application [J].
Saito, Kazuki ;
Six, Johan ;
Komatsu, Shota ;
Snapp, Sieglinde ;
Rosenstock, Todd ;
Arouna, Aminou ;
Cole, Steven ;
Taulya, Godfrey ;
Vanlauwe, Bernard .
FIELD CROPS RESEARCH, 2021, 270
[66]   Terra and aqua MODIS products available from NASA GES DAAC [J].
Savtchenko, A ;
Ouzounov, D ;
Ahmad, S ;
Acker, J ;
Leptoukh, G ;
Koziana, J ;
Nickless, D .
TRACE CONSTITUENTS IN THE TROPOSPHERE AND LOWER STRATOSPHERE, 2004, 34 (04) :710-714
[67]   Hyperparameter tuning and performance assessment of statistical and machine-learning algorithms using spatial data [J].
Schratz, Patrick ;
Muenchow, Jannes ;
Iturritxa, Eugenia ;
Richter, Jakob ;
Brenning, Alexander .
ECOLOGICAL MODELLING, 2019, 406 :109-120
[68]   A novel method to estimate model uncertainty using machine learning techniques [J].
Solomatine, Dimitri P. ;
Shrestha, Durga Lal .
WATER RESOURCES RESEARCH, 2009, 45
[69]   Bias in random forest variable importance measures: Illustrations, sources and a solution [J].
Strobl, Carolin ;
Boulesteix, Anne-Laure ;
Zeileis, Achim ;
Hothorn, Torsten .
BMC BIOINFORMATICS, 2007, 8 (1)
[70]   Support vector machine-based open crop model (SBOCM): Case of rice production in China [J].
Su Ying-xue ;
Xu Huan ;
Yan Li-jiao .
SAUDI JOURNAL OF BIOLOGICAL SCIENCES, 2017, 24 (03) :537-547