Artificial intelligence approach for the prediction of Robusta coffee yield using soil fertility properties

被引:113
作者
Kouadio, Louis [1 ]
Deo, Ravinesh C. [2 ,3 ]
Byrareddy, Vivekananda [1 ]
Adamowski, Jan F. [4 ]
Mushtaq, Shahbaz [1 ]
Van Phuong Nguyen [5 ]
机构
[1] Univ Southern Queensland, Inst Life Sci & Environm, Ctr Appl Climate Sci, Toowoomba, Qld 4350, Australia
[2] Univ Southern Queensland, Ctr Sustainable Agr Syst, Sch Agr Computat & Environm Sci, Springfield, Qld 4300, Australia
[3] Univ Southern Queensland, Ctr Appl Climate Sci, Inst Life Sci & Environm, Springfield, Qld 4300, Australia
[4] McGill Univ, Fac Agr & Environm Sci, Dept Bioresource Engn, Montreal, PQ H9X 3V9, Canada
[5] Western Highlands Agr & Forestry Sci Inst, Buon Ma Thuot, Dak Lak, Vietnam
关键词
Smallholder farms; Robusta coffee; Soil fertility; Extreme learning machine; Machine learning in agriculture; EXTREME LEARNING MACHINES; GLOBAL SOLAR-RADIATION; NEURAL-NETWORK MODEL; RANDOM FORESTS; ORGANIC-MATTER; REFINED INDEX; REGRESSION; CLASSIFICATION; PERFORMANCE; ALGORITHM;
D O I
10.1016/j.compag.2018.10.014
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
As a commodity for daily consumption, coffee plays a crucial role in the economy of several African, American and Asian countries; yet, the accurate prediction of coffee yield based on environmental, climatic and soil fertility conditions remains a challenge for agricultural system modellers. The ability of an Extreme Learning Machine (ELM) model to analyse soil fertility properties and to generate an accurate estimation of Robusta coffee yield was assessed in this study. The performance of 18 different ELM-based models with single and multiple combinations of the predictor variables based on the soil organic matter (SOM), available potassium, boron, sulphur, zinc, phosphorus, nitrogen, exchangeable calcium, magnesium, and pH, was evaluated. The ELM model's performance was compared to that of existing predictive tools: Multiple Linear Regression (MLR) and Random Forest (RF). Individual model performance and inter-model performance comparisons were based on the root mean square error (RMSE), mean absolute error (MAE), Willmott's Index (WI), Nash-Sutcliffe efficiency coefficient (E-NS), and the Legates and McCabe's Index (E-LM,) in the independent testing dataset . In the independent testing phase, an ELM model constructed with SOM, available potassium and available sulphur as predictor variables generated the most accurate coffee yield estimate (i.e., RMSE = 496.35 kg ha(-1) or 13.6%, and MAE = 326.40 kg ha(-1) or +/- 7.9%). This contrasted with the less accurate MLR (RMSE = 1072.09 kg ha(-1) and MAE = 797.60 kg ha(-1)) and RF (RMSE = 1087.35 kg ha(-1) and MAE = 769.57 kg ha(-1)) model. Normalized metrics showed the ELM model's ability to yield highly accurate results: WI = 0.9952, E-NS = 0.406 and E-LM = 0.431. In comparison to the MLR and RF models, the adoption of the ELM model as an improved class of artificial intelligence models for coffee yield prediction in smallholder farms in this study constitutes an original contribution to the agronomic sector, particularly with respect to the appropriate selection of most optimal soil properties that can be used in the prediction of optimal coffee yield. The potential utility of coupling artificial intelligence algorithms with biophysical-crop models (i.e., as a data-intelligent automation tool) in decision-support systems that implement precision agriculture, in an effort to improve yield in smallholder farms based on carefully screened soil fertility dataset was confirmed.
引用
收藏
页码:324 / 338
页数:15
相关论文
共 85 条
  • [1] Cotton yield prediction with Markov Chain Monte Carlo-based simulation model integrated with genetic programing algorithm: A new hybrid copula driven approach
    Ali, Mumtaz
    Deo, Ravinesh C.
    Downs, Nathan J.
    Maraseni, Tek
    [J]. AGRICULTURAL AND FOREST METEOROLOGY, 2018, 263 : 428 - 448
  • [2] Allison L. E., 1965, AGRONOMY, V9, P1346, DOI 10.2134/agronmonogr9.2.2ed.c29
  • [3] [Anonymous], 2016, STOCH ENV RES RISK A, DOI DOI 10.1007/S00477-016-1265-Z
  • [4] [Anonymous], 2016, HIST DAT GLOB COFF T
  • [5] Apaydm A, 1994, UYGULAMAH ISTATISTIK
  • [6] Mapping groundwater contamination risk of multiple aquifers using multi-model ensemble of machine learning algorithms
    Barzegar, Rahim
    Moghaddam, Asghar Asghari
    Deo, Ravinesh
    Fijani, Elham
    Tziritis, Evangelos
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2018, 621 : 697 - 712
  • [7] Is the productivity of organic farms restricted by the supply of available nitrogen?
    Berry, PM
    Sylvester-Bradley, R
    Philipps, L
    Hatch, DJ
    Cuttle, SP
    Rayns, FW
    Gosling, P
    [J]. SOIL USE AND MANAGEMENT, 2002, 18 : 248 - 255
  • [8] DETERMINATION OF TOTAL, ORGANIC, AND AVAILABLE FORMS OF PHOSPHORUS IN SOILS
    BRAY, RH
    KURTZ, LT
    [J]. SOIL SCIENCE, 1945, 59 (01) : 39 - 45
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] Estimating generalization error on two-class datasets using out-of-bag estimates
    Bylander, T
    [J]. MACHINE LEARNING, 2002, 48 (1-3) : 287 - 297