A surrogate model based on feature selection techniques and regression learners to improve soybean yield prediction in southern France

被引:23
作者
Corrales, David Camilo [1 ,2 ]
Schoving, Celine [3 ]
Raynal, Helene [1 ]
Debaeke, Philippe [1 ]
Journet, Etienne-Pascal [1 ,4 ]
Constantin, Julie [1 ]
机构
[1] Univ Toulouse, INRAE, UMR AGIR, F-31326 Castanet Tolosan, France
[2] Univ Cauca, Grp Ingn Telemat, Popayan, Colombia
[3] Terres Inovia, Baziege, France
[4] Univ Toulouse, INRAE, CNRS, LIPME, F-31326 Castanet Tolosan, France
基金
欧盟地平线“2020”;
关键词
STICS; Regression learners; Filter; Wrapper; Embedded; SOIL-CROP MODEL; CLASSIFICATION; STICS; CALIBRATION; ACCURACY; FILTER; WATER;
D O I
10.1016/j.compag.2021.106578
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Empirical and process-based models are currently used to predict crop yield at field and regional levels. A mechanistic model named STICS (Multidisciplinary Simulator for Standard Crops) has been used to simulate soybean grain yield in several environments, including southern France. STICS simulates at a daily step the effects of climate, soil and management practices on plant growth, development and production. In spite of good performances to predict total aboveground biomass, poor results were obtained for final grain yield. In order to improve yield prediction, a surrogate model was developed from STICS dynamic simulations, feature selection techniques and regression learners. STICS was used to simulate functional variables at given growth stages and over selected phenological phases. The most representative variables were selected through feature selection techniques (filter, wrapper and embedded), and a subset of variables were used to train the regression learners Linear regression (LR), Support vector regression (SVR), Back propagation neural network (BPNN), Random forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO) and M5 decision tree. The subset of variables selected by wrapper method combined with regression models SVR (R2 = 0. 7102; subset of variables = 6) and LR (R2 = 0. 6912; subset of variables = 14) provided the best results. SVR and LR models improved significantly the soybean yield predictions in southern France in comparison to STICS simulations (R2 = 0.040).
引用
收藏
页数:19
相关论文
共 71 条
  • [1] [Anonymous], 2009, FORMALISATIONS PARAM
  • [2] [Anonymous], 2021, Food and Agriculture Organization of the United Nations. Crop. Prod. Data
  • [3] Gauging the sources of uncertainty in soybean yield simulations using the MONICA model
    Battisti, Rafael
    Parker, Phillip S.
    Sentelhas, Paulo C.
    Nendel, Claas
    [J]. AGRICULTURAL SYSTEMS, 2017, 155 : 9 - 18
  • [4] Analysis of potential yields and yield gaps of rainfed soybean in India using CROPGRO-Soybean model
    Bhatia, V. S.
    Singh, Piara
    Wani, S. P.
    Chauhan, G. S.
    Rao, A. V. R. Kesava
    Mishra, A. K.
    Sriniuas, K.
    [J]. AGRICULTURAL AND FOREST METEOROLOGY, 2008, 148 (8-9) : 1252 - 1265
  • [5] Bischl B, 2016, J MACH LEARN RES, V17
  • [6] Breiman L., 2001, Machine Learning, V45, P5
  • [7] Breiman L., 1984, Classification and Regression Trees, DOI DOI 10.1201/9781315139470
  • [8] Support Vector Machines for classification and regression
    Brereton, Richard G.
    Lloyd, Gavin R.
    [J]. ANALYST, 2010, 135 (02) : 230 - 267
  • [9] STICS: a generic model for the simulation of crops and their water and nitrogen balances. I. Theory and parameterization applied to wheat and corn
    Brisson, N
    Mary, B
    Ripoche, D
    Jeuffroy, MH
    Ruget, F
    Nicoullaud, B
    Gate, P
    Devienne-Barret, F
    Antonioletti, R
    Durr, C
    Richard, G
    Beaudoin, N
    Recous, S
    Tayot, X
    Plenet, D
    Cellier, P
    Machet, JM
    Meynard, JM
    Delecolle, R
    [J]. AGRONOMIE, 1998, 18 (5-6): : 311 - 346
  • [10] EMPIRICAL-MODELS FOR THE SPATIAL-DISTRIBUTION OF WILDLIFE
    BUCKLAND, ST
    ELSTON, DA
    [J]. JOURNAL OF APPLIED ECOLOGY, 1993, 30 (03) : 478 - 495