An evaluation of the statistical methods for testing the performance of crop models with observed data

被引:263
作者
Yang, J. M. [1 ]
Yang, J. Y. [2 ]
Liu, S. [2 ,3 ]
Hoogenboom, G. [4 ]
机构
[1] Jilin Agr Univ, Coll Resource & Environm Sci, Changchun 130118, Peoples R China
[2] Agr & Agri Food Canada, Greenhouse & Proc Crops Res Ctr, Harrow, ON N0R 1G0, Canada
[3] Shanxi Univ, Inst Loess Plateau, Taiyuan 030006, Peoples R China
[4] Washington State Univ, AgWeatherNet, Prosser, WA 99350 USA
关键词
Statistical evaluation; Test statistics; Deviation statistics; Autocorrelation; Heteroskedasticity; Crop simulation model; GROWN OFF-SEASON; WATER; NITROGEN; SIMULATION; AUTOCORRELATION; TEMPERATURE; VALIDATION; IMPACT; YIELD;
D O I
10.1016/j.agsy.2014.01.008
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Calibration and evaluation are two important steps prior to the application of a crop simulation model. The objective of this paper was to review common statistical methods that are being used for crop model calibration and evaluation. A group of deviation statistics were reviewed, including root mean squired error (RMSE), normalize-RMSE (nRMSE), mean absolute error (MAE), mean error (E), paired-t, index of agreement (d), modified index of agreement (d(1)), revised index of agreement (d(1)'), modeling efficiency (EF) and revised modeling efficiency (EF1). A case study of the statistical evaluation was conducted for the DSSAT Cropping System Model (CSM) using 10 experimental datasets for maize, peanut, soybean, wheat and potato from Brazil, China, Ghana, and the USA. The results indicated that R-2 was not a good statistic for model evaluation because it is insensitive to regression coefficients (alpha and beta) of the linear model y = alpha + beta x + epsilon. However, linear regression can be used for model evaluation (test H0: alpha = 0, beta = 1) if auto-correlation, normality and heteroskedasticaity of the error term (epsilon) are tested or the proper data transfers are made. The results also illustrated that statistical evaluation of total dataset across treatments might be insufficient. Hence the evaluation of each treatment is necessary to make the right conclusion, especially when evaluating soil water content under different planting date treatments and soil mineral N under different N treatments. Co-variability analysis among dimensionless statistics (d, d(1), d(1)', EF and EF1) recommended that d and EF are inflated by the sum of squares-based deviations, i.e., the larger deviations contribute more weight on the statistic than the smaller deviation due to the squared term. However, EF had a larger range and a clear physical meaning at EF = 0, making it superior to d. Values of d = 0.75 were obtained from regression with all positive values of EF (EF >= 0), indicating that values of d >= 0.75 and EF >= 0 should be the minimum values for plant growth evaluation. Values of d >= 0.60 and EF >= -1.0 should be the minimum values for soil outputs evaluation combined with t-test due to the fact that the soil parameters in the DSSAT SOIL module are difficult to calibrate compared with plant growth parameters because of no sufficient observed soil dataset. Due to the statistical nature, no single statistic is more robust over others but some statistics are highly correlated. Therefore, several statistics may be used from each of the following correlated groups (RMSE, MAE), (E, t-test), (d, d(1), d(1)') and (EF, EF1) in one assessment of model evaluation so that a representative statistical conclusion can be obtained with respect to model performance. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:81 / 89
页数:9
相关论文
共 45 条
[1]   COMPUTER-SIMULATION OF CHANGES IN SOIL MINERAL NITROGEN AND CROP NITROGEN DURING AUTUMN, WINTER AND SPRING [J].
ADDISCOTT, TM ;
WHITMORE, AP .
JOURNAL OF AGRICULTURAL SCIENCE, 1987, 109 :141-157
[2]  
Aigner DennisJ., 1971, Basic Econometrics
[3]   Evaluation of LEACHMN under dryland conditions. I. Simulation of water and solute transport [J].
Akinremi, OO ;
Jame, YW ;
Campbell, CA ;
Zentner, RP ;
Chang, C ;
de Jong, R .
CANADIAN JOURNAL OF SOIL SCIENCE, 2005, 85 (02) :223-232
[4]  
[Anonymous], 1999, DSSAT V3
[5]   Temporal autocorrelation functions for movement rates from global positioning system radiotelemetry data [J].
Boyce, Mark S. ;
Pitt, Justin ;
Northrup, Joseph M. ;
Morehouse, Andrea T. ;
Knopff, Kyle H. ;
Cristescu, Bogdan ;
Stenhouse, Gordon B. .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2010, 365 (1550) :2213-2219
[6]   Autocorrelated growth of tropical forest trees: Unraveling patterns and quantifying consequences [J].
Brienen, Roel J. W. ;
Zuidema, Pieter A. ;
During, Heinjo J. .
FOREST ECOLOGY AND MANAGEMENT, 2006, 237 (1-3) :179-190
[7]   Comparison of Crop Model Validation Methods [J].
Cao Hong-xin ;
Hanan, Jim Scott ;
Liu Yan ;
Liu Yong-xia ;
Yue Yan-bin ;
Zhu Da-wei ;
Lu Jian-fei ;
Sun Jin-ying ;
Shi Chun-lin ;
Ge Dao-kuo ;
Wei Xiu-fang ;
Yao An-qing ;
Tian Ping-ping ;
Bao Tai-lin .
JOURNAL OF INTEGRATIVE AGRICULTURE, 2012, 11 (08) :1274-1285
[8]  
Hoogenboom G., 2010, DECIS SUPPORT SYST
[9]   The DSSAT cropping system model [J].
Jones, JW ;
Hoogenboom, G ;
Porter, CH ;
Boote, KJ ;
Batchelor, WD ;
Hunt, LA ;
Wilkens, PW ;
Singh, U ;
Gijsman, AJ ;
Ritchie, JT .
EUROPEAN JOURNAL OF AGRONOMY, 2003, 18 (3-4) :235-265
[10]   Comparing simulated and measured values using mean squared deviation and its components [J].
Kobayashi, K ;
Salam, MU .
AGRONOMY JOURNAL, 2000, 92 (02) :345-352