Overinterpretation of evaluation results in machine learning studies for maize yield prediction: A systematic review

被引:0
作者
Leukel, Joerg [1 ]
Scheurer, Luca [1 ]
Zimpel, Tobias [1 ]
机构
[1] Univ Hohenheim, Dept Informat Syst & Digital Technol, Schwerzstr 35, D-70599 Stuttgart, Germany
关键词
Artificial intelligence; Prediction model; Reporting quality; Spin; Yield prediction; ABSOLUTE ERROR MAE; CROP YIELD; MODEL; RMSE;
D O I
10.1016/j.compag.2024.109892
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
An increasing number of studies are focused on developing and evaluating machine learning (ML) models for crop yield prediction. However, poor reporting and intentional or unintentional misuse of language can distort the interpretation of evaluation results and make them appear more favorably than they actually are. The prevalence of overinterpretation of evaluation results is not yet known. To address this gap, we determined the frequency of overinterpretation in different sections of articles evaluating prediction models for maize as a specific crop. We conducted a prospectively registered systematic review, searching Scopus and Web of Science for journal articles reporting the evaluation of ML prediction models for maize yield, published from 2014 and 2023. We evaluated 207 articles based on reporting criteria adapted from overinterpretation frameworks in the biomedical literature. Overly positive words were used to describe evaluation results in 58 conclusion sections (28.0%) and 52 abstracts (25.1%). Fifty-two articles (25.1%) reported results for the coefficient of determination (R2) inconsistent between abstract and main text. Forty-eight articles (23.2%) incorrectly interpreted correlation coefficients as a measure of performance. Thirty articles (14.5%) reported wrong formulas for at least one of five standard metrics. In 25 articles (12.1%), at least one metric lacked or had an incorrect unit of measurement. In each 21 articles (10.1%), the reporting of the normalized root mean square error (NRSME) was inconsistent between abstract and main text, and R2 was wrongly interpreted as correlation. In summary, overinterpretation of evaluation results in ML studies for maize yield prediction was a frequently observed phenomenon. The findings have important implications for both the interpretation of individual studies and the broader accumulation of knowledge. Authors, reviewers, and editorial boards should put more emphasis on a complete, unambiguous, and correct reporting of evaluation results for which we submit recommendations.
引用
收藏
页数:9
相关论文
共 67 条
  • [1] Temporal Vegetation Indices and Plant Height from Remotely Sensed Imagery Can Predict Grain Yield and Flowering Time Breeding Value in Maize via Machine Learning Regression
    Adak, Alper
    Murray, Seth C.
    Bozinovic, Sofija
    Lindsey, Regan
    Nakasagga, Shakirah
    Chatterjee, Sumantra
    Anderson, Steven L., II
    Wilde, Scott
    [J]. REMOTE SENSING, 2021, 13 (11)
  • [2] Remote sensing-based framework to predict and assess the interannual variability of maize yields in Pakistan using Landsat imagery
    Ahmad, Ishfaq
    Singh, Aditya
    Fahad, Muhammad
    Waqas, Muhammad Mohsin
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 178
  • [3] MEASUREMENT IN MEDICINE - THE ANALYSIS OF METHOD COMPARISON STUDIES
    ALTMAN, DG
    BLAND, JM
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1983, 32 (03) : 307 - 317
  • [4] Using MODIS Data to Predict Regional Corn Yields
    Ban, Ho-Young
    Kim, Kwang Soo
    Park, No-Wook
    Lee, Byun-Woo
    [J]. REMOTE SENSING, 2017, 9 (01):
  • [5] Guiding questions to avoid data leakage in biological machine learning applications
    Bernett, Judith
    Grimm, Dominik G.
    Haselbeck, Florian
    Blumenthal, David B.
    Joeres, Roman
    Kalinina, Olga V.
    List, Markus
    [J]. NATURE METHODS, 2024, 21 (08) : 1444 - 1453
  • [6] Misrepresentation and distortion of research in biomedical literature
    Boutron, Isabelle
    Ravaud, Philippe
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (11) : 2613 - 2619
  • [7] Root mean square error (RMSE) or mean absolute error (MAE)? - Arguments against avoiding RMSE in the literature
    Chai, T.
    Draxler, R. R.
    [J]. GEOSCIENTIFIC MODEL DEVELOPMENT, 2014, 7 (03) : 1247 - 1250
  • [8] The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation
    Chicco, Davide
    Warrens, Matthijs J.
    Jurman, Giuseppe
    [J]. PEERJ COMPUTER SCIENCE, 2021,
  • [9] 'Spin' in published biomedical literature: A methodological systematic review
    Chiu, Kellia
    Grundy, Quinn
    Bero, Lisa
    [J]. PLOS BIOLOGY, 2017, 15 (09)
  • [10] Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review
    Chlingaryan, Anna
    Sukkarieh, Salah
    Whelan, Brett
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 151 : 61 - 69