Technical Note: The divide and measure nonconformity - how metrics can mislead when we evaluate on different data partitions

被引:3
作者
Klotz, Daniel [1 ]
Gauch, Martin [2 ]
Kratzert, Frederik [3 ]
Nearing, Grey [4 ]
Zscheischler, Jakob [1 ,5 ]
机构
[1] UFZ Helmholtz Ctr Environm Res, Dept Cpd Environm Risks, Leipzig, Germany
[2] Google Res, Zurich, Switzerland
[3] Google Res, Vienna, Austria
[4] Google Res, Mountain View, CA USA
[5] TUD Dresden Univ Technol, Dept Hydro Sci, Dresden, Germany
关键词
NASH-SUTCLIFFE; DATA SET; BENCHMARKING; DIAGNOSTICS; UNCERTAINTY; CHOICE;
D O I
10.5194/hess-28-3665-2024
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
The evaluation of model performance is an essential part of hydrological modeling. However, leveraging the full information that performance criteria provide requires a deep understanding of their properties. This Technical Note focuses on a rather counterintuitive aspect of the perhaps most widely used hydrological metric, the Nash-Sutcliffe efficiency (NSE). Specifically, we demonstrate that the overall NSE of a dataset is not bounded by the NSEs of all its partitions. We term this phenomenon the "divide and measure nonconformity". It follows naturally from the definition of the NSE, yet because modelers often subdivide datasets in a non-random way, the resulting behavior can have unintended consequences in practice. In this note we therefore discuss the implications of the divide and measure nonconformity, examine its empirical and theoretical properties, and provide recommendations for modelers to avoid drawing misleading conclusions.
引用
收藏
页码:3665 / 3673
页数:9
相关论文
共 36 条
  • [1] The CAMELS data set: catchment attributes and meteorology for large-sample studies
    Addor, Nans
    Newman, Andrew J.
    Mizukami, Naoki
    Clark, Martyn P.
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2017, 21 (10) : 5293 - 5313
  • [3] The Abuse of Popular Performance Metrics in Hydrologic Modeling
    Clark, Martyn P.
    Vogel, Richard M.
    Lamontagne, Jonathan R.
    Mizukami, Naoki
    Knoben, Wouter J. M.
    Tang, Guoqiang
    Gharari, Shervan
    Freer, Jim E.
    Whitfield, Paul H.
    Shook, Kevin R.
    Papalexiou, Simon Michael
    [J]. WATER RESOURCES RESEARCH, 2021, 57 (09)
  • [4] A signal-processing-based interpretation of the Nash-Sutcliffe efficiency
    Duc, Le
    Sawada, Yohei
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2023, 27 (09) : 1827 - 1839
  • [5] The suitability of differentiable, physics-informed machine learninghydrologic models for ungauged regions and climate change impact assessment
    Feng, Dapeng
    Beck, Hylke
    Lawson, Kathryn
    Shen, Chaopeng
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2023, 27 (12) : 2357 - 2373
  • [6] In Defense of Metrics: Metrics Sufficiently Encode Typical Human Preferences Regarding Hydrological Model Performance
    Gauch, Martin
    Kratzert, Frederik
    Gilon, Oren
    Gupta, Hoshin
    Mai, Juliane
    Nearing, Grey
    Tolson, Bryan
    Hochreiter, Sepp
    Klotz, Daniel
    [J]. WATER RESOURCES RESEARCH, 2023, 59 (06)
  • [7] THE AMALGAMATION AND GEOMETRY OF 2-BY-2 CONTINGENCY-TABLES
    GOOD, IJ
    MITTAL, Y
    [J]. ANNALS OF STATISTICS, 1987, 15 (02) : 694 - 711
  • [8] Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling
    Gupta, Hoshin V.
    Kling, Harald
    Yilmaz, Koray K.
    Martinez, Guillermo F.
    [J]. JOURNAL OF HYDROLOGY, 2009, 377 (1-2) : 80 - 91
  • [9] DESIGN AND ANALYSIS OF PATTERN RECOGNITION EXPERIMENTS
    HIGHLEYMAN, WH
    [J]. BELL SYSTEM TECHNICAL JOURNAL, 1962, 41 (02): : 723 - +
  • [10] OPERATIONAL TESTING OF HYDROLOGICAL SIMULATION-MODELS
    KLEMES, V
    [J]. HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 1986, 31 (01): : 13 - 24