Using the Pearson's correlation coefficient as the sole metric to measure the accuracy of quantitative trait prediction: is it sufficient?

Authors
Pan, Shouhui [1, 2]
Liu, Zhongqiang [1, 2]
Han, Yanyun [1, 2]
Zhang, Dongfeng [1, 2]
Zhao, Xiangyu [1, 2]
Li, Jinlong [1, 2]
Wang, Kaiyi [1, 2]
Affiliations
[1] Beijing Acad Agr & Forestry Sci, Informat Technol Res Ctr, Beijing, Peoples R China
[2] Natl Engn Res Ctr Informat Technol Agr, Beijing, Peoples R China
Source
Frontiers in Plant Science | 2024 / Volume 15
Keywords
genomic selection; quantitative trait prediction; Pearson's correlation coefficient; evaluation metric; regression prediction
DOI
10.3389/fpls.2024.1480463
Chinese Library Classification
Q94 [Botany]
Discipline classification code
071001
Abstract
Evaluating the accuracy of quantitative trait prediction is crucial for choosing the best model among several candidates in plant breeding. Pearson's correlation coefficient (PCC), which quantifies the strength of the linear association between two variables, is widely used to evaluate quantitative trait prediction models and performs well in most circumstances. However, PCC may not always give a comprehensive view of predictive accuracy, especially in cases involving nonlinear relationships or complex dependencies in machine learning-based methods. Many papers on quantitative trait prediction use PCC as the sole metric for evaluating their models, which is insufficient and limited from a formal perspective. This study addresses the issue by presenting a typical example and comparing PCC with nine other evaluation metrics across four traditional methods and four machine learning-based methods, thereby contributing to the practical applicability and reliability of plant quantitative trait prediction models. PCC should be used in conjunction with other evaluation metrics, chosen according to the specific application scenario, to reduce the likelihood of drawing misleading conclusions.
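The central point of the abstract, that a high PCC alone can hide poor predictions, can be illustrated with a small sketch. Everything below is an assumption made for illustration: the trait values are invented, and RMSE, MAE, and the coefficient of determination are simply common regression metrics, not necessarily among the nine metrics compared in the study.

```python
import numpy as np


def pearson_cc(y_true, y_pred):
    """Pearson's correlation coefficient between observed and predicted values."""
    return float(np.corrcoef(y_true, y_pred)[0, 1])


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


def mae(y_true, y_pred):
    """Mean absolute error."""
    return float(np.mean(np.abs(y_true - y_pred)))


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - np.mean(y_true)) ** 2)
    return float(1.0 - ss_res / ss_tot)


# Hypothetical observed trait values (invented for this sketch).
y_true = np.array([10.0, 12.0, 14.0, 16.0, 18.0])

# Hypothetical model A: perfectly linear in y_true, but rescaled and shifted,
# so its correlation is maximal while its actual predictions are far off.
pred_a = 2.0 * y_true + 5.0

# Hypothetical model B: close to the observed values, with small errors.
pred_b = y_true + np.array([0.5, -0.5, 0.5, -0.5, 0.5])

for name, pred in [("A", pred_a), ("B", pred_b)]:
    print(
        f"model {name}: PCC={pearson_cc(y_true, pred):.3f} "
        f"RMSE={rmse(y_true, pred):.3f} "
        f"MAE={mae(y_true, pred):.3f} "
        f"R2={r2(y_true, pred):.3f}"
    )
```

Ranked by PCC alone, model A would be preferred, yet its errors are much larger than model B's. PCC is invariant to linear rescaling and shifting of the predictions, whereas error-based metrics are not, which is one reason to report them together.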
Pages: 6