Kernel-based whole-genome prediction of complex traits: a review

被引:113
作者
Morota, Gota [1 ]
Gianola, Daniel [2 ,3 ,4 ]
机构
[1] Univ Nebraska, Dept Anim Sci, Lincoln, NE 68533 USA
[2] Univ Wisconsin, Dept Anim Sci, Madison, WI USA
[3] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI USA
[4] Univ Wisconsin, Dept Dairy Sci, Madison, WI 53706 USA
关键词
LINEAR UNBIASED PREDICTION; MARKER-ASSISTED SELECTION; HILBERT-SPACES REGRESSION; ENABLED PREDICTION; CROSS-VALIDATION; GENETIC VALUES; QUANTITATIVE TRAITS; RIDGE-REGRESSION; BREEDING VALUES; LITTER SIZE;
D O I
10.3389/fgene.2014.00363
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Prediction of genetic values has been a focus of applied quantitative genetics since the beginning of the 20th century, with renewed interest following the advent of the era of whole genome-enabled prediction. Opportunities offered by the emergence of high-dimensional genomic data fueled by post-Sanger sequencing technologies, especially molecular markers, have driven researchers to extend Ronald Fisher and Sewall Wright's models to confront new challenges. In particular, kernel methods are gaining consideration as a regression method of choice for genome-enabled prediction. Complex traits are presumably influenced by many genomic regions working in concert with others (clearly so when considering pathways), thus generating interactions. Motivated by this view, a growing number of statistical approaches based on kernels attempt to capture non-additive effects, either parametrically or non-parametrically. This review centers on whole-genome regression using kernel methods applied to a wide range of quantitative traits of agricultural importance in animals and plants. We discuss various kernel-based approaches tailored to capturing total genetic variation, with the aim of arriving at an enhanced predictive performance in the light of available genome annotation information. Connections between prediction machines born in animal breeding, statistics, and machine learning are revisited, and their empirical prediction performance is discussed. Overall, while some encouraging results have been obtained with non-parametric kernels, recovering non-additive genetic variation in a validation dataset remains a challenge in quantitative genetics.
引用
收藏
页数:13
相关论文
共 113 条
  • [1] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [2] Angeles Perez-Cabal M., 2012, Frontiers in Genetics, V3, P27, DOI 10.3389/fgene.2012.00027
  • [3] [Anonymous], 1973, J ANIM SCI, DOI DOI 10.1093/ANSCI/1973.SYMPOSIUM.10
  • [4] [Anonymous], 1990, INTRO APPL GEOSTATIS
  • [5] Berk RA, 2008, SPRINGER SER STAT, P1, DOI 10.1007/978-0-387-77501-2_1
  • [6] A Bayesian Mixed Regression Based Prediction of Quantitative Traits from Molecular Marker and Gene Expression Data
    Bhattacharjee, Madhuchhanda
    Sillanpaa, Mikko J.
    [J]. PLOS ONE, 2011, 6 (11):
  • [7] Genomic breeding value prediction: methods and procedures
    Calus, M. P. L.
    [J]. ANIMAL, 2010, 4 (02) : 157 - 164
  • [8] Systems genetics approaches to understand complex traits
    Civelek, Mete
    Lusis, Aldons J.
    [J]. NATURE REVIEWS GENETICS, 2014, 15 (01) : 34 - 48
  • [9] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES
    COHEN, J
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) : 37 - 46
  • [10] Genomic prediction in CIMMYT maize and wheat breeding programs
    Crossa, J.
    Perez, P.
    Hickey, J.
    Burgueno, J.
    Ornella, L.
    Ceron-Rojas, J.
    Zhang, X.
    Dreisigacker, S.
    Babu, R.
    Li, Y.
    Bonnett, D.
    Mathews, K.
    [J]. HEREDITY, 2014, 112 (01) : 48 - 60