Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat

被引:200
作者
Gianola, Daniel [1 ,2 ,3 ]
Okut, Hayrettin [1 ,4 ]
Weigel, Kent A. [2 ]
Rosa, Guilherme J. M. [1 ,3 ]
机构
[1] Univ Wisconsin, Dept Anim Sci, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Dairy Sci, Madison, WI 53706 USA
[3] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53706 USA
[4] Yuzuncu Yil Univ, Biometry & Genet Branch, Dept Anim Sci, TR-65080 Van, Turkey
关键词
GENOMIC-ASSISTED PREDICTION; HILBERT-SPACES REGRESSION; GENETIC VALUES; VARIANCE-COMPONENTS; ENABLED PREDICTION; MOLECULAR MARKERS; PEDIGREE; MODELS;
D O I
10.1186/1471-2156-12-87
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: In the study of associations between genomic data and complex phenotypes there may be relationships that are not amenable to parametric statistical modeling. Such associations have been investigated mainly using single-marker and Bayesian linear regression models that differ in their distributions, but that assume additive inheritance while ignoring interactions and non-linearity. When interactions have been included in the model, their effects have entered linearly. There is a growing interest in non-parametric methods for predicting quantitative traits based on reproducing kernel Hilbert spaces regressions on markers and radial basis functions. Artificial neural networks (ANN) provide an alternative, because these act as universal approximators of complex functions and can capture non-linear relationships between predictors and responses, with the interplay among variables learned adaptively. ANNs are interesting candidates for analysis of traits affected by cryptic forms of gene action. Results: We investigated various Bayesian ANN architectures using for predicting phenotypes in two data sets consisting of milk production in Jersey cows and yield of inbred lines of wheat. For the Jerseys, predictor variables were derived from pedigree and molecular marker (35,798 single nucleotide polymorphisms, SNPS) information on 297 individually cows. The wheat data represented 599 lines, each genotyped with 1,279 markers. The ability of predicting fat, milk and protein yield was low when using pedigrees, but it was better when SNPs were employed, irrespective of the ANN trained. Predictive ability was even better in wheat because the trait was a mean, as opposed to an individual phenotype in cows. Non-linear neural networks outperformed a linear model in predictive ability in both data sets, but more clearly in wheat. Conclusion: Results suggest that neural networks may be useful for predicting complex traits using high-dimensional genomic information, a situation where the number of unknowns exceeds sample size. ANNs can capture nonlinearities, adaptively. This may be useful when prediction of phenotypes is crucial.
引用
收藏
页数:14
相关论文
共 44 条
[1]   Estimating UV erythemal irradiance by means of neural networks [J].
Alados, I ;
Mellado, JA ;
Ramos, F ;
Alados-Arboledas, L .
PHOTOCHEMISTRY AND PHOTOBIOLOGY, 2004, 80 (02) :351-358
[2]  
[Anonymous], 2006, Pattern recognition and machine learning
[3]  
BEAL MH, 2010, NEURAL NETWORK TOOLB
[4]   Prediction of Genetic Values of Quantitative Traits in Plant Breeding Using Pedigree and Molecular Markers [J].
Crossa, Jose ;
de los Campos, Gustavo ;
Perez, Paulino ;
Gianola, Daniel ;
Burgueno, Juan ;
Luis Araus, Jose ;
Makumbi, Dan ;
Singh, Ravi P. ;
Dreisigacker, Susanne ;
Yan, Jianbing ;
Arief, Vivi ;
Banziger, Marianne ;
Braun, Hans-Joachim .
GENETICS, 2010, 186 (02) :713-U406
[5]   Reproducing kernel Hilbert spaces regression: A general framework for genetic evaluation [J].
de los Campos, G. ;
Gianola, D. ;
Rosa, G. J. M. .
JOURNAL OF ANIMAL SCIENCE, 2009, 87 (06) :1883-1887
[6]   Predicting genetic predisposition in humans: the promise of whole-genome markers [J].
de los Campos, Gustavo ;
Gianola, Daniel ;
Allison, David B. .
NATURE REVIEWS GENETICS, 2010, 11 (12) :880-886
[7]   Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods [J].
de los Campos, Gustavo ;
Gianola, Daniel ;
Rosa, Guilherme J. M. ;
Weigel, Kent A. ;
Crossa, Jose .
GENETICS RESEARCH, 2010, 92 (04) :295-308
[8]  
Demuth H., 2009, NEURAL NETWORK TOOLB
[9]  
Falconer D. S., 1996, Introduction to quantitative genetics.
[10]   Ensembles of Bayesian-regularized Genetic Neural Networks for modeling of acetylcholinesterase inhibition by huprines [J].
Fernandez, Michael ;
Caballero, Julio .
CHEMICAL BIOLOGY & DRUG DESIGN, 2006, 68 (04) :201-212