Graphical Model Selection to Infer the Partial Correlation Network of Allelic Effects in Genomic Prediction With an Application in Dairy Cattle

被引:0
作者
Martinez, Carlos A. [1 ]
Khare, Kshitij [2 ]
Rahman, Syed [3 ]
Baez, Giovanni M. [4 ]
机构
[1] Univ Nacl Colombia, Dept Prod Anim, Bogota, Colombia
[2] Univ Florida, Dept Stat, Gainesville, FL USA
[3] Apple Inc, Seattle, WA USA
[4] Univ Francisco Paula Santander, Dept Ciencias Agr & Pecuarias, Cucuta, Colombia
关键词
Bayesian statistics; EM algorithm; genetic networks; genomic selection; statistical genomics; WISHART DISTRIBUTIONS; COVARIANCE ESTIMATION; MAXIMUM-LIKELIHOOD; GENETIC VALUE; FRAMEWORK; PRIORS;
D O I
10.1111/jbg.12921
中图分类号
S8 [畜牧、 动物医学、狩猎、蚕、蜂];
学科分类号
0905 ;
摘要
We addressed genomic prediction accounting for partial correlation of marker effects, which entails the estimation of the partial correlation network/graph (PCN) and the precision matrix of an unobservable m-dimensional random variable. To this end, we developed a set of statistical models and methods by extending the canonical model selection problem in Gaussian concentration, and directed acyclic graph models. Our frequentist formulations combined existing methods with the EM algorithm and were termed Glasso-EM, Concord-EM and CSCS-EM, whereas our Bayesian formulations corresponded to hierarchical models termed Bayes G-Sel and Bayes DAG-Sel. We implemented our methods in a real bull fertility dataset and then carried out gene annotation of seven markers having the highest degrees in the estimated PCN. Our findings brought biological evidence supporting the usefulness of identifying genomic regions that are highly connected in the inferred PCN. Moreover, a simulation study showed that some of our methods can accurately recover the PCN (accuracy up to 0.98 using Concord-EM), estimate the precision matrix (Concord-EM yielded the best results) and predict breeding values (the best reliability was 0.85 for a trait with heritability of 0.5 using Glasso-EM).
引用
收藏
页数:13
相关论文
共 55 条
[1]   Monte Carlo method for computing the marginal likelihood in nondecomposable Gaussian graphical models [J].
Atay-Kayis, A ;
Massam, H .
BIOMETRIKA, 2005, 92 (02) :317-335
[2]  
Banerjee O, 2008, J MACH LEARN RES, V9, P485
[3]   Differential Genes Expression between Fertile and Infertile Spermatozoa Revealed by Transcriptome Analysis [J].
Bansal, Sandeep Kumar ;
Gupta, Nishi ;
Sankhwar, Satya Narayan ;
Rajender, Singh .
PLOS ONE, 2015, 10 (05)
[4]  
Ben-David E, 2015, Arxiv, DOI arXiv:1109.4371
[5]   COVARIANCE REGULARIZATION BY THRESHOLDING [J].
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2008, 36 (06) :2577-2604
[6]   Gene co-expression network topology provides a framework for molecular characterization of cellular state [J].
Carter, SL ;
Brechbühler, CM ;
Griffin, M ;
Bond, AT .
BIOINFORMATICS, 2004, 20 (14) :2242-2250
[7]   Simulation of hyper-inverse Wishart distributions in graphical models [J].
Carvalho, Carlos M. ;
Massam, Helene ;
West, Mike .
BIOMETRIKA, 2007, 94 (03) :647-659
[8]   Precursor RNA processing 3 is required for male fertility, and germline stem cell self-renewal and differentiation via regulating spliceosome function in Drosophila testes [J].
Chen, Xia ;
Luan, Xiaojin ;
Zheng, Qianwen ;
Qiao, Chen ;
Chen, Wanyin ;
Wang, Min ;
Yan, Yidan ;
Xie, Bing ;
Shen, Cong ;
He, Zeyu ;
Zhang, Jun ;
Liu, Mingxi ;
Hu, Xing ;
Li, Hong ;
Zheng, Bo ;
Fang, Jie ;
Yu, Jun .
SCIENTIFIC REPORTS, 2019, 9 (1)
[9]   HYPER MARKOV LAWS IN THE STATISTICAL-ANALYSIS OF DECOMPOSABLE GRAPHICAL MODELS [J].
DAWID, AP ;
LAURITZEN, SL .
ANNALS OF STATISTICS, 1993, 21 (03) :1272-1317
[10]  
DAWID AP, 1979, J ROY STAT SOC B MET, V41, P1