Application of Bayesian networks to GAW20 genetic and blood lipid data

被引:3
作者
Howey R.A.J. [1 ]
Cordell H.J. [1 ]
机构
[1] Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle-upon-Tyne
基金
美国国家卫生研究院;
关键词
Bayesian Network Analysis; Genetics Of Lipid Lowering Drugs And Diet Network (GOLDN); Causal SNPs; GAW20 Simulated Data; Single Nucleotide Polymorphisms (SNPs);
D O I
10.1186/s12919-018-0116-y
中图分类号
学科分类号
摘要
Background: Bayesian networks have been proposed as a way to identify possible causal relationships between measured variables based on their conditional dependencies and independencies. We explored the use of Bayesian network analyses applied to the GAW20 data to identify possible causal relationships between differential methylation of cytosine-phosphate-guanine dinucleotides (CpGs), single-nucleotide polymorphisms (SNPs), and blood lipid trait (triglycerides [TGs]). Methods: After initial exploratory linear regression analyses, 2 Bayesian networks analyses were performed. First, we used the real data and modeled the effects of 4 CpGs previously found to be associated with TGs in the Genetics of Lipid Lowering Drugs and Diet Network Study (GOLDN). Second, we used the simulated data and modeled the effect of a fictional lipid modifying drug with 5 known causal SNPs and 5 corresponding CpGs. Results: In the real data we show that relationships are present between the CpGs, TGs, and other variables - age, sex, and center. In the simulated data, we show, using linear regression, that no CpGs and only 1 SNP were associated with a change in TG levels, and, using Bayesian network analysis, that relationships are present between the change in TG levels and most SNPs, but not with CpGs. Conclusions: Even when the causal relationships between variables are known, as with the simulated data, if the relationships are not strong then it is challenging to reproduce them in a Bayesian network. © 2018 The Author(s).
引用
收藏
相关论文
共 9 条
[1]  
Ainsworth H.F., Shin So.-Y., Cordell H.J., A comparison of methods for inferring causal relationships between genotype and phenotype using additional biological measurements, Genetic Epidemiology, 41, 7, pp. 577-586, (2017)
[2]  
Genetic Analysis Workshop 20 (GAW20)
[3]  
Irvin M.R., Zhi D., Joehanes R., Mendelson M., Aslibekyan S., Claas S.A., Thibeault K.S., Patel N., Day K., Jones L.W., Et al., Epigenome-wide association study of fasting blood lipids in the genetics of lipid-lowering drugs and diet network study, Circulation, 130, 7, pp. 565-572, (2014)
[4]  
R: A Language and Environment for Statistical Computing, (2017)
[5]  
Scutari M., Denis J.-B., Bayesian Networks with Examples in R, Texts in Statistical Science, (2014)
[6]  
Howey R., BayesNetty: Bayesian Network Software for Genetic Analyses
[7]  
Csardi G., Nepusz T., The igraph software package for complex network research, Inter J Complex Syst., (2006)
[8]  
Lippert C., Listgarten J., Liu Y., Kadie C.M., Davidson R.I., Heckerman D., FaST linear mixed models for genome-wide association studies, Nat Methods, 8, 10, pp. 833-835, (2011)
[9]  
Sayols-Baixeras S., Tiwari H.K., Aslibekyan S.W., Disentangling associations between DNA methylation and blood lipids: A Mendelian randomization approach, BMC Proc., 12, (2018)