Epi-GTBN: an approach of epistasis mining based on genetic Tabu algorithm and Bayesian network

被引:21
作者
Guo, Yang [1 ]
Zhong, Zhiman [1 ]
Yang, Chen [1 ]
Hu, Jiangfeng [1 ]
Jiang, Yaling [1 ]
Liang, Zizhen [1 ]
Gao, Hui [1 ]
Liu, Jianxiao [1 ]
机构
[1] Huazhong Agr Univ, Coll Informat, Hubei Key Lab Agr Bioinformat, Wuhan 430070, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Epistasis; Genetic algorithm; Tabu; Bayesian network; INFERENCE;
D O I
10.1186/s12859-019-3022-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundMining epistatic loci which affects specific phenotypic traits is an important research issue in the field of biology. Bayesian network (BN) is a graphical model which can express the relationship between genetic loci and phenotype. Until now, it has been widely used into epistasis mining in many research work. However, this method has two disadvantages: low learning efficiency and easy to fall into local optimum. Genetic algorithm has the excellence of rapid global search and avoiding falling into local optimum. It is scalable and easy to integrate with other algorithms. This work proposes an epistasis mining approach based on genetic tabu algorithm and Bayesian network (Epi-GTBN). It uses genetic algorithm into the heuristic search strategy of Bayesian network. The individual structure can be evolved through the genetic operations of selection, crossover and mutation. It can help to find the optimal network structure, and then further to mine the epistasis loci effectively. In order to enhance the diversity of the population and obtain a more effective global optimal solution, we use the tabu search strategy into the operations of crossover and mutation in genetic algorithm. It can help to accelerate the convergence of the algorithm.ResultsWe compared Epi-GTBN with other recent algorithms using both simulated and real datasets. The experimental results demonstrate that our method has much better epistasis detection accuracy in the case of not affecting the efficiency for different datasets.ConclusionsThe presented methodology (Epi-GTBN) is an effective method for epistasis detection, and it can be seen as an interesting addition to the arsenal used in complex traits analyses.
引用
收藏
页数:18
相关论文
共 45 条
[11]   An information-gain approach to detecting three-way epistatic interactions in genetic association studies [J].
Hu, Ting ;
Chen, Yuanzhu ;
Kiralis, Jeff W. ;
Collins, Ryan L. ;
Wejse, Christian ;
Sirugo, Giorgio ;
Williams, Scott M. ;
Moore, Jason H. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (04) :630-636
[12]   LEAP: Biomarker Inference Through Learning and Evaluating Association Patterns [J].
Jiang, Xia ;
Neapolitan, Richard E. .
GENETIC EPIDEMIOLOGY, 2015, 39 (03) :173-184
[13]   Learning genetic epistasis using Bayesian network scoring criteria [J].
Jiang, Xia ;
Neapolitan, Richard E. ;
Barmada, M. Michael ;
Visweswaran, Shyam .
BMC BIOINFORMATICS, 2011, 12
[14]   Heterogeneous multimodal biomarkers analysis for Alzheimer's disease via Bayesian network [J].
Jin, Yan ;
Su, Yi ;
Zhou, Xiao-Hua ;
Huang, Shuai .
EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2016, 2016 (01)
[15]   MACOED: a multi-objective ant colony optimization algorithm for SNP epistasis detection in genome-wide association studies [J].
Jing, Peng-Jie ;
Shen, Hong-Bin .
BIOINFORMATICS, 2015, 31 (05) :634-641
[16]   Complement factor H polymorphism in age-related macular degeneration [J].
Klein, RJ ;
Zeiss, C ;
Chew, EY ;
Tsai, JY ;
Sackler, RS ;
Haynes, C ;
Henning, AK ;
SanGiovanni, JP ;
Mane, SM ;
Mayne, ST ;
Bracken, MB ;
Ferris, FL ;
Ott, J ;
Barnstable, C ;
Hoh, J .
SCIENCE, 2005, 308 (5720) :385-389
[17]   IGENT: efficient entropy based algorithm for genome-wide gene-gene interaction analysis [J].
Kwon, Min-Seok ;
Park, Mira ;
Park, Taesung .
BMC MEDICAL GENOMICS, 2014, 7
[18]   Detecting gene-gene interactions using a permutation-based random forest method [J].
Li, Jing ;
Malley, James D. ;
Andrew, Angeline S. ;
Karagas, Margaret R. ;
Moore, Jason H. .
BIODATA MINING, 2016, 9
[19]   Identification of genetic interaction networks via an evolutionary algorithm evolved Bayesian network [J].
Li, Ruowang ;
Dudek, Scott M. ;
Kim, Dokyoon ;
Hall, Molly A. ;
Bradford, Yuki ;
Peissig, Peggy L. ;
Brilliant, Murray H. ;
Linneman, James G. ;
McCarty, Catherine A. ;
Bao, Le ;
Ritchie, Marylyn D. .
BIODATA MINING, 2016, 9
[20]   A fast and exhaustive method for heterogeneity and epistasis analysis based on multi-objective optimization [J].
Li, Xiong .
BIOINFORMATICS, 2017, 33 (18) :2829-2836