Multi-marker-LD based genetic algorithm for tag SNP selection

被引:0
作者
Amer E. Mouawad
Nashat Mansour
机构
[1] Lebanese American University,Department of Computer Science and Mathematics
来源
Interdisciplinary Sciences: Computational Life Sciences | 2014年 / 6卷
关键词
disease-SNP association; genetic algorithm; multi-marker linkage disequilibrium; single nucleotide polymorphism; tag SNP;
D O I
暂无
中图分类号
学科分类号
摘要
Despite the advances in genotyping technologies which have led to large reduction in genotyping cost, the Tag SNP Selection problem remains an important problem for computational biologists and geneticists. Selecting the smallest subset of tag SNPs that can predict the other SNPs would considerably minimize the complexity of genome-wide or block-based SNP-disease association studies. These studies would lead to better diagnosis and treatment of diseases. In this work, we propose three variations of a genetic algorithm based on two-marker linkage disequilibrium, multi-marker linkage disequilibrium, and a third measure that we denote by prediction power. The performance of the three algorithms are compared with those of a recognized tag SNP selection algorithm using three different real data sets from the HapMap project. The results indicate that the multi-marker linkage disequilibrium based genetic algorithm yields better prediction accuracy.
引用
收藏
页码:303 / 311
页数:8
相关论文
共 72 条
  • [1] Daly MJ(2001)High resolution haplotype structure in the human genome Nat Genet 29 229-232
  • [2] Rioux JD(2007)GEVALT: An integrated software tool for genotype analysis BMC Bioinformatics 8 36-322
  • [3] Schaffner SF(1995)A comparison of linkage disequilibrium measures for fine-scale mapping Genomics 29 311-2229
  • [4] Hudson TJ(2002)The structure of haplotype blocks in the human genome Science 296 2225-2561
  • [5] Lander ES(2006)MLR-tagging: informative SNP selection for un-phased genotypes based on multiple linear regression Bioinformatics 22 2558-67
  • [6] Davidovich O(2007)Informative SNP selection methods based on SNP prediction IEEE Trans Nanobioscience 6 60-288
  • [7] Kimmel G(2003)Efficient selective screening of haplotype tag SNPs Bioinformatics 19 287-162
  • [8] Shamir R(2005)GERBIL: Genotype resolution and block identification using likelihood Proc. Natl Acad Sci USA 102 158-80
  • [9] Devlin B(2010)FastTagger: an efficient algorithm for genome-wide tag SNP selection using multi-marker linkage disequilibrium BMC Bioinformatics 11 66-225
  • [10] Risch N(1994)Parallel physical optimization algorithms for allocating data to multicomputer nodes Journal of Supercomputing 8 53-36