Haplotyping populations by pure parsimony based on compatible genotypes and greedy heuristics

被引:1
作者
Wang, I-Lin [1 ]
Yang, Hui-E [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Ind & Informat Management, Tainan 701, Taiwan
关键词
Haplotype inference; Integer programming; Bioinformatics; Compatibility graph; Heuristics; INFERENCE; RECONSTRUCTION; SAMPLES; COMPLEXITY; ALGORITHM;
D O I
10.1016/j.amc.2011.04.073
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The population haplotype inference problem based on the pure parsimony criterion ( HIPP) infers an m x n genotype matrix for a population by a 2m x n haplotype matrix with the minimum number of distinct haplotypes. Previous integer programming based HIPP solution methods are time-consuming, and their practical effectiveness remains unevaluated. On the other hand, previous heuristic HIPP algorithms are efficient, but their theoretical effectiveness in terms of optimality gaps has not been evaluated, either. We propose two new heuristic HIPP algorithms (MGP and GHI) and conduct more complete computational experiments. In particular, MGP exploits the compatible relations among genotypes to solve a reduced integer linear programming problem so that a solution of good quality can be obtained very quickly; GHI exploits a weight mechanism to selects better candidate haplotypes in a greedy fashion. The computational results show that our proposed algorithms are efficient and effective, especially for solving cases with larger recombination rates. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:9798 / 9809
页数:12
相关论文
共 29 条
  • [1] [Anonymous], ALGEBRAIC BIOL
  • [2] The haplotyping problem: An overview of computational models and solutions
    Bonizzoni, P
    Della Vedova, G
    Dondi, R
    Li, J
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2003, 18 (06) : 675 - 688
  • [3] Integer programming approaches to haplotype inference by pure parsimony
    Brown, DG
    Harrower, IM
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2006, 3 (02) : 141 - 154
  • [4] CLARK AG, 1990, MOL BIOL EVOL, V7, P111
  • [5] Complex promoter and coding region β2-adrenergic receptor haplotypes alter receptor expression and predict in vivo responsiveness
    Drysdale, CM
    McGraw, DW
    Stack, CB
    Stephens, JC
    Judson, RS
    Nandabalan, K
    Arnold, K
    Ruano, G
    Liggett, SB
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (19) : 10483 - 10488
  • [6] An extensible SAT-solver
    Eén, N
    Sörensson, N
    [J]. THEORY AND APPLICATIONS OF SATISFIABILITY TESTING, 2004, 2919 : 502 - 518
  • [7] En Niklas., 2006, Boolean Modeling and Computation, V2, P1
  • [8] EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
  • [9] Graca A, 2008, LECT NOTES COMPUT SC, V5015, P308, DOI 10.1007/978-3-540-68155-7_28
  • [10] Gusfield D, 2003, LECT NOTES COMPUT SC, V2676, P144