Haplotype inference with pseudo-Boolean optimization

被引:10
作者
Graca, Ana [1 ,2 ]
Marques-Silva, Joao [3 ]
Lynce, Ines [1 ,2 ]
Oliveira, Arlindo L. [1 ,2 ]
机构
[1] Univ Tecn Lisboa, IST, Lisbon, Portugal
[2] INESC ID Lisboa, Lisbon, Portugal
[3] Univ Coll Dublin, Sch Comp Sci & Informat, Complex & Adapt Syst Lab, Dublin 2, Ireland
关键词
Haplotype inference; Pure parsimony; Pseudo-Boolean optimization; GENOTYPE DATA; PURE PARSIMONY; RECONSTRUCTION; ALGORITHMS; DIVERSITY; MODEL;
D O I
10.1007/s10479-009-0675-4
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The fast development of sequencing techniques in the recent past has required an urgent development of efficient and accurate haplotype inference tools. Besides being a crucial issue in genetics, haplotype inference is also a challenging computational problem. Among others, pure parsimony is a viable modeling approach to solve the problem of haplotype inference and also an interesting NP-hard problem in itself. Recently, the introduction of SAT-based methods, including pseudo-Boolean optimization (PBO) methods, has produced very efficient solvers. This paper provides a detailed description of RPoly, a PBO approach for the haplotype inference by pure parsimony (HIPP) problem. Moreover, an extensive evaluation of existent HIPP solvers, on a comprehensive set of instances, confirms that RPoly is currently the most efficient and robust HIPP approach.
引用
收藏
页码:137 / 162
页数:26
相关论文
共 45 条
[1]   Generic ILP versus specialized 0-1 ILP: An update [J].
Aloul, FA ;
Ramani, A ;
Markov, IL ;
Sakallah, KA .
IEEE/ACM INTERNATIONAL CONFERENCE ON CAD-02, DIGEST OF TECHNICAL PAPERS, 2002, :450-457
[2]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[3]  
[Anonymous], 2006, Journal on Satisfiability, Boolean Modeling and Computation
[4]   Integer programming approaches to haplotype inference by pure parsimony [J].
Brown, DG ;
Harrower, IM .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2006, 3 (02) :141-154
[5]  
Brown DG, 2004, LECT NOTES COMPUT SC, V3240, P254
[6]   Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering [J].
Browning, Sharon R. ;
Browning, Brian L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) :1084-1097
[7]   Clone-based systematic haplotyping (CSH): A procedure for physical haplotyping of whole genomes [J].
Burgtorf, C ;
Kepper, P ;
Hoehe, M ;
Schmitt, C ;
Reinhardt, R ;
Lehrach, H ;
Sauer, S .
GENOME RESEARCH, 2003, 13 (12) :2717-2724
[8]   High-resolution haplotype structure in the human genome [J].
Daly, MJ ;
Rioux, JD ;
Schaffner, SE ;
Hudson, TJ ;
Lander, ES .
NATURE GENETICS, 2001, 29 (02) :229-232
[9]   Shape-IT: new rapid and accurate algorithm for haplotype inference [J].
Delaneau, Olivier ;
Coulonges, Cedric ;
Zagury, Jean-Francois .
BMC BIOINFORMATICS, 2008, 9 (1)
[10]  
DRYSDALE C, 2000, NATL ACAD SCI NAS, V97