An effective haplotype assembly algorithm based on hypergraph partitioning

被引:8
作者
Chen, Xiao [1 ]
Peng, Qinke [1 ]
Han, Libin [1 ]
Zhong, Tao [1 ]
Xu, Tao [1 ]
机构
[1] Xi An Jiao Tong Univ, Syst Engn Inst, Sch Elect & Informat Engn, Xian 710049, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Computational biology; Single individual haplotyping; Shared nearest neighbors; Single nucleotide polymorphism; RECONSTRUCTION; SEQUENCE; STEADY; PSEAAC; INHIBITOR; IDENTIFY; KINETICS; RULES;
D O I
10.1016/j.jtbi.2014.05.034
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The haplotype assembly problem has been proven to be complex. Heuristic algorithms are the main methods that are used to solve the problem. These algorithms perform well when the SNP fragments are error-free, but they are less accurate when the error rate increases. The complex relationships caused by fragment errors present a major barrier to assembling accurate haplotypes. Therefore, modeling the complex relationships is the key to solve the problem. In this study, we model the haplotype assembly problem using hypergraph partitioning formulations and propose a novel method termed HGHap (Hypergraph-based Haplotype assembly method). HGHap approaches the haplotype assembly problem in two phases. In the first phase, a hypergraph is constructed in which each vertex corresponds to a fragment and vertices are multiply connected to form hyperedges. In the second phase, a hypergraph partitioning algorithm is employed to obtain two groups of fragments to construct the haplotypes. The hyperedges capture higher-order relationships among fragments that facilitate the subsequent partitioning. Our results demonstrate that the method performs better than other methods in most cases, especially in cases with a high error rate. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:85 / 92
页数:8
相关论文
共 63 条
[1]  
ALTHAUS IW, 1993, J BIOL CHEM, V268, P14875
[2]  
ALTHAUS IW, 1993, J BIOL CHEM, V268, P6119
[3]   KINETIC-STUDIES WITH THE NONNUCLEOSIDE HIV-1 REVERSE-TRANSCRIPTASE INHIBITOR-U-88204E [J].
ALTHAUS, IW ;
CHOU, JJ ;
GONZALES, AJ ;
DEIBEL, MR ;
CHOU, KC ;
KEZDY, FJ ;
ROMERO, DL ;
PALMER, JR ;
THOMAS, RC ;
ARISTOFF, PA ;
TARPLEY, WG ;
REUSSER, F .
BIOCHEMISTRY, 1993, 32 (26) :6548-6554
[4]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[5]   Kinetic plasticity and the determination of product ratios for kinetic schemes leading to multiple products without rate laws - New methods based on directed graphs [J].
Andraos, John .
CANADIAN JOURNAL OF CHEMISTRY, 2008, 86 (04) :342-357
[6]   HapCUT: an efficient and accurate algorithm for the haplotype assembly problem [J].
Bansal, Vikas ;
Bafna, Vineet .
BIOINFORMATICS, 2008, 24 (16) :I153-I159
[7]   Frequent item set mining [J].
Borgelt, Christian .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (06) :437-456
[8]   Hypergraph-partitioning-based remapping models for image-space-parallel direct volume rendering of unstructured grids [J].
Cambazoglu, Berkant Barla ;
Aykanat, Cevdet .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2007, 18 (01) :3-16
[9]   iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition [J].
Chen, Wei ;
Feng, Peng-Mian ;
Lin, Hao ;
Chou, Kuo-Chen .
NUCLEIC ACIDS RESEARCH, 2013, 41 (06) :e68
[10]   iNuc-PhysChem: A Sequence-Based Predictor for Identifying Nucleosomes via Physicochemical Properties [J].
Chen, Wei ;
Lin, Hao ;
Feng, Peng-Mian ;
Ding, Chen ;
Zuo, Yong-Chun ;
Chou, Kuo-Chen .
PLOS ONE, 2012, 7 (10)