Mate pair library sequencing is an effective and economical method for detecting genomic structural variants and chromosomal abnormalities. Unfortunately, the mapping and alignment of mate-pair read pairs to a reference genome is a challenging and time-consuming process for most next-generation sequencing alignment programs. Large insert sizes, introduction of library preparation protocol artifacts (biotin junction reads, paired-end read contamination, chimeras, etc.) and presence of structural variant breakpoints within reads increase mapping and alignment complexity. We describe an algorithm that is up to 20 times faster and 25% more accurate than popular next-generation sequencing alignment programs when processing mate pair sequencing.
机构:
Broad Inst, Cambridge, MA 02142 USABroad Inst, Cambridge, MA 02142 USA
Li, Heng
;
Homer, Nils
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90024 USABroad Inst, Cambridge, MA 02142 USA
机构:
Broad Inst, Cambridge, MA 02142 USABroad Inst, Cambridge, MA 02142 USA
Li, Heng
;
Homer, Nils
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90024 USABroad Inst, Cambridge, MA 02142 USA