Graph accordance of next-generation sequence assemblies

被引:41
|
作者
Yao, Guohui [1 ]
Ye, Liang [1 ]
Gao, Hongyu [1 ]
Minx, Patrick [1 ]
Warren, Wesley C. [1 ]
Weinstock, George M. [1 ]
机构
[1] Washington Univ, Sch Med, Genome Inst, St Louis, MO 63108 USA
关键词
ALIGNMENT; QUALITY;
D O I
10.1093/bioinformatics/btr588
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: No individual assembly algorithm addresses all the known limitations of assembling short-length sequences. Overall reduced sequence contig length is the major problem that challenges the usage of these assemblies. We describe an algorithm to take advantages of different assembly algorithms or sequencing platforms to improve the quality of next-generation sequence (NGS) assemblies. Results: The algorithm is implemented as a graph accordance assembly (GAA) program. The algorithm constructs an accordance graph to capture the mapping information between the target and query assemblies. Based on the accordance graph, the contigs or scaffolds of the target assembly can be extended, merged or bridged together. Extra constraints, including gap sizes, mate pairs, scaffold order and orientation, are explored to enforce those accordance operations in the correct context. We applied GAA to various chicken NGS assemblies and the results demonstrate improved contiguity statistics and higher genome and gene coverage.
引用
收藏
页码:13 / 16
页数:4
相关论文
共 50 条
  • [1] Next-generation sequence analysis
    H Craig Mak
    Nature Biotechnology, 2011, 29 (1) : 45 - 46
  • [2] Next-generation sequencing and large genome assemblies
    Henson, Joseph
    Tischler, German
    Ning, Zemin
    PHARMACOGENOMICS, 2012, 13 (08) : 901 - 915
  • [3] A next-generation human genome sequence
    Church, Deanna M.
    SCIENCE, 2022, 376 (6588) : 34 - 35
  • [4] Towards Next-Generation Cybersecurity with Graph AI
    Bowman B.
    Howie Huang H.
    Operating Systems Review (ACM), 2021, 55 (01): : 61 - 67
  • [5] Graph Queries in a Next-Generation Datalog System
    Shkapsky, Alexander
    Zeng, Kai
    Zaniolo, Carlo
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (12): : 1258 - 1261
  • [6] Limitations of next-generation genome sequence assembly
    Alkan C.
    Sajjadian S.
    Eichler E.E.
    Nature Methods, 2011, 8 (1) : 61 - 65
  • [7] Limitations of next-generation genome sequence assembly
    Alkan, Can
    Sajjadian, Saba
    Eichler, Evan E.
    NATURE METHODS, 2011, 8 (01) : 61 - 65
  • [8] Managing and Analyzing Next-Generation Sequence Data
    Richter, Brent G.
    Sexton, David P.
    PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (06)
  • [9] GRASS: a generic algorithm for scaffolding next-generation sequencing assemblies
    Gritsenko, Alexey A.
    Nijkamp, Jurgen F.
    Reinders, Marcel J. T.
    de Ridder, Dick
    BIOINFORMATICS, 2012, 28 (11) : 1429 - 1437
  • [10] Next-generation diagnostics: Eliminating the excessive sequence processing associated with next-generation sequencing using EDNA
    Schneider, W. L.
    Stobbe, A. H.
    Daniels, J.
    Espindola, A. S.
    Verma, R.
    Blagden, T.
    Fletcher, J.
    Ochoa-Corona, F.
    Garzon, C.
    Hoyt, P. R.
    Melcher, U.
    PHYTOPATHOLOGY, 2012, 102 (07) : 155 - 155