Genome alignment with graph data structures: a comparison

Cited by: 29
Authors
Kehr, Birte [1, 2]
Trappe, Kathrin [1]
Holtgrewe, Manuel [1]
Reinert, Knut [1]
Affiliations
[1] Free Univ Berlin, Dept Comp Sci, D-14195 Berlin, Germany
[2] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
Source
BMC BIOINFORMATICS | 2014 / Vol. 15
Keywords
MULTIPLE SEQUENCE ALIGNMENT; ALGORITHM; PERMUTATIONS; ACCURACY;
DOI
10.1186/1471-2105-15-99
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Background: Recent advances in rapid, low-cost sequencing have opened up the opportunity to study complete genome sequences. The computational approach of multiple genome alignment allows investigation of evolutionarily related genomes in an integrated fashion, providing a basis for downstream analyses such as rearrangement studies and phylogenetic inference. Graphs have proven to be a powerful tool for coping with the complexity of genome-scale sequence alignments. The potential of graphs to intuitively represent all aspects of genome alignments led to the development of graph-based approaches for genome alignment. These approaches construct a graph from a set of local alignments, and derive a genome alignment through identification and removal of graph substructures that indicate errors in the alignment.
Results: We compare the structures of commonly used graphs in terms of their abilities to represent alignment information. We describe how the graphs can be transformed into each other, and identify and classify graph substructures common to one or more graphs. Based on previous approaches, we compile a list of modifications that remove these substructures.
Conclusion: We show that crucial pieces of alignment information, associated with inversions and duplications, are not visible in the structure of all graphs. If we neglect vertex or edge labels, the graphs differ in their information content. Still, many ideas are shared among all graph-based approaches. Based on these findings, we outline a conceptual framework for graph-based genome alignment that can assist in the development of future genome alignment tools.
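The general recipe named in the abstract (build a graph from local alignments, then look for substructures that need attention) can be illustrated with a toy sketch. This is not the method of the paper or of any tool it compares. The segment tuples `(genome_id, start, end)`, the function names, and the "two segments from the same genome in one component" criterion are all assumptions chosen for illustration. Such a component may reflect a genuine duplication rather than an error, so the sketch only flags it for inspection.

```python
from collections import defaultdict

# Hypothetical representation: a segment is (genome_id, start, end),
# and a local alignment is a pair of segments judged to be homologous.

def build_alignment_graph(local_alignments):
    """Vertices are segments; an undirected edge joins two aligned segments."""
    graph = defaultdict(set)
    for seg_a, seg_b in local_alignments:
        graph[seg_a].add(seg_b)
        graph[seg_b].add(seg_a)
    return dict(graph)

def connected_components(graph):
    """Group segments that are linked, directly or transitively, by alignments."""
    seen = set()
    components = []
    for start in graph:
        if start in seen:
            continue
        seen.add(start)
        stack = [start]
        component = set()
        while stack:
            vertex = stack.pop()
            component.add(vertex)
            for neighbour in graph[vertex]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        components.append(component)
    return components

def flag_for_inspection(components):
    """Return components containing more than one segment of the same genome.

    Assumed criterion for this sketch only: such a component may stem from
    a duplication or from a spurious local alignment.
    """
    flagged = []
    for component in components:
        genomes = [segment[0] for segment in component]
        if len(genomes) != len(set(genomes)):
            flagged.append(component)
    return flagged

if __name__ == "__main__":
    alignments = [
        (("A", 0, 100), ("B", 10, 110)),
        (("B", 10, 110), ("C", 5, 105)),
        (("A", 200, 300), ("A", 500, 600)),  # two segments of genome A
    ]
    graph = build_alignment_graph(alignments)
    for component in flag_for_inspection(connected_components(graph)):
        print(sorted(component))
```

Segment orientation is deliberately left out of the sketch, so inversions cannot be represented in it at all. The abstract notes that information about inversions and duplications is not visible in the structure of every graph.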
Pages: 20
Related papers
50 in total
  • [31] Optimal Alignment of Structures for Finite and Periodic Systems
    Griffiths, Matthew
    Niblett, Samuel P.
    Wales, David J.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2017, 13 (10) : 4914 - 4931
  • [32] Multiple Alignment of Structures Using Center Of ProTeins
    Roy, Kaushik
    Panigrahi, Satish Chandra
    Mukhopadhyay, Asish
    BIOINFORMATICS RESEARCH AND APPLICATIONS (ISBRA 2015), 2015, 9096 : 284 - 296
  • [33] MLProbs: A Data-Centric Pipeline for Better Multiple Sequence Alignment
    Kuang, Mengmeng
    Zhang, Yong
    Lam, Tak-Wah
    Ting, Hing-Fung
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 524 - 533
  • [34] MetaFast: fast reference-free graph-based comparison of shotgun metagenomic data
    Ulyantsev, Vladimir I.
    Kazakov, Sergey V.
    Dubinkina, Veronika B.
    Tyakht, Alexander V.
    Alexeev, Dmitry G.
    BIOINFORMATICS, 2016, 32 (18) : 2760 - 2767
  • [35] SuiteMSA: visual tools for multiple sequence alignment comparison and molecular sequence simulation
    Anderson, Catherine L.
    Strope, Cory L.
    Moriyama, Etsuko N.
    BMC BIOINFORMATICS, 2011, 12
  • [36] Whole-Genome Alignment: Methods, Challenges, and Future Directions
    Saada, Bacem
    Zhang, Tianchi
    Siga, Estevao
    Zhang, Jing
    Magalhaes Muniz, Maria Malane
    APPLIED SCIENCES-BASEL, 2024, 14 (11):
  • [37] Multiple genome alignment in the telomere-to-telomere assembly era
    Kille, Bryce
    Balaji, Advait
    Sedlazeck, Fritz J.
    Nute, Michael
    Treangen, Todd J.
    GENOME BIOLOGY, 2022, 23 (01) : 182
  • [38] A new graph-based method for pairwise global network alignment
    Klau, Gunnar W.
    BMC BIOINFORMATICS, 2009, 10
  • [39] A graph-based approach to diploid genome assembly
    Garg, Shilpa
    Rautiainen, Mikko
    Novak, Adam M.
    Garrison, Erik
    Durbin, Richard
    Marschall, Tobias
    BIOINFORMATICS, 2018, 34 (13) : 105 - 114
  • [40] Fast RNA Structure Alignment for Crossing Input Structures
    Backofen, Rolf
    Landau, Gad M.
    Moehl, Mathias
    Tsur, Dekel
    Weimann, Oren
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2009, 5577 : 236 - +