Genome alignment with graph data structures: a comparison

被引:29
|
作者
Kehr, Birte [1 ,2 ]
Trappe, Kathrin [1 ]
Holtgrewe, Manuel [1 ]
Reinert, Knut [1 ]
机构
[1] Free Univ Berlin, Dept Comp Sci, D-14195 Berlin, Germany
[2] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
来源
BMC BIOINFORMATICS | 2014年 / 15卷
关键词
MULTIPLE SEQUENCE ALIGNMENT; ALGORITHM; PERMUTATIONS; ACCURACY;
D O I
10.1186/1471-2105-15-99
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Recent advances in rapid, low-cost sequencing have opened up the opportunity to study complete genome sequences. The computational approach of multiple genome alignment allows investigation of evolutionarily related genomes in an integrated fashion, providing a basis for downstream analyses such as rearrangement studies and phylogenetic inference. Graphs have proven to be a powerful tool for coping with the complexity of genome-scale sequence alignments. The potential of graphs to intuitively represent all aspects of genome alignments led to the development of graph-based approaches for genome alignment. These approaches construct a graph from a set of local alignments, and derive a genome alignment through identification and removal of graph substructures that indicate errors in the alignment. Results: We compare the structures of commonly used graphs in terms of their abilities to represent alignment information. We describe how the graphs can be transformed into each other, and identify and classify graph substructures common to one or more graphs. Based on previous approaches, we compile a list of modifications that remove these substructures. Conclusion: We show that crucial pieces of alignment information, associated with inversions and duplications, are not visible in the structure of all graphs. If we neglect vertex or edge labels, the graphs differ in their information content. Still, many ideas are shared among all graph-based approaches. Based on these findings, we outline a conceptual framework for graph-based genome alignment that can assist in the development of future genome alignment tools.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Graph-based data mining: A new tool for the analysis and comparison of scientific domains represented as scientograms
    Quirin, Arnaud
    Cordon, Oscar
    Vargas-Quesada, Benjamin
    de Moya-Anegon, Felix
    JOURNAL OF INFORMETRICS, 2010, 4 (03) : 291 - 312
  • [42] A greedy, graph-based algorithm for the alignment of multiple homologous gene lists
    Fostier, Jan
    Proost, Sebastian
    Dhoedt, Bart
    Saeys, Yvan
    Demeester, Piet
    Van de Peer, Yves
    Vandepoele, Klaas
    BIOINFORMATICS, 2011, 27 (06) : 749 - 756
  • [43] Multiple Sequence Alignment Algorithm Based on a Dispersion Graph and Ant Colony Algorithm
    Chen, Weiyang
    Liao, Bo
    Zhu, Wen
    Xiang, Xuyu
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2009, 30 (13) : 2031 - 2038
  • [44] Global alignment of protein-protein interaction networks by graph matching methods
    Zaslavskiy, Mikhail
    Bach, Francis
    Vert, Jean-Philippe
    BIOINFORMATICS, 2009, 25 (12) : I259 - I267
  • [45] Optimization of consistency-based multiple sequence alignment using Big Data technologies
    Jordi Lladós
    Fernando Cores
    Fernando Guirado
    The Journal of Supercomputing, 2019, 75 : 1310 - 1322
  • [46] Optimization of consistency-based multiple sequence alignment using Big Data technologies
    Llados, Jordi
    Cores, Fernando
    Guirado, Fernando
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (03) : 1310 - 1322
  • [47] TALCO: Tiling Genome Sequence Alignment using Convergence of Traceback Pointers
    Walia, Sumit
    Ye, Cheng
    Bera, Arkid
    Lodhavia, Dhruvi
    Turakhia, Yatish
    2024 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA 2024, 2024, : 498 - 514
  • [48] icoshift: An effective tool for the alignment of chromatographic data
    Tomasi, Giorgio
    Savorani, Francesco
    Engelsen, Soren B.
    JOURNAL OF CHROMATOGRAPHY A, 2011, 1218 (43) : 7832 - 7840
  • [49] An Interactive Gameplay to Crowdsource Multiple Sequence Alignment of Genome Sequences: Genenigma
    Meedeniya, D. A.
    Rukshan, S. A. P. A.
    Welivita, A.
    2019 9TH INTERNATIONAL CONFERENCE ON BIOSCIENCE, BIOCHEMISTRY AND BIOINFORMATICS (ICBBB 2019), 2019, : 28 - 35
  • [50] Heuristics for Breakpoint Graph Decomposition with Applications in Genome Rearrangement Problems
    Pinheiro, Pedro Olimpio
    Alexandrino, Alexsandro Oliveira
    Oliveira, Andre Rodrigues
    de Souza, Cid Carvalho
    Dias, Zanoni
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2020, 2020, 12558 : 129 - 140