Genome alignment with graph data structures: a comparison

被引:29
|
作者
Kehr, Birte [1 ,2 ]
Trappe, Kathrin [1 ]
Holtgrewe, Manuel [1 ]
Reinert, Knut [1 ]
机构
[1] Free Univ Berlin, Dept Comp Sci, D-14195 Berlin, Germany
[2] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
来源
BMC BIOINFORMATICS | 2014年 / 15卷
关键词
MULTIPLE SEQUENCE ALIGNMENT; ALGORITHM; PERMUTATIONS; ACCURACY;
D O I
10.1186/1471-2105-15-99
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Recent advances in rapid, low-cost sequencing have opened up the opportunity to study complete genome sequences. The computational approach of multiple genome alignment allows investigation of evolutionarily related genomes in an integrated fashion, providing a basis for downstream analyses such as rearrangement studies and phylogenetic inference. Graphs have proven to be a powerful tool for coping with the complexity of genome-scale sequence alignments. The potential of graphs to intuitively represent all aspects of genome alignments led to the development of graph-based approaches for genome alignment. These approaches construct a graph from a set of local alignments, and derive a genome alignment through identification and removal of graph substructures that indicate errors in the alignment. Results: We compare the structures of commonly used graphs in terms of their abilities to represent alignment information. We describe how the graphs can be transformed into each other, and identify and classify graph substructures common to one or more graphs. Based on previous approaches, we compile a list of modifications that remove these substructures. Conclusion: We show that crucial pieces of alignment information, associated with inversions and duplications, are not visible in the structure of all graphs. If we neglect vertex or edge labels, the graphs differ in their information content. Still, many ideas are shared among all graph-based approaches. Based on these findings, we outline a conceptual framework for graph-based genome alignment that can assist in the development of future genome alignment tools.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Bit-parallel sequence-to-graph alignment
    Rautiainen, Mikko
    Maekinen, Veli
    Marschall, Tobias
    BIOINFORMATICS, 2019, 35 (19) : 3599 - 3607
  • [22] GraphAligner: rapid and versatile sequence-to-graph alignment
    Rautiainen, Mikko
    Marschall, Tobias
    GENOME BIOLOGY, 2020, 21 (01) : 253
  • [23] Certain fuzzy graph structures
    Akram, Muhammad
    Sitara, Muzzamal
    JOURNAL OF APPLIED MATHEMATICS AND COMPUTING, 2019, 61 (1-2) : 25 - 56
  • [24] Certain fuzzy graph structures
    Muhammad Akram
    Muzzamal Sitara
    Journal of Applied Mathematics and Computing, 2019, 61 : 25 - 56
  • [25] On Molecular Graph Comparison
    Melo, Jenny A.
    Daza, Edgar
    CURRENT COMPUTER-AIDED DRUG DESIGN, 2011, 7 (02) : 83 - 89
  • [26] Overlap graphs and de Bruijn graphs: data structures for de novo genome assembly in the big data era
    Rizzi, Raffaella
    Beretta, Stefano
    Patterson, Murray
    Pirola, Yuri
    Previtali, Marco
    Della Vedova, Gianluca
    Bonizzoni, Paola
    QUANTITATIVE BIOLOGY, 2019, 7 (04) : 278 - 292
  • [27] Superstring Graph: A New Approach for Genome Assembly
    Cazaux, Bastien
    Sacomoto, Gustavo
    Rivals, Eric
    ALGORITHMIC ASPECTS IN INFORMATION AND MANAGEMENT, 2016, 9778 : 39 - 52
  • [28] FastZ: Accelerating Gapped Whole Genome Alignment on GPUs
    Gundabolu, Sree Charan
    Vijaykumar, T. N.
    Thottethodi, Mithuna
    SC21: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2021,
  • [29] Enhancing the Efficiency of Unsupervised Network Alignment Using Quotient Graph
    Zhang, Lei
    Qian, Feng
    IEEE ACCESS, 2025, 13 : 46495 - 46513
  • [30] Residue Product of Fuzzy Graph Structures
    Akram, Muhammad
    Sitara, Muzzamal
    Saeid, Arsham Borumand
    JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2020, 34 (3-4) : 365 - 399