A layout framework for genome-wide multiple sequence alignment graphs

被引:0
|
作者
Schebera, Jeremias [1 ,2 ]
Zeckzer, Dirk [1 ]
Wiegreffe, Daniel [1 ]
机构
[1] Univ Leipzig, Inst Comp Sci, Image & Signal Proc Grp, Leipzig, Germany
[2] Univ Leipzig, Ctr Scalable Data Analyt & Artificial Intelligence, Leipzig, Germany
来源
FRONTIERS IN BIOINFORMATICS | 2024年 / 4卷
关键词
genome analysis; multiple sequence alignment; graph drawing; visualization; genome comparison; VISUALIZATION;
D O I
10.3389/fbinf.2024.1358374
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sequence alignments are often used to analyze genomic data. However, such alignments are often only calculated and compared on small sequence intervals for analysis purposes. When comparing longer sequences, these are usually divided into shorter sequence intervals for better alignment results. This usually means that the order context of the original sequence is lost. To prevent this, it is possible to use a graph structure to represent the order of the original sequence on the alignment blocks. The visualization of these graph structures can provide insights into the structural variations of genomes in a semi-global context. In this paper, we propose a new graph drawing framework for representing gMSA data. We produce a hierarchical graph layout that supports the comparative analysis of genomes. Based on a reference, the differences and similarities of the different genome orders are visualized. In this work, we present a complete graph drawing framework for gMSA graphs together with the respective algorithms for each of the steps. Additionally, we provide a prototype and an example data set for analyzing gMSA graphs. Based on this data set, we demonstrate the functionalities of the framework using two examples.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Genome-wide analysis of alternative splicing during human heart development
    Wang, He
    Chen, Yanmei
    Li, Xinzhong
    Chen, Guojun
    Zhong, Lintao
    Chen, Gangbing
    Liao, Yulin
    Liao, Wangjun
    Bin, Jianping
    SCIENTIFIC REPORTS, 2016, 6
  • [42] Complete genome sequence and annotation of the laboratory reference strain Shigella flexneri serotype 5a M90T and genome-wide transcriptional start site determination
    Cervantes-Rivera, Ramon
    Tronnet, Sophie
    Puhar, Andrea
    BMC GENOMICS, 2020, 21 (01)
  • [43] Genome-wide identification and analysis of elongase of very long chain fatty acid genes in the silkworm, Bombyx mori
    Zuo, Weidong
    Li, Chunlin
    Luan, Yue
    Zhang, Hao
    Tong, Xiaoling
    Han, Minjin
    Gao, Rui
    Hu, Hai
    Song, Jiangbo
    Dai, Fangyin
    Lu, Cheng
    GENOME, 2018, 61 (03) : 167 - 176
  • [44] An Algorithm of Multiple Sequence Alignment Based on Consensus Sequence Searched by Simulated Annealing and Star Alignment
    Yao, Dengfeng
    Jiang, Minghu
    You, Xu
    Abulizi, Abudoukelimu
    Hou, Renkui
    2015 INTERNATIONAL SYMPOSIUM ON BIOELECTRONICS AND BIOINFORMATICS (ISBB), 2015, : 3 - 6
  • [45] MSARC: Multiple sequence alignment by residue clustering
    Modzelewski, Michal
    Dojer, Norbert
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2014, 9
  • [46] A NEW GENETIC ALGORITHM FOR MULTIPLE SEQUENCE ALIGNMENT
    Narimani, Zahra
    Beigy, Hamid
    Abolhassani, Hassan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2012, 11 (04)
  • [47] Characterization of pairwise and multiple sequence alignment errors
    Landan, Giddy
    Graur, Dan
    GENE, 2009, 441 (1-2) : 141 - 147
  • [48] Assessing the efficiency of multiple sequence alignment programs
    Fabiano Sviatopolk-Mirsky Pais
    Patrícia de Cássia Ruy
    Guilherme Oliveira
    Roney Santos Coimbra
    Algorithms for Molecular Biology, 9
  • [49] A time warping approach to multiple sequence alignment
    Arribas-Gil, Ana
    Matias, Catherine
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2017, 16 (02) : 133 - 144
  • [50] Parallel multiple sequence alignment with dynamic scheduling
    Luo, JC
    Ahmad, I
    Ahmed, M
    Paul, R
    ITCC 2005: International Conference on Information Technology: Coding and Computing, Vol 1, 2005, : 8 - 13