ggmsa: a visual exploration tool for multiple sequence alignment and associated data

被引:98
|
作者
Zhou, Lang
Feng, Tingze
Xu, Shuangbin
Gao, Fangluan [1 ]
Lam, Tommy T. [2 ]
Wang, Qianwen [3 ]
Wu, Tianzhi [4 ]
Huang, Huina
Zhan, Li
Li, Lin
Guan, Yi [5 ]
Dai, Zehan [6 ]
Yu, Guangchuang [6 ]
机构
[1] Fujian Agr & Forestry Univ, Inst Plant Virol, Fuzhou, Peoples R China
[2] Univ Hong Kong, Sch Publ Hlth, Hong Kong, Peoples R China
[3] Southern Med Univ, Sch Basic Med Sci, Dept Bioinformat, Guangzhou, Peoples R China
[4] Southern Med Univ, Dept Bioinformat, Guangzhou, Peoples R China
[5] Univ Hong Kong, State Key Lab Emerging Infect Dis, Hong Kong, Peoples R China
[6] Southern Med Univ, Guangzhou, Peoples R China
关键词
multiple sequence alignment; sequence bundle; sequence recombination; phylogeny; VISUALIZATION; RNA; RESIDUES;
D O I
10.1093/bib/bbac222
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The identification of the conserved and variable regions in the multiple sequence alignment (MSA) is critical to accelerating the process of understanding the function of genes. MSA visualizations allow us to transform sequence features into understandable visual representations. As the sequence-structure-function relationship gains increasing attention in molecular biology studies, the simple display of nucleotide or protein sequence alignment is not satisfied. A more scalable visualization is required to broaden the scope of sequence investigation. Here we present ggmsa, an R package for mining comprehensive sequence features and integrating the associated data of MSA by a variety of display methods. To uncover sequence conservation patterns, variations and recombination at the site level, sequence bundles, sequence logos, stacked sequence alignment and comparative plots are implemented. ggmsa supports integrating the correlation of MSA sequences and their phenotypes, as well as other traits such as ancestral sequences, molecular structures, molecular functions and expression levels. We also design a new visualization method for genome alignments in multiple alignment format to explore the pattern of within and between species variation. Combining these visual representations with prime knowledge, ggmsa assists researchers in discovering MSA and making decisions. The ggmsa package is open-source software released under the Artistic-2.0 license, and it is freely available on Bioconductor (https://bioconductor.org/packages/ggmsa) and Github (https://github.com/YuLab-SMU/ggmsa).
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Assessing the efficiency of multiple sequence alignment programs
    Fabiano Sviatopolk-Mirsky Pais
    Patrícia de Cássia Ruy
    Guilherme Oliveira
    Roney Santos Coimbra
    Algorithms for Molecular Biology, 9
  • [32] A time warping approach to multiple sequence alignment
    Arribas-Gil, Ana
    Matias, Catherine
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2017, 16 (02) : 133 - 144
  • [33] Parallel multiple sequence alignment with dynamic scheduling
    Luo, JC
    Ahmad, I
    Ahmed, M
    Paul, R
    ITCC 2005: International Conference on Information Technology: Coding and Computing, Vol 1, 2005, : 8 - 13
  • [34] A survey on the algorithm and development of multiple sequence alignment
    Zhang, Yongqing
    Zhang, Qiang
    Zhou, Jiliu
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (03)
  • [35] Parametrizing Multicore Architectures for Multiple Sequence Alignment
    Isaza, Sebastian
    Sanchez, Friman
    Cabarcas, Felipe
    Ramirez, Alex
    Gaydadjiev, Georgi
    PROCEEDINGS OF THE 2011 8TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF 2011), 2011,
  • [36] Multiple sequence alignment accuracy and phylogenetic inference
    Ogden, TH
    Rosenberg, MS
    SYSTEMATIC BIOLOGY, 2006, 55 (02) : 314 - 328
  • [37] Metaheuristics for multiple sequence alignment: A systematic review
    Amorim, Anderson Rici
    Zafalon, Geraldo Francisco Donega
    Contessoto, Allan de Godoi
    Valencio, Carlos Roberto
    Sato, Liria Matsumoto
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2021, 94
  • [38] NEURAL TIME WARPING FOR MULTIPLE SEQUENCE ALIGNMENT
    Kawano, Keisuke
    Kutsuna, Takuro
    Koide, Satoshi
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3837 - 3841
  • [39] Recent progress in multiple sequence alignment: a survey
    Notredame, C
    PHARMACOGENOMICS, 2002, 3 (01) : 131 - 144
  • [40] A hybrid genetic search for multiple sequence alignment
    Moon, Seung-Hyun
    Choi, Sung-Soon
    Moon, Byung-Ro
    GECCO 2006: Genetic and Evolutionary Computation Conference, Vol 1 and 2, 2006, : 303 - 304