HomBlocks: A multiple-alignment construction pipeline for organelle phylogenomics based on locally collinear block searching

被引:192
作者
Bi, Guiqi [1 ,2 ]
Mao, Yunxiang [1 ,2 ,3 ]
Xing, Qikun [1 ,2 ]
Cao, Min [1 ,2 ]
机构
[1] Minist Educ, Key Lab Marine Genet & Breeding OUC, Qingdao, Peoples R China
[2] Ocean Univ China, Coll Marine Life Sci, Qingdao, Peoples R China
[3] Qingdao Natl Lab Marine Sci & Technol, Lab Marine Biol & Biotechnol, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
Organelle phylogenomics; Locally collinear blocks; Alignment construction; Efficient pipeline; SELECTION; TOOL;
D O I
10.1016/j.ygeno.2017.08.001
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Organelle phylogenomic analysis requires precisely constructed multi-gene alignment matrices concatenated by pre-aligned single gene datasets. For non-bioinformaticians, it can take days to weeks to manually create high-quality multi-gene alignments comprising tens or hundreds of homologous genes. Here, we describe a new and highly efficient pipeline, HomBlocks, which uses a homologous block searching method to construct multiple sequence alignment. This approach can automatically recognize locally collinear blocks among organelle genomes and excavate phylogenetically informative regions to construct multiple sequence alignment in a few hours. In addition, HomBlocks supports organelle genomes without annotation and makes adjustment to different taxon datasets, thereby enabling the inclusion of as many common genes as possible. Topology comparison of trees built by conventional multi-gene and HomBlocks alignments implemented in different taxon categories shows that the same efficiency can be achieved by HomBlocks as when using the traditional method. The availability of Homblocks makes organelle phylogenetic analyses more accessible to non-bioinformaticians, thereby promising to lead to a better understanding of phylogenic relationships at an organelle genome level. Availability and implementation: HomBlocks is implemented in Perl and is supported by Unix-like operative systems, including Linux and macOS. The Perl source code is freely available for download from https://github.com/fenghen360/HomBlocks.git, and documentation and tutorials are available at https://github.com/fenghen360/HomBlocks.
引用
收藏
页码:18 / 22
页数:5
相关论文
共 15 条
[1]   BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons [J].
Alikhan, Nabil-Fareed ;
Petty, Nicola K. ;
Ben Zakour, Nouri L. ;
Beatson, Scott A. .
BMC GENOMICS, 2011, 12
[2]   Do orthologous gene phylogenies really support tree-thinking? [J].
Bapteste, E ;
Susko, E ;
Leigh, J ;
MacLeod, D ;
Charlebois, RL ;
Doolittle, WF .
BMC EVOLUTIONARY BIOLOGY, 2005, 5 (1)
[3]   trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses [J].
Capella-Gutierrez, Salvador ;
Silla-Martinez, Jose M. ;
Gabaldon, Toni .
BIOINFORMATICS, 2009, 25 (15) :1972-1973
[4]   Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis [J].
Castresana, J .
MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (04) :540-552
[5]   Choosing the best genes for the job: The case for stationary genes in genome-scale phylogenetics [J].
Collins, TM ;
Fedrigo, O ;
Naylor, GJP .
SYSTEMATIC BIOLOGY, 2005, 54 (03) :493-500
[6]   BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments [J].
Criscuolo, Alexis ;
Gribaldo, Simonetta .
BMC EVOLUTIONARY BIOLOGY, 2010, 10
[7]   Mauve: Multiple alignment of conserved genomic sequence with rearrangements [J].
Darling, ACE ;
Mau, B ;
Blattner, FR ;
Perna, NT .
GENOME RESEARCH, 2004, 14 (07) :1394-1403
[8]   Noisy:: Identification of problematic columns in multiple sequence alignments [J].
Dress, Andreas W. M. ;
Flamm, Christoph ;
Fritzsch, Guido ;
Gruenewald, Stefan ;
Kruspe, Matthias ;
Prohaska, Sonja J. ;
Stadler, Peter F. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2008, 3 (1)
[9]   Shotgun Mitogenomics Provides a Reference Phylogenetic Framework and Timescale for Living Xenarthrans [J].
Gibb, Gillian C. ;
Condamine, Fabien L. ;
Kuch, Melanie ;
Enk, Jacob ;
Moraes-Barros, Nadia ;
Superina, Mariella ;
Poinar, Hendrik N. ;
Delsuc, Frederic .
MOLECULAR BIOLOGY AND EVOLUTION, 2016, 33 (03) :621-642
[10]   PartitionFinder: Combined Selection of Partitioning Schemes and Substitution Models for Phylogenetic Analyses [J].
Lanfear, Robert ;
Calcott, Brett ;
Ho, Simon Y. W. ;
Guindon, Stephane .
MOLECULAR BIOLOGY AND EVOLUTION, 2012, 29 (06) :1695-1701