cblaster: a remote search tool for rapid identification and visualization of homologous gene clusters

被引:133
作者
Gilchrist, Cameron L. M. [1 ]
Booth, Thomas J. [1 ]
van Wersch, Bram [2 ]
van Grieken, Liana [2 ]
Medema, Marnix H. [2 ]
Chooi, Yit-Heng [1 ]
机构
[1] Univ Western Australia, Sch Mol Sci, Crawley, WA 6009, Australia
[2] Wageningen Univ, Bioinformat Grp, NL-6708 PB Wageningen, Netherlands
来源
BIOINFORMATICS ADVANCES | 2021年 / 1卷 / 01期
基金
澳大利亚研究理事会;
关键词
BIOSYNTHESIS; BIOLOGY; CLONING;
D O I
10.1093/bioadv/vbab016
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
MotivationGenes involved in coordinated biological pathways, including metabolism, drug resistance and virulence, are often collocalized as gene clusters. Identifying homologous gene clusters aids in the study of their function and evolution, however, existing tools are limited to searching local sequence databases. Tools for remotely searching public databases are necessary to keep pace with the rapid growth of online genomic data.ResultsHere, we present cblaster, a Python-based tool to rapidly detect collocated genes in local and remote databases. cblaster is easy to use, offering both a command line and a user-friendly graphical user interface. It generates outputs that enable intuitive visualizations of large datasets and can be readily incorporated into larger bioinformatic pipelines. cblaster is a significant update to the comparative genomics toolbox.Availability and implementationcblaster source code and documentation is freely available from GitHub under the MIT license (github.com/gamcil/cblaster).Supplementary information are available at Bioinformatics Advances online.
引用
收藏
页数:10
相关论文
共 48 条
[1]  
Agarwala R, 2018, NUCLEIC ACIDS RES, V46, pD8, DOI [10.1093/nar/gks1189, 10.1093/nar/gkx1095, 10.1093/nar/gkq1172]
[2]   antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline [J].
Blin, Kai ;
Shaw, Simon ;
Steinke, Katharina ;
Villebro, Rasmus ;
Ziemert, Nadine ;
Lee, Sang Yup ;
Medema, Marnix H. ;
Weber, Tilmann .
NUCLEIC ACIDS RESEARCH, 2019, 47 (W1) :W81-W87
[3]   D3: Data-Driven Documents [J].
Bostock, Michael ;
Ogievetsky, Vadim ;
Heer, Jeffrey .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (12) :2301-2309
[4]   Fragmentation of an aflatoxin-like gene cluster in a forest pathogen [J].
Bradshaw, Rosie E. ;
Slot, Jason C. ;
Moore, Geromy G. ;
Chettri, Pranav ;
de Wit, Pierre J. G. M. ;
Ehrlich, Kenneth C. ;
Ganley, Austen R. D. ;
Olson, Malin A. ;
Rokas, Antonis ;
Carbone, Ignazio ;
Cox, Murray P. .
NEW PHYTOLOGIST, 2013, 198 (02) :525-535
[5]   Sensitive protein alignments at tree-of-life scale using DIAMOND [J].
Buchfink, Benjamin ;
Reuter, Klaus ;
Drost, Hajk-Georg .
NATURE METHODS, 2021, 18 (04) :366-+
[6]   Next-generation sequencing approach for connecting secondary metabolites to biosynthetic gene clusters in fungi [J].
Cacho, Ralph A. ;
Tang, Yi ;
Chooi, Yit-Heng .
FRONTIERS IN MICROBIOLOGY, 2015, 5
[7]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10
[8]   Evolutionary dynamics of natural product biosynthesis in bacteria [J].
Chevrette, Marc G. ;
Gutierrez-Garcia, Karina ;
Selem-Mojica, Nelly ;
Aguilar-Martinez, Cesar ;
Yanez-Olvera, Alan ;
Ramos-Aboites, Hilda E. ;
Hoskisson, Paul A. ;
Barona-Gomez, Francisco .
NATURAL PRODUCT REPORTS, 2020, 37 (04) :566-599
[9]   Biopython']python: freely available Python']Python tools for computational molecular biology and bioinformatics [J].
Cock, Peter J. A. ;
Antao, Tiago ;
Chang, Jeffrey T. ;
Chapman, Brad A. ;
Cox, Cymon J. ;
Dalke, Andrew ;
Friedberg, Iddo ;
Hamelryck, Thomas ;
Kauff, Frank ;
Wilczynski, Bartek ;
de Hoon, Michiel J. L. .
BIOINFORMATICS, 2009, 25 (11) :1422-1423
[10]  
De los Santos EL., 2017, BIORXIV