MBGD update 2015: microbial genome database for flexible ortholog analysis utilizing a diverse set of genomic data

被引:47
作者
Uchiyama, Ikuo [1 ,2 ]
Mihara, Motohiro [3 ]
Nishide, Hiroyo [2 ]
Chiba, Hirokazu [1 ]
机构
[1] Natl Inst Nat Sci, Lab Genome Informat, Natl Inst Basic Biol, Okazaki, Aichi 4448585, Japan
[2] Natl Inst Nat Sci, Data Integrat & Anal Facil, Natl Inst Basic Biol, Okazaki, Aichi 4448585, Japan
[3] Dynacom Co Ltd, Chuo Ku, Kobe, Hyogo 6510088, Japan
基金
日本学术振兴会;
关键词
GENES; TOOL;
D O I
10.1093/nar/gku1152
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The microbial genome database for comparative analysis (MBGD) (http://mbgd.genome.ad.jp/) is a comprehensive ortholog database for flexible comparative analysis of microbial genomes, where the users are allowed to create an ortholog table among any specified set of organisms. Because of the rapid increase in microbial genome data owing to the next-generation sequencing technology, it becomes increasingly challenging to maintain high-quality orthology relationships while allowing the users to incorporate the latest genomic data available into an analysis. Because many of the recently accumulating genomic data are draft genome sequences for which some complete genome sequences of the same or closely related species are available, MBGD now stores draft genome data and allows the users to incorporate them into a user-specific ortholog database using the MyMBGD functionality. In this function, draft genome data are incorporated into an existing ortholog table created only from the complete genome data in an incremental manner to prevent low-quality draft data from affecting clustering results. In addition, to provide high-quality orthology relationships, the standard ortholog table containing all the representative genomes, which is first created by the rapid classification program DomClust, is now refined using DomRefine, a recently developed program for improving domain-level clustering using multiple sequence alignment information.
引用
收藏
页码:D270 / D276
页数:7
相关论文
共 21 条
[1]   OMA 2011: orthology inference among 1000 complete genomes [J].
Altenhoff, Adrian M. ;
Schneider, Adrian ;
Gonnet, Gaston H. ;
Dessimoz, Christophe .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D289-D294
[2]  
Benson DA, 2007, NUCLEIC ACIDS RES, V35, pD21, DOI [10.1093/nar/gks1195, 10.1093/nar/gkp1024, 10.1093/nar/gkl986, 10.1093/nar/gkx1094, 10.1093/nar/gkw1070, 10.1093/nar/gkr1202, 10.1093/nar/gkn723, 10.1093/nar/gkg057, 10.1093/nar/gkq1079]
[3]   Genome sequencing in clinical microbiology [J].
Chan, Jacqueline Z-M ;
Pallen, Mark J. ;
Oppenheim, Beryl ;
Constantinidou, Chrystala .
NATURE BIOTECHNOLOGY, 2012, 30 (11) :1068-1071
[4]   DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection [J].
Chen, Ting-wen ;
Wu, Timothy H. ;
Ng, Wailap V. ;
Lin, Wen-chang .
BMC BIOINFORMATICS, 2010, 11
[5]   Improvement of domain-level ortholog clustering by optimizing domain-specific sum-of-pairs score [J].
Chiba, Hirokazu ;
Uchiyama, Ikuo .
BMC BIOINFORMATICS, 2014, 15
[6]   MicrobesOnline: an integrated portal for comparative and functional genomics [J].
Dehal, Paramvir S. ;
Joachimiak, Marcin P. ;
Price, Morgan N. ;
Bates, John T. ;
Baumohl, Jason K. ;
Chivian, Dylan ;
Friedland, Greg D. ;
Huang, Katherine H. ;
Keller, Keith ;
Novichkov, Pavel S. ;
Dubchak, Inna L. ;
Alm, Eric J. ;
Arkin, Adam P. .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D396-D400
[7]   TIGRFAMs and Genome Properties in 2013 [J].
Haft, Daniel H. ;
Selengut, Jeremy D. ;
Richter, Roland A. ;
Harkins, Derek ;
Basu, Malay K. ;
Beck, Erin .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D387-D395
[8]   InterPro in 2011: new developments in the family and domain prediction database [J].
Hunter, Sarah ;
Jones, Philip ;
Mitchell, Alex ;
Apweiler, Rolf ;
Attwood, Teresa K. ;
Bateman, Alex ;
Bernard, Thomas ;
Binns, David ;
Bork, Peer ;
Burge, Sarah ;
de Castro, Edouard ;
Coggill, Penny ;
Corbett, Matthew ;
Das, Ujjwal ;
Daugherty, Louise ;
Duquenne, Lauranne ;
Finn, Robert D. ;
Fraser, Matthew ;
Gough, Julian ;
Haft, Daniel ;
Hulo, Nicolas ;
Kahn, Daniel ;
Kelly, Elizabeth ;
Letunic, Ivica ;
Lonsdale, David ;
Lopez, Rodrigo ;
Madera, Martin ;
Maslen, John ;
McAnulla, Craig ;
McDowall, Jennifer ;
McMenamin, Conor ;
Mi, Huaiyu ;
Mutowo-Muellenet, Prudence ;
Mulder, Nicola ;
Natale, Darren ;
Orengo, Christine ;
Pesseat, Sebastien ;
Punta, Marco ;
Quinn, Antony F. ;
Rivoire, Catherine ;
Sangrador-Vegas, Amaia ;
Selengut, Jeremy D. ;
Sigrist, Christian J. A. ;
Scheremetjew, Maxim ;
Tate, John ;
Thimmajanarthanan, Manjulapramila ;
Thomas, Paul D. ;
Wu, Cathy H. ;
Yeats, Corin ;
Yong, Siew-Yit .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D306-D312
[9]   Exploration and grading of possible genes from 183 bacterial strains by a common protocol to identification of new genes: Gene Trek in Prokaryote Space (GTPS) [J].
Kosuge, Takehide ;
Abe, Takashi ;
Okido, Toshihisa ;
Tanaka, Naoto ;
Hirahata, Masaki ;
Maruyama, Yutaka ;
Mashima, Jun ;
Tomiki, Aki ;
Kurokawa, Motoyoshi ;
Himeno, Ryutaro ;
Fukuchi, Satoshi ;
Miyazaki, Satoru ;
Gojobori, Takashi ;
Tateno, Yoshio ;
Sugawara, Hideaki .
DNA RESEARCH, 2006, 13 (06) :245-254
[10]   High-throughput bacterial genome sequencing: an embarrassment of choice, a world of opportunity [J].
Loman, Nicholas J. ;
Constantinidou, Chrystala ;
Chan, Jacqueline Z. M. ;
Halachev, Mihail ;
Sergeant, Martin ;
Penn, Charles W. ;
Robinson, Esther R. ;
Pallen, Mark J. .
NATURE REVIEWS MICROBIOLOGY, 2012, 10 (09) :599-606