COMBREX-DB: an experiment centered database of protein function: knowledge, predictions and knowledge gaps

被引:42
作者
Chang, Yi-Chien [1 ]
Hu, Zhenjun [1 ]
Rachlin, John [2 ]
Anton, Brian P. [3 ]
Kasif, Simon [1 ,4 ]
Roberts, Richard J. [3 ]
Steffen, Martin [4 ,5 ]
机构
[1] Boston Univ, Bioinformat Program, Boston, MA 02215 USA
[2] Diatom Software LLC, Holliston, MA 01746 USA
[3] New England Biolabs Inc, Ipswich, MA 01938 USA
[4] Boston Univ, Dept Biomed Engn, Boston, MA 02215 USA
[5] Boston Univ, Sch Med, Dept Pathol & Lab Med, Boston, MA 02118 USA
关键词
GENES; COLLECTION; IDENTIFICATION; EVOLUTION; ALIGNMENT; LIBRARY; GENOMES; UPDATE; LINKS;
D O I
10.1093/nar/gkv1324
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The COMBREX database (COMBREX-DB; combrex.bu.edu) is an online repository of information related to (i) experimentally determined protein function, (ii) predicted protein function, (iii) relationships among proteins of unknown function and various types of experimental data, including molecular function, protein structure, and associated phenotypes. The database was created as part of the novel COMBREX (COMputational BRidges to EXperiments) effort aimed at accelerating the rate of gene function validation. It currently holds information on similar to 3.3 million known and predicted proteins from over 1000 completely sequenced bacterial and archaeal genomes. The database also contains a prototype recommendation system for helping users identify those proteins whose experimental determination of function would be most informative for predicting function for other proteins within protein families. The emphasis on documenting experimental evidence for function predictions, and the prioritization of uncharacterized proteins for experimental testing distinguish COMBREX from other publicly available microbial genomics resources. This article describes updates to COMBREX-DB since an initial description in the 2011 NAR Database Issue.
引用
收藏
页码:D330 / D335
页数:6
相关论文
共 37 条
[1]   A genome-scale analysis for identification of genes required for growth or survival of Haemophilus influenzae [J].
Akerley, BJ ;
Rubin, EJ ;
Novick, VL ;
Amaya, K ;
Judson, N ;
Mekalanos, JJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (02) :966-971
[2]   The COMBREX Project: Design, Methodology, and Initial Results [J].
Anton, Brian P. ;
Chang, Yi-Chien ;
Brown, Peter ;
Choi, Han-Pil ;
Faller, Lina L. ;
Guleria, Jyotsna ;
Hu, Zhenjun ;
Klitgord, Niels ;
Levy-Moonshine, Ami ;
Maksad, Almaz ;
Mazumdar, Varun ;
McGettrick, Mark ;
Osmani, Lais ;
Pokrzywa, Revonda ;
Rachlin, John ;
Swaminathan, Rajeswari ;
Allen, Benjamin ;
Housman, Genevieve ;
Monahan, Caitlin ;
Rochussen, Krista ;
Tao, Kevin ;
Bhagwat, Ashok S. ;
Brenner, Steven E. ;
Columbus, Linda ;
de Crecy-Lagard, Valerie ;
Ferguson, Donald ;
Fomenkov, Alexey ;
Gadda, Giovanni ;
Morgan, Richard D. ;
Osterman, Andrei L. ;
Rodionov, Dmitry A. ;
Rodionova, Irina A. ;
Rudd, Kenneth E. ;
Soll, Dieter ;
Spain, James ;
Xu, Shuang-Yong ;
Bateman, Alex ;
Blumenthal, Robert M. ;
Bollinger, J. Martin ;
Chang, Woo-Suk ;
Ferrer, Manuel ;
Friedberg, Iddo ;
Galperin, Michael Y. ;
Gobeill, Julien ;
Haft, Daniel ;
Hunt, John ;
Karp, Peter ;
Klimke, William ;
Krebs, Carsten ;
Macelis, Dana .
PLOS BIOLOGY, 2013, 11 (08)
[3]   UniProt: a hub for protein information [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Apweiler, Rolf ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Cas-tro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightin-gale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Cowley, Andrew ;
Figueira, Luis ;
Li, Weizhong ;
McWilliam, Hamish .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D204-D212
[4]   The Classification and Evolution of Enzyme Function [J].
Cuesta, Sergio Martinez ;
Rahman, Syed Asad ;
Furnham, Nicholas ;
Thornton, Janet M. .
BIOPHYSICAL JOURNAL, 2015, 109 (06) :1082-1086
[5]   Berkeley PHOG: PhyloFacts orthology group prediction web server [J].
Datta, Ruchira S. ;
Meacham, Christopher ;
Samad, Bushra ;
Neyer, Christoph ;
Sjolander, Kimmen .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W84-W89
[6]   A complete collection of single-gene deletion mutants of Acinetobacter baylyi ADP1 [J].
de Berardinis, Veronique ;
Vallenet, David ;
Castelli, Vanina ;
Besnard, Marielle ;
Pinet, Agnes ;
Cruaud, Corinne ;
Samair, Sumitta ;
Lechaplais, Christophe ;
Gyapay, Gabor ;
Richez, Celine ;
Durot, Maxime ;
Kreimeyer, Annett ;
Le Fevre, Francois ;
Schaechter, Vincent ;
Pezo, Valerie ;
Doering, Volker ;
Scarpelli, Claude ;
Medigue, Claudine ;
Cohen, Georges N. ;
Marliere, Philippe ;
Salanoubat, Marcel ;
Weissenbach, Jean .
MOLECULAR SYSTEMS BIOLOGY, 2008, 4 (1)
[7]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797
[8]  
Felsenstein J., 2005, PHYLIP (phylogeny inference package) version 3.6
[9]   Pfam: the protein families database [J].
Finn, Robert D. ;
Bateman, Alex ;
Clements, Jody ;
Coggill, Penelope ;
Eberhardt, Ruth Y. ;
Eddy, Sean R. ;
Heger, Andreas ;
Hetherington, Kirstie ;
Holm, Liisa ;
Mistry, Jaina ;
Sonnhammer, Erik L. L. ;
Tate, John ;
Punta, Marco .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D222-D230
[10]  
Gene Ontology Consortium, 2015, NUCLEIC ACIDS RES, V43, pD1049