A guide to best practices for Gene Ontology (GO) manual annotation

被引:104
作者
Balakrishnan, Rama [1 ]
Harris, Midori A. [2 ]
Huntley, Rachael [3 ]
Van Auken, Kimberly [4 ]
Cherry, J. Michael [1 ]
机构
[1] Stanford Univ, Dept Genet, Saccharomyces Genome Database, Stanford, CA 94305 USA
[2] Univ Cambridge, Dept Biochem, Cambridge Syst Biol Ctr, PomBase, Cambridge CB2 1GA, England
[3] European Bioinformat Inst, UniProt, Hinxton CB10 1SD, Cambs, England
[4] CALTECH, Div Biol, WormBase, Pasadena, CA 91125 USA
来源
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2013年
基金
美国国家卫生研究院; 英国惠康基金;
关键词
DATABASE; IDENTIFICATION; RESOURCE; PROTEIN;
D O I
10.1093/database/bat054
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Gene Ontology Consortium (GOC) is a community-based bioinformatics project that classifies gene product function through the use of structured controlled vocabularies. A fundamental application of the Gene Ontology (GO) is in the creation of gene product annotations, evidence-based associations between GO definitions and experimental or sequence-based analysis. Currently, the GOC disseminates 126 million annotations covering >374 000 species including all the kingdoms of life. This number includes two classes of GO annotations: those created manually by experienced biocurators reviewing the literature or by examination of biological data (1.1 million annotations covering 2226 species) and those generated computationally via automated methods. As manual annotations are often used to propagate functional predictions between related proteins within and between genomes, it is critical to provide accurate consistent manual annotations. Toward this goal, we present here the conventions defined by the GOC for the creation of manual annotation. This guide represents the best practices for manual annotation as established by the GOC project over the past 12 years. We hope this guide will encourage research communities to annotate gene products of their interest to enhance the corpus of GO annotations available to all.
引用
收藏
页数:18
相关论文
共 31 条
[1]   The Impact of Focused Gene Ontology Curation of Specific Mammalian Systems [J].
Alam-Faruque, Yasmin ;
Huntley, Rachael P. ;
Khodiyar, Varsha K. ;
Camon, Evelyn B. ;
Dimmer, Emily C. ;
Sawford, Tony ;
Martin, Maria J. ;
O'Donovan, Claire ;
Talmud, Philippa J. ;
Scambler, Peter ;
Apweiler, Rolf ;
Lovering, Ruth C. .
PLOS ONE, 2011, 6 (12)
[2]   Ongoing and future developments at the Universal Protein Resource [J].
Apweiler, Rolf ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Antunes, Ricardo ;
Barrell, Daniel ;
Bely, Benoit ;
Bingley, Mark ;
Binns, David ;
Bower, Lawrence ;
Browne, Paul ;
Chan, Wei Mun ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Fazzini, Francesco ;
Fedotov, Alexander ;
Foulger, Rebecca ;
Garavelli, John ;
Castro, Leyla Garcia ;
Huntley, Rachael ;
Jacobsen, Julius ;
Kleen, Michael ;
Laiho, Kati ;
Legge, Duncan ;
Lin, Quan ;
Liu, Wudong ;
Luo, Jie ;
Orchard, Sandra ;
Patient, Samuel ;
Pichler, Klemens ;
Poggioli, Diego ;
Pontikos, Nikolas ;
Pruess, Manuela ;
Rosanoff, Steven ;
Sawford, Tony ;
Sehra, Harminder ;
Turner, Edward ;
Corbett, Matt ;
Donnelly, Mike ;
van Rensburg, Pieter ;
Xenarios, Ioannis ;
Bougueleret, Lydie ;
Auchincloss, Andrea ;
Argoud-Puy, Ghislaine ;
Axelsen, Kristian ;
Bairoch, Amos ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D214-D219
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   An ontology for cell types [J].
Bard, J ;
Rhee, SY ;
Ashburner, M .
GENOME BIOLOGY, 2005, 6 (02)
[5]   The GOA database in 2009-an integrated Gene Ontology Annotation resource [J].
Barrell, Daniel ;
Dimmer, Emily ;
Huntley, Rachael P. ;
Binns, David ;
O'Donovan, Claire ;
Apweiler, Rolf .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D396-D403
[6]   Molecular cloning and functional expression of a Caenorhabditis elegans aminopeptidase structurally related to mammalian leukotriene A4 hydrolases [J].
Baset, HA ;
Ford-Hutchinson, AW ;
O'Neill, GP .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1998, 273 (43) :27978-27987
[7]   The Gene Ontology in 2010: extensions and refinements The Gene Ontology Consortium [J].
Berardini, Tanya Z. ;
Li, Donghui ;
Huala, Eva ;
Bridges, Susan ;
Burgess, Shane ;
McCarthy, Fiona ;
Carbon, Seth ;
Lewis, Suzanna E. ;
Mungall, Christopher J. ;
Abdulla, Amina ;
Wood, Valerie ;
Feltrin, Erika ;
Valle, Giorgio ;
Chisholm, Rex L. ;
Fey, Petra ;
Gaudet, Pascale ;
Kibbe, Warren ;
Basu, Siddhartha ;
Bushmanova, Yulia ;
Eilbeck, Karen ;
Siegele, Deborah A. ;
McIntosh, Brenley ;
Renfro, Daniel ;
Zweifel, Adrienne ;
Hu, James C. ;
Ashburner, Michael ;
Tweedie, Susan ;
Alam-Faruque, Yasmin ;
Apweiler, Rolf ;
Auchinchloss, Andrea ;
Bairoch, Amos ;
Barrell, Daniel ;
Binns, David ;
Blatter, Marie-Claude ;
Bougueleret, Lydie ;
Boutet, Emmanuel ;
Breuza, Lionel ;
Bridge, Alan ;
Browne, Paul ;
Chan, Wei Mun ;
Coudert, Elizabeth ;
Daugherty, Louise ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Estreicher, Anne ;
Famiglietti, Livia ;
Ferro-Rojas, Serenella ;
Feuermann, Marc ;
Foulger, Rebecca ;
Gruaz-Gumowski, Nadine .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D331-D335
[8]   QuickGO: a web-based tool for Gene Ontology searching [J].
Binns, David ;
Dimmer, Emily ;
Huntley, Rachael ;
Barrell, Daniel ;
O'Donovan, Claire ;
Apweiler, Rolf .
BIOINFORMATICS, 2009, 25 (22) :3045-3046
[9]   Gene Ontology Annotations and Resources [J].
Blake, J. A. ;
Dolan, M. ;
Drabkin, H. ;
Hill, D. P. ;
Ni, Li ;
Sitnikov, D. ;
Bridges, S. ;
Burgess, S. ;
Buza, T. ;
McCarthy, F. ;
Peddinti, D. ;
Pillai, L. ;
Carbon, S. ;
Dietze, H. ;
Ireland, A. ;
Lewis, S. E. ;
Mungall, C. J. ;
Gaudet, P. ;
Chisholm, R. L. ;
Fey, P. ;
Kibbe, W. A. ;
Basu, S. ;
Siegele, D. A. ;
McIntosh, B. K. ;
Renfro, D. P. ;
Zweifel, A. E. ;
Hu, J. C. ;
Brown, N. H. ;
Tweedie, S. ;
Alam-Faruque, Y. ;
Apweiler, R. ;
Auchinchloss, A. ;
Axelsen, K. ;
Bely, B. ;
Blatter, M-C. ;
Bonilla, C. ;
Bougueleret, L. ;
Boutet, E. ;
Breuza, L. ;
Bridge, A. ;
Chan, W. M. ;
Chavali, G. ;
Coudert, E. ;
Dimmer, E. ;
Estreicher, A. ;
Famiglietti, L. ;
Feuermann, M. ;
Gos, A. ;
Gruaz-Gumowski, N. ;
Hieta, R. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D530-D535
[10]   ZFIN: enhancements and updates to the zebrafish model organism database [J].
Bradford, Yvonne ;
Conlin, Tom ;
Dunn, Nathan ;
Fashena, David ;
Frazer, Ken ;
Howe, Douglas G. ;
Knight, Jonathan ;
Mani, Prita ;
Martin, Ryan ;
Moxon, Sierra A. T. ;
Paddock, Holly ;
Pich, Christian ;
Ramachandran, Sridhar ;
Ruef, Barbara J. ;
Ruzicka, Leyla ;
Schaper, Holle Bauer ;
Schaper, Kevin ;
Shao, Xiang ;
Singer, Amy ;
Sprague, Judy ;
Sprunger, Brock ;
Van Slyke, Ceri ;
Westerfield, Monte .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D822-D829