A guide to best practices for Gene Ontology (GO) manual annotation

Cited by: 101
Authors
Balakrishnan, Rama [1]
Harris, Midori A. [2]
Huntley, Rachael [3]
Van Auken, Kimberly [4]
Cherry, J. Michael [1]
Affiliations
[1] Stanford Univ, Dept Genet, Saccharomyces Genome Database, Stanford, CA 94305 USA
[2] Univ Cambridge, Dept Biochem, Cambridge Syst Biol Ctr, PomBase, Cambridge CB2 1GA, England
[3] European Bioinformat Inst, UniProt, Hinxton CB10 1SD, Cambs, England
[4] CALTECH, Div Biol, WormBase, Pasadena, CA 91125 USA
Source
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2013
Funding
National Institutes of Health (USA); Wellcome Trust (UK);
Keywords
DATABASE; IDENTIFICATION; RESOURCE; PROTEIN;
DOI
10.1093/database/bat054
Chinese Library Classification number
Q [Biological Sciences];
Subject classification code
07; 0710; 09;
Abstract
The Gene Ontology Consortium (GOC) is a community-based bioinformatics project that classifies gene product function through the use of structured controlled vocabularies. A fundamental application of the Gene Ontology (GO) is in the creation of gene product annotations, evidence-based associations between GO definitions and experimental or sequence-based analysis. Currently, the GOC disseminates 126 million annotations covering >374 000 species including all the kingdoms of life. This number includes two classes of GO annotations: those created manually by experienced biocurators reviewing the literature or by examination of biological data (1.1 million annotations covering 2226 species) and those generated computationally via automated methods. As manual annotations are often used to propagate functional predictions between related proteins within and between genomes, it is critical to provide accurate consistent manual annotations. Toward this goal, we present here the conventions defined by the GOC for the creation of manual annotation. This guide represents the best practices for manual annotation as established by the GOC project over the past 12 years. We hope this guide will encourage research communities to annotate gene products of their interest to enhance the corpus of GO annotations available to all.
Pages: 18