Assessment of community-submitted ontology annotations from a novel database-journal partnership

被引:10
作者
Berardini, Tanya Z. [1 ]
Li, Donghui [1 ]
Muller, Robert [1 ]
Chetty, Raymond [1 ]
Ploetz, Larry [1 ]
Singh, Shanker [1 ]
Wensel, April [1 ]
Huala, Eva [1 ]
机构
[1] Carnegie Inst Sci, Dept Plant Biol, Stanford, CA 94305 USA
来源
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | 2012年
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
GENE; MODEL;
D O I
10.1093/database/bas030
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
As the scientific literature grows, leading to an increasing volume of published experimental data, so does the need to access and analyze this data using computational tools. The most commonly used method to convert published experimental data on gene function into controlled vocabulary annotations relies on a professional curator, employed by a model organism database or a more general resource such as UniProt, to read published articles and compose annotation statements based on the articles' contents. A more cost-effective and scalable approach capable of capturing gene function data across the whole range of biological research organisms in computable form is urgently needed. We have analyzed a set of ontology annotations generated through collaborations between the Arabidopsis Information Resource and several plant science journals. Analysis of the submissions entered using the online submission tool shows that most community annotations were well supported and the ontology terms chosen were at an appropriate level of specificity. Of the 503 individual annotations that were submitted, 97% were approved and community submissions captured 72% of all possible annotations. This new method for capturing experimental results in a computable form provides a cost-effective way to greatly increase the available body of annotations without sacrificing annotation quality.
引用
收藏
页数:13
相关论文
共 22 条
[1]   The Renal Gene Ontology Annotation Initiative [J].
Alam-Faruque, Yasmin ;
Dimmer, Emily C. ;
Huntley, Rachael P. ;
O'Donovan, Claire ;
Scambler, Peter ;
Apweiler, Rolf .
ORGANOGENESIS, 2010, 6 (02) :71-75
[2]  
[Anonymous], CURR PROTOC BIOINFOR
[3]   The GOA database in 2009-an integrated Gene Ontology Annotation resource [J].
Barrell, Daniel ;
Dimmer, Emily ;
Huntley, Rachael P. ;
Binns, David ;
O'Donovan, Claire ;
Apweiler, Rolf .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D396-D403
[4]   The Gene Ontology in 2010: extensions and refinements The Gene Ontology Consortium [J].
Berardini, Tanya Z. ;
Li, Donghui ;
Huala, Eva ;
Bridges, Susan ;
Burgess, Shane ;
McCarthy, Fiona ;
Carbon, Seth ;
Lewis, Suzanna E. ;
Mungall, Christopher J. ;
Abdulla, Amina ;
Wood, Valerie ;
Feltrin, Erika ;
Valle, Giorgio ;
Chisholm, Rex L. ;
Fey, Petra ;
Gaudet, Pascale ;
Kibbe, Warren ;
Basu, Siddhartha ;
Bushmanova, Yulia ;
Eilbeck, Karen ;
Siegele, Deborah A. ;
McIntosh, Brenley ;
Renfro, Daniel ;
Zweifel, Adrienne ;
Hu, James C. ;
Ashburner, Michael ;
Tweedie, Susan ;
Alam-Faruque, Yasmin ;
Apweiler, Rolf ;
Auchinchloss, Andrea ;
Bairoch, Amos ;
Barrell, Daniel ;
Binns, David ;
Blatter, Marie-Claude ;
Bougueleret, Lydie ;
Boutet, Emmanuel ;
Breuza, Lionel ;
Bridge, Alan ;
Browne, Paul ;
Chan, Wei Mun ;
Coudert, Elizabeth ;
Daugherty, Louise ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Estreicher, Anne ;
Famiglietti, Livia ;
Ferro-Rojas, Serenella ;
Feuermann, Marc ;
Foulger, Rebecca ;
Gruaz-Gumowski, Nadine .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D331-D335
[5]   The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics [J].
Blake, Judith A. ;
Bult, Carol J. ;
Kadin, James A. ;
Richardson, Joel E. ;
Eppig, Janan T. .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D842-D848
[6]   Web-Queryable Large-Scale Data Sets for Hypothesis Generation in Plant Biology [J].
Brady, Siobhan M. ;
Provart, Nicholas J. .
PLANT CELL, 2009, 21 (04) :1034-1051
[7]   Using computational predictions to improve literature-based Gene Ontology annotations: a feasibility study [J].
Costanzo, Maria C. ;
Park, Julie ;
Balakrishnan, Rama ;
Cherry, J. Michael ;
Hong, Eurie L. .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2011,
[8]   Gene Ontology annotations: what they mean and where they come from [J].
Hill, David P. ;
Smith, Barry ;
McAndrews-Hill, Monica S. ;
Blake, Judith A. .
BMC BIOINFORMATICS, 2008, 9 (Suppl 5)
[9]   Big data: The future of biocuration [J].
Howe, Doug ;
Costanzo, Maria ;
Fey, Petra ;
Gojobori, Takashi ;
Hannick, Linda ;
Hide, Winston ;
Hill, David P. ;
Kania, Renate ;
Schaeffer, Mary ;
St Pierre, Susan ;
Twigger, Simon ;
White, Owen ;
Rhee, Seung Yon .
NATURE, 2008, 455 (7209) :47-50
[10]   Systematic prediction of gene function in Arabidopsis thaliana using a probabilistic functional gene network [J].
Hwang, Sohyun ;
Rhee, Seung Y. ;
Marcotte, Edward M. ;
Lee, Insuk .
NATURE PROTOCOLS, 2011, 6 (09) :1429-1442