Practical application of ontologies to annotate and analyse large scale raw mouse phenotype data

被引:36
作者
Beck, Tim [1 ]
Morgan, Hugh [1 ]
Blake, Andrew [1 ]
Wells, Sara [1 ]
Hancock, John M. [1 ]
Mallon, Ann-Marie [1 ]
机构
[1] MRC Harwell, Didcot OX11 0RD, Oxon, England
基金
英国医学研究理事会;
关键词
TOOL; MUTAGENESIS; EMPRESS;
D O I
10.1186/1471-2105-10-S5-S2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Large-scale international projects are underway to generate collections of knockout mouse mutants and subsequently to perform high throughput phenotype assessments, raising new challenges for computational researchers due to the complexity and scale of the phenotype data. Phenotypes can be described using ontologies in two differing methodologies. Traditionally an individual phenotypic character has either been defined using a single compound term, originating from a species-specific dedicated phenotype ontology, or alternatively by a combinatorial annotation, using concepts from a range of disparate ontologies, to define a phenotypic character as an entity with an associated quality (EQ). Both methods have their merits, which include the dedicated approach allowing use of community standard terminology, and the combinatorial approach facilitating cross-species phenotypic statement comparisons. Previously databases have favoured one approach over another. The EUMODIC project will generate large amounts of mouse phenotype data, generated as a result of the execution of a set of Standard Operating Procedures (SOPs) and will implement both ontological approaches to capture the phenotype data generated. Results: For all SOPs a four-tier annotation is made: a high-level description of the SOP, to broadly define the type of data generated by the SOP; individual parameter annotation using the EQ model; annotation of the qualitative data generated for each mouse; and the annotation of mutant lines after statistical analysis. The qualitative assessments of phenodeviance are made at the point of data entry, using child PATO qualities to the parameter quality. To facilitate data querying by scientists more familiar with single compound terms to describe phenotypes, the mappings between the Mammalian Phenotype (MP) ontology and the EQ PATO model are exploited to allow querying via MP terms. Conclusion: Well-annotated and comparable phenotype databases can be achieved through the use of ontologically derived comparable phenotypic statements and have been implemented here by means of OBO compatible EQ annotations. The implementation we describe also sees scientists working seamlessly with ontologies through the assessment of qualitative phenotypes in terms of PATO qualities and the ability to query the database using community-accepted compound MP terms. This work represents the first time the combinatorial and single-dedicated approaches have both been implemented to annotate a phenotypic dataset.
引用
收藏
页数:9
相关论文
共 21 条
[1]  
[Anonymous], 2006, GO Slim and subset guide
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   The European dimension for the mouse genome mutagenesis program [J].
Auwerx, J ;
Avner, P ;
Baldock, R ;
Ballabio, A ;
Balling, R ;
Barbacid, M ;
Berns, A ;
Bradley, A ;
Brown, S ;
Carmeliet, P ;
Chambon, P ;
Cox, R ;
Davidson, D ;
Davies, K ;
Duboule, D ;
Forejt, J ;
Granucci, F ;
Hastie, N ;
de Angelis, MH ;
Jackson, I ;
Kioussis, D ;
Kollias, G ;
Lathrop, M ;
Lendahl, U ;
Malumbres, M ;
von Melchner, H ;
Müller, W ;
Partanen, J ;
Ricciardi-Castagnoli, P ;
Rigby, P ;
Rosen, B ;
Rosenthal, N ;
Skarnes, B ;
Stewart, AF ;
Thornton, J ;
Tocchini-Valentini, G ;
Wagner, E ;
Wahli, W ;
Wurst, W .
NATURE GENETICS, 2004, 36 (09) :925-927
[4]   EMPReSS: standardized phenotype screens for functional annotation of the mouse genome [J].
Brown, SDM ;
Chambon, P ;
de Angelis, MH .
NATURE GENETICS, 2005, 37 (11) :1155-1155
[5]   Understanding mammalian genetic systems: The challenge of phenotyping in the mouse [J].
Brown, Steve D. M. ;
Hancock, John M. ;
Gates, Hilary .
PLOS GENETICS, 2006, 2 (08) :1131-1137
[6]   The Mouse Genome Database (MGD): mouse biology and model systems [J].
Bult, Carol J. ;
Eppig, Janan T. ;
Kadin, James A. ;
Richardson, Joel E. ;
Blake, Judith A. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D724-D728
[7]   A mouse for all reasons [J].
Collins, Francis S. .
CELL, 2007, 128 (01) :9-13
[8]   N-ethyl-N-nitrosourea mutagenesis:: Boarding the mouse mutant express [J].
Cordes, SP .
MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, 2005, 69 (03) :426-+
[9]   OBO-Edit - an ontology editor for biologists [J].
Day-Richter, John ;
Harris, Midori A. ;
Haendel, Melissa .
BIOINFORMATICS, 2007, 23 (16) :2198-2200
[10]   ChEBI:: a database and ontology for chemical entities of biological interest [J].
Degtyarenko, Kirill ;
de Matos, Paula ;
Ennis, Marcus ;
Hastings, Janna ;
Zbinden, Martin ;
McNaught, Alan ;
Alcantara, Rafael ;
Darsow, Michael ;
Guedj, Mickael ;
Ashburner, Michael .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D344-D350