Gene Function Hypotheses for the Campylobacter jejuni Glycome Generated by a Logic-Based Approach

被引:17
作者
Sternberg, Michael J. E. [1 ,2 ]
Tamaddoni-Nezhad, Alireza [1 ,3 ]
Lesk, Victor I. [1 ,2 ]
Kay, Emily [1 ,4 ]
Hitchen, Paul G. [1 ,2 ]
Cootes, Adrian [1 ,2 ]
van Alphen, Lieke B. [5 ]
Lamoureux, Marc P. [6 ]
Jarrelle, Harold C. [6 ]
Rawlings, Christopher J. [1 ,7 ]
Soo, Evelyn C. [8 ]
Szymanski, Christine M. [5 ]
Dell, Anne [1 ,2 ]
Wren, Brendan W. [1 ,4 ]
Muggleton, Stephen H. [1 ,3 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Ctr Integrat Syst Biol, London SW7 2AZ, England
[2] Univ London Imperial Coll Sci Technol & Med, Dept Life Sci, Div Mol Biosci, London SW7 2AZ, England
[3] Univ London Imperial Coll Sci Technol & Med, Dept Comp, London SW7 2AZ, England
[4] Univ London London Sch Hyg & Trop Med, Dept Pathogen Mol Biol, London WC1E 7HT, England
[5] Univ Alberta, Dept Biol Sci, Alberta Glyc Ctr, Edmonton, AB T6G 2E9, Canada
[6] Natl Res Council Canada, Inst Biol Sci, Ottawa, ON K1A 0R6, Canada
[7] Rothamsted Res, Dept Computat & Syst Biol, Harpenden AL5 2JQ, Herts, England
[8] Natl Res Council Canada, Inst Marine Biosci, Halifax, NS B3H 3Z1, Canada
基金
英国生物技术与生命科学研究理事会; 英国惠康基金;
关键词
systems biology; Campylobacter jejuni; machine learning; capsular polysaccharide; pathway modelling; PROTEIN FUNCTION; CAPSULAR POLYSACCHARIDE; ANNOTATION; PREDICTION; INFECTION; NETWORKS; SYSTEMS;
D O I
10.1016/j.jmb.2012.10.014
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Increasingly, experimental data on biological systems are obtained from several sources and computational approaches are required to integrate this information and derive models for the function of the system. Here, we demonstrate the power of a logic-based machine learning approach to propose hypotheses for gene function integrating information from two diverse experimental approaches. Specifically, we use inductive logic programming that automatically proposes hypotheses explaining the empirical data with respect to logically encoded background knowledge. We study the capsular polysaccharide biosynthetic pathway of the major human gastrointestinal pathogen Campylobacter jejuni. We consider several key steps in the formation of capsular polysaccharide consisting of 15 genes of which 8 have assigned function, and we explore the extent to which functions can be hypothesised for the remaining 7. Two sources of experimental data provide the information for learning-the results of knockout experiments on the genes involved in capsule formation and the absence/presence of capsule genes in a multitude of strains of different serotypes. The machine learning uses the pathway structure as background knowledge. We propose assignments of specific genes to five previously unassigned reaction steps. For four of these steps, there was an unambiguous optimal assignment of gene to reaction, and to the fifth, there were three candidate genes. Several of these assignments were consistent with additional experimental results. We therefore show that the logic-based methodology provides a robust strategy to integrate results from different experimental approaches and propose hypotheses for the behaviour of a biological system. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:186 / 197
页数:12
相关论文
共 32 条
[1]  
[Anonymous], 2012, PROLOG PROGRAMMING A
[2]   Single bifunctional UDP-GlcNAc/Glc 4-epimerase supports the synthesis of three cell surface glycoconjugates in Campylobacter jejuni [J].
Bernatchez, S ;
Szymanski, CM ;
Ishiyama, N ;
Li, JJ ;
Jarrell, HC ;
Lau, PC ;
Berghuis, AM ;
Young, NM ;
Wakarchuk, WW .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (06) :4792-4802
[3]   Application of formal methods to biological regulatory networks: extending Thomas' asynchronous logical approach with temporal logic [J].
Bernot, G ;
Comet, JP ;
Richard, A ;
Guespin, J .
JOURNAL OF THEORETICAL BIOLOGY, 2004, 229 (03) :339-347
[4]   BIOCHAM: an environment for modeling biological systems and formalizing experimental knowledge [J].
Calzone, Laurence ;
Fages, Francois ;
Soliman, Sylvain .
BIOINFORMATICS, 2006, 22 (14) :1805-1807
[5]  
Caspi R, 2008, NUCLEIC ACIDS RES, V36, pD623, DOI [10.1093/nar/gkm900, 10.1093/nar/gkt1103]
[6]   Comparative phylogenomics of the food-borne pathogen Campylobacter jejuni reveals genetic markers predictive of infection source [J].
Champion, OL ;
Gaunt, MW ;
Gundogdu, O ;
Elmi, A ;
Witney, AA ;
Hinds, J ;
Dorrell, N ;
Wren, BW .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (44) :16043-16048
[7]   Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae [J].
Chen, Y ;
Xu, D .
NUCLEIC ACIDS RESEARCH, 2004, 32 (21) :6414-6424
[8]   Machine learning methods for metabolic pathway prediction [J].
Dale, Joseph M. ;
Popescu, Liviu ;
Karp, Peter D. .
BMC BIOINFORMATICS, 2010, 11
[9]   Campylobacter sugars sticking out [J].
Guerry, Patricia ;
Szymanski, Christine M. .
TRENDS IN MICROBIOLOGY, 2008, 16 (09) :428-435
[10]   Re-annotation and re-analysis of the Campylobacter jejuni NCTC11168 genome sequence [J].
Gundogdu, Ozan ;
Bentley, Stephen D. ;
Holden, Matt T. ;
Parkhill, Julian ;
Dorrell, Nick ;
Wren, Brendan W. .
BMC GENOMICS, 2007, 8