Computational prediction of the human-microbial oral interactome

被引:26
作者
Coelho, Edgar D. [1 ]
Arrais, Joel P. [2 ,3 ]
Matos, Sergio [1 ]
Pereira, Carlos [3 ,4 ]
Rosa, Nuno [5 ]
Correia, Maria Jose [5 ]
Barros, Marlene [5 ,6 ]
Oliveira, Jose Luis [1 ]
机构
[1] Univ Aveiro, Inst Elect & Telemat Engn Aveiro IEETA, Dept Elect Telecommun & Informat, P-3800 Aveiro, Portugal
[2] Univ Coimbra, Dept Informat Engn DEI, Coimbra, Portugal
[3] Univ Coimbra, Ctr Informat & Syst Univ Coimbra, Coimbra, Portugal
[4] Engn Inst Coimbra IPC ISEC, Polytech Inst Coimbra, Dept Informat Engn & Syst, Coimbra, Portugal
[5] Catholic Univ Portugal, Inst Hlth Sci, Dept Hlth Sci, Viseu, Portugal
[6] Univ Coimbra, Ctr Neurosci & Cell Biol, Coimbra, Portugal
关键词
Protein-protein interactions; Oral interactome; Bayesian classification; PROTEIN-PROTEIN INTERACTIONS; GROWTH-FACTOR RECEPTOR; SEQUENCE-BASED PREDICTION; GENE ONTOLOGY; INTERACTION NETWORKS; DATABASE; CONSERVATION; CLUSTERS; HYPERPLANES; FIBRONECTIN;
D O I
10.1186/1752-0509-8-24
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The oral cavity is a complex ecosystem where human chemical compounds coexist with a particular microbiota. However, shifts in the normal composition of this microbiota may result in the onset of oral ailments, such as periodontitis and dental caries. In addition, it is known that the microbial colonization of the oral cavity is mediated by protein-protein interactions (PPIs) between the host and microorganisms. Nevertheless, this kind of PPIs is still largely undisclosed. To elucidate these interactions, we have created a computational prediction method that allows us to obtain a first model of the Human-Microbial oral interactome. Results: We collected high-quality experimental PPIs from five major human databases. The obtained PPIs were used to create our positive dataset and, indirectly, our negative dataset. The positive and negative datasets were merged and used for training and validation of a naive Bayes classifier. For the final prediction model, we used an ensemble methodology combining five distinct PPI prediction techniques, namely: literature mining, primary protein sequences, orthologous profiles, biological process similarity, and domain interactions. Performance evaluation of our method revealed an area under the ROC-curve (AUC) value greater than 0.926, supporting our primary hypothesis, as no single set of features reached an AUC greater than 0.877. After subjecting our dataset to the prediction model, the classified result was filtered for very high confidence PPIs (probability >= 1-10(-7)), leading to a set of 46,579 PPIs to be further explored. Conclusions: We believe this dataset holds not only important pathways involved in the onset of infectious oral diseases, but also potential drug-targets and biomarkers. The dataset used for training and validation, the predictions obtained and the network final network are available at http://bioinformatics.ua.pt/software/oralint.
引用
收藏
页数:12
相关论文
共 111 条
[1]   The relationship between sequence and interaction divergence in proteins [J].
Aloy, P ;
Ceulemans, H ;
Stark, A ;
Russell, RB .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 332 (05) :989-998
[2]   Interrogating protein interaction networks through structural biology [J].
Aloy, P ;
Russell, RB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (09) :5896-5901
[3]  
[Anonymous], COMPUTER METHODS BIO
[4]   Enolases from Gram-positive bacterial pathogens and commensal lactobacilli share functional similarity in virulence-associated traits [J].
Antikainen, Jenni ;
Kuparinen, Veera ;
Laehteenmaeki, Kaarina ;
Korhonen, Timo K. .
FEMS IMMUNOLOGY AND MEDICAL MICROBIOLOGY, 2007, 51 (03) :526-534
[5]   Reorganizing the protein space at the Universal Protein Resource (UniProt) [J].
Apweiler, Rolf ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Antunes, Ricardo ;
Casanova, Elisabet Barrera ;
Bely, Benoit ;
Bingley, Mark ;
Bower, Lawrence ;
Bursteinas, Borisas ;
Chan, Wei Mun ;
Chavali, Gayatri ;
Da Silva, Alan ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Fazzini, Francesco ;
Fedotov, Alexander ;
Garavelli, John ;
Castro, Leyla Garcia ;
Gardner, Michael ;
Hieta, Reija ;
Huntley, Rachael ;
Jacobsen, Julius ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
Orchard, Sandra ;
Patient, Samuel ;
Pichler, Klemens ;
Poggioli, Diego ;
Pontikos, Nikolas ;
Pundir, Sangya ;
Rosanoff, Steven ;
Sawford, Tony ;
Sehra, Harminder ;
Turner, Edward ;
Wardell, Tony ;
Watkins, Xavier ;
Corbett, Matt ;
Donnelly, Mike ;
van Rensburg, Pieter ;
Goujon, Mickael ;
McWilliam, Hamish ;
Lopez, Rodrigo ;
Xenarios, Ioannis ;
Bougueleret, Lydie ;
Bridge, Alan ;
Poux, Sylvain ;
Redaschi, Nicole .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D71-D75
[6]   OralCard: A bioinformatic tool for the study of oral proteome [J].
Arrais, Joel P. ;
Rosa, Nuno ;
Melo, Jose ;
Coelho, Edgar D. ;
Amaral, Diana ;
Correia, Maria Jose ;
Barros, Marlene ;
Oliveira, Jose Luis .
ARCHIVES OF ORAL BIOLOGY, 2013, 58 (07) :762-772
[7]  
AVILA-CAMPOS Mario J., 2002, Rev. Inst. Med. trop. S. Paulo, V44, P1, DOI 10.1590/S0036-46652002000100001
[8]   Predicting functional gene links from phylogenetic-statistical analyses of whole genomes [J].
Barker, D ;
Pagel, M .
PLOS COMPUTATIONAL BIOLOGY, 2005, 1 (01) :24-31
[9]   EPIDERMAL GROWTH FACTOR-RELATED PEPTIDES AND THEIR RELEVANCE TO GASTROINTESTINAL PATHOPHYSIOLOGY [J].
BARNARD, JA ;
BEAUCHAMP, RD ;
RUSSELL, WE ;
DUBOIS, RN ;
COFFEY, RJ .
GASTROENTEROLOGY, 1995, 108 (02) :564-580
[10]   ACTINOMYCES-ODONTOLYTICUS, A NEW SPECIES OF ACTINOMYCETE REGULARLY ISOLATED FROM DEEP CARIOUS DENTINE [J].
BATTY, I .
JOURNAL OF PATHOLOGY AND BACTERIOLOGY, 1958, 75 (02) :455-&