Prediction of human-Streptococcus pneumoniae protein-protein interactions using logistic regression

被引:9
作者
Prasasty, Vivitri Dewi [1 ]
Hutagalung, Rory Anthony [1 ]
Gunadi, Reinhart [2 ]
Sofia, Dewi Yustika [2 ]
Rosmalena, Rosmalena [3 ]
Yazid, Fatmawaty [3 ]
Sinaga, Ernawati [4 ]
机构
[1] Atma Jaya Catholic Univ Indonesia, Fac Biotechnol, Jakarta 12930, Indonesia
[2] Univ Surya, Fac Life Sci, Dept Biol, Tangerang 15143, Banten, Indonesia
[3] Univ Indonesia, Dept Med Chem, Fac Med, Jakarta 10430, Indonesia
[4] Univ Nas, Fac Biol, Jakarta 12520, Indonesia
关键词
Host-pathogen protein-protein interactions; Logistic regression; Network centrality; Pathway enrichment; Streptococcus pneumoniae; VIRULENCE PROTEINS; SEQUENCE; NETWORKS; GENE; COMBINATIONS; BOTTLENECKS; PNEUMOLYSIN; MECHANISMS; RESISTANCE; CHILDREN;
D O I
10.1016/j.compbiolchem.2021.107492
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Streptococcus pneumoniae is a major cause of mortality in children under five years old. In recent years, the emergence of antibiotic-resistant strains of S. pneumoniae increases the threat level of this pathogen. For that reason, the exploration of S. pneumoniae protein virulence factors should be considered in developing new drugs or vaccines, for instance by the analysis of host-pathogen protein-protein interactions (HP-PPIs). In this research, prediction of protein-protein interactions was performed with a logistic regression model with the number of protein domain occurrences as features. By utilizing HP-PPIs of three different pathogens as training data, the model achieved 57-77 % precision, 64-75 % recall, and 96-98 % specificity. Prediction of human-S. pneumoniae protein-protein interactions using the model yielded 5823 interactions involving thirty S. pneumoniae proteins and 324 human proteins. Pathway enrichment analysis showed that most of the pathways involved in the predicted interactions are immune system pathways. Network topology analysis revealed beta-galactosidase (BgaA) as the most central among the S. pneumoniae proteins in the predicted HP-PPI networks, with a degree centrality of 1.0 and a betweenness centrality of 0.451853. Further experimental studies are required to validate the predicted interactions and examine their roles in S. pneumoniae infection.
引用
收藏
页数:9
相关论文
共 65 条
  • [21] Hagberg AA, 2008, EXPLORING NETWORK ST, P11, DOI DOI 10.1016/J.JELECTROCARD.2010.09.003
  • [22] Hastie T., 2009, Springer series in statistics, Vxxii, DOI 10.1007/978-0-387-84858-7
  • [23] Why do hubs tend to be essential in protein networks?
    He, Xionglei
    Zhang, Jianzhi
    [J]. PLOS GENETICS, 2006, 2 (06): : 826 - 834
  • [24] The Pneumococcus: Epidemiology, Microbiology, and Pathogenesis
    Henriques-Normark, Birgitta
    Tuomanen, Elaine I.
    [J]. COLD SPRING HARBOR PERSPECTIVES IN MEDICINE, 2013, 3 (07):
  • [25] Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources
    Huang, Da Wei
    Sherman, Brad T.
    Lempicki, Richard A.
    [J]. NATURE PROTOCOLS, 2009, 4 (01) : 44 - 57
  • [26] Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding
    Huang, Yu-An
    You, Zhu-Hong
    Chen, Xing
    Chan, Keith
    Luo, Xin
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [27] Complement resistance mechanisms of streptococci
    Jarva, H
    Jokiranta, TS
    Würzner, R
    Meri, S
    [J]. MOLECULAR IMMUNOLOGY, 2003, 40 (2-4) : 95 - 107
  • [28] Pneumococcal virulence factors: Structure and function
    Jedrzejas, MJ
    [J]. MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, 2001, 65 (02) : 187 - +
  • [29] The role of Streptococcus pneumoniae virulence factors in host respiratory colonization and disease
    Kadioglu, Aras
    Weiser, Jeffrey N.
    Paton, James C.
    Andrew, Peter W.
    [J]. NATURE REVIEWS MICROBIOLOGY, 2008, 6 (04) : 288 - 301
  • [30] KEGG as a reference resource for gene and protein annotation
    Kanehisa, Minoru
    Sato, Yoko
    Kawashima, Masayuki
    Furumichi, Miho
    Tanabe, Mao
    [J]. NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) : D457 - D462