Ontology-based literature mining of E. coli vaccine-associated gene interaction networks

被引:12
作者
Hur, Junguk [1 ]
Ozgur, Arzucan [2 ]
He, Yongqun [3 ,4 ,5 ,6 ]
机构
[1] Univ North Dakota, Dept Biomed Sci, Sch Med & Hlth Sci, Grand Forks, ND 58202 USA
[2] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
[3] Univ Michigan, Dept Microbiol & Immunol, Unit Lab Anim Med, Sch Med, Ann Arbor, MI 48109 USA
[4] Univ Michigan, Dept Microbiol & Immunol, Sch Med, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Ctr Computat Med & Bioinformat, Sch Med, Ann Arbor, MI 48109 USA
[6] Univ Michigan, Ctr Comprehens Canc, Sch Med, Ann Arbor, MI 48109 USA
来源
JOURNAL OF BIOMEDICAL SEMANTICS | 2017年 / 8卷
关键词
ESCHERICHIA-COLI; IDENTIFICATION; PROTEGEN; DATABASE; VIOLIN; TOOL;
D O I
10.1186/s13326-017-0122-4
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Pathogenic Escherichia coli infections cause various diseases in humans and many animal species. However, with extensive E. coli vaccine research, we are still unable to fully protect ourselves against E. coli infections. To more rational development of effective and safe E. coli vaccine, it is important to better understand E. coli vaccine-associated gene interaction networks. Methods: In this study, we first extended the Vaccine Ontology (VO) to semantically represent various E. coli vaccines and genes used in the vaccine development. We also normalized E. coli gene names compiled from the annotations of various E. coli strains using a pan-genome-based annotation strategy. The Interaction Network Ontology (INO) includes a hierarchy of various interaction-related keywords useful for literature mining. Using VO, INO, and normalized E. coli gene names, we applied an ontology-based SciMiner literature mining strategy to mine all PubMed abstracts and retrieve E. coli vaccine-associated E. coli gene interactions. Four centrality metrics (i. e., degree, eigenvector, closeness, and betweenness) were calculated for identifying highly ranked genes and interaction types. Results: Using vaccine-related PubMed abstracts, our study identified 11,350 sentences that contain 88 unique INO interactions types and 1,781 unique E. coli genes. Each sentence contained at least one interaction type and two unique E. coli genes. An E. coli gene interaction network of genes and INO interaction types was created. From this big network, a sub-network consisting of 5 E. coli vaccine genes, including carA, carB, fimH, fepA, and vat, and 62 other E. coli genes, and 25 INO interaction types was identified. While many interaction types represent direct interactions between two indicated genes, our study has also shown that many of these retrieved interaction types are indirect in that the two genes participated in the specified interaction process in a required but indirect process. Our centrality analysis of these gene interaction networks identified top ranked E. coli genes and 6 INO interaction types (e. g., regulation and gene expression). Conclusions: Vaccine-related E. coli gene-gene interaction network was constructed using ontology-based literature mining strategy, which identified important E. coli vaccine genes and their interactions with other genes through specific interaction types.
引用
收藏
页数:10
相关论文
共 39 条
  • [1] [Anonymous], 2006, Wkly Epidemiol Rec, V81, P97
  • [2] Collobert R, 2011, J MACH LEARN RES, V12, P2493
  • [3] The Pyrimidine Nucleotide Biosynthetic Pathway Modulates Production of Biofilm Determinants in Escherichia coli
    Garavaglia, Marco
    Rossi, Elio
    Landini, Paolo
    [J]. PLOS ONE, 2012, 7 (02):
  • [4] A review of vaccine research and development: human enteric infections
    Girard, MP
    Steele, D
    Chaignat, CL
    Kieny, MP
    [J]. VACCINE, 2006, 24 (15) : 2732 - 2750
  • [5] Activation by fusion of the glutaminase and synthetase subunits of Escherichia coli carbamyl-phosphate synthetase
    Guy, HI
    Rotgeri, A
    Evans, DR
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 1997, 272 (32) : 19913 - 19918
  • [6] HE Y., 2009, The 1st International Conference on Biomedical Ontology (ICBO 2009) Nature Precedings, P24
  • [7] Updates on the web-based VIOLIN vaccine database and analysis system
    He, Yongqun
    Racz, Rebecca
    Sayers, Samantha
    Lin, Yu
    Todd, Thomas
    Hur, Junguk
    Li, Xinna
    Patel, Mukti
    Zhao, Boyang
    Chung, Monica
    Ostrow, Joseph
    Sylora, Andrew
    Dungarani, Priya
    Ulysse, Guerlain
    Kochhar, Kanika
    Vidri, Boris
    Strait, Kelsey
    Jourdian, George W.
    Xiang, Zuoshuang
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) : D1124 - D1132
  • [8] Bioinformatics analysis of bacterial protective antigens in manually curated Protegen database
    He, Yongqun
    Xiang, Zuoshuang
    [J]. 5TH VACCINE AND ISV ANNUAL GLOBAL CONGRESS, 2012, 6 : 3 - 9
  • [9] Vaxign: The First Web-Based Vaccine Design Program for Reverse Vaccinology and Applications for Vaccine Development
    He, Yongqun
    Xiang, Zuoshuang
    Mobley, Harry L. T.
    [J]. JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2010,
  • [10] Development and application of an interaction network ontology for literature mining of vaccine-associated gene-gene interactions
    Hur, Junguk
    Ozgur, Arzucan
    Xiang, Zuoshuang
    He, Yongqun
    [J]. JOURNAL OF BIOMEDICAL SEMANTICS, 2015, 6