Literature Mining and Ontology based Analysis of Host-Brucella Gene-Gene Interaction Network

被引:6
作者
Karadeniz, Ilknur [2 ]
Hur, Junguk [1 ,3 ]
He, Yongqun [4 ,5 ,6 ]
Ozgur, Arzucan [2 ]
机构
[1] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
[2] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
[3] Univ N Dakota, Sch Med & Hlth Sci, Dept Basic Sci, Grand Forks, ND 58201 USA
[4] Univ Michigan, Dept Microbiol & Immunol, Unit Lab Anim Med, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Sch Med, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[6] Univ Michigan Hlth Syst, Ctr Comprehens Canc, Ann Arbor, MI USA
来源
FRONTIERS IN MICROBIOLOGY | 2015年 / 6卷
关键词
host-pathogen interaction extraction; Brucella; text mining; host and pathogen gene name recognition; SciMiner; support vector machines (SVM); Interaction Network Ontology (INO); INFORMATION; PROTECTION; PROTEIN; IDENTIFICATION; MELITENSIS; EXTRACTION; ABORTUS; MICE; CELL;
D O I
10.3389/fmicb.2015.01386
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Brucella is an intracellular bacterium that causes chronic brucellosis in humans and various mammals. The identification of host-Brucella interaction is crucial to understand host immunity against Brucella infection and Brucella pathogenesis against host immune responses. Most of the information about the inter-species interactions between host and Brucella genes is only available in the text of the scientific publications. Many text-mining systems for extracting gene and protein interactions have been proposed. However, only a few of them have been designed by considering the peculiarities of host-pathogen interactions. In this paper, we used a text mining approach for extracting host-Brucella gene gene interactions from the abstracts of articles in PubMed. The gene-gene interactions here represent the interactions between genes and/or gene products (e.g., proteins). The SciMiner tool, originally designed for detecting mammalian gene/protein names in text, was extended to identify host and Brucella gene/protein names in the abstracts. Next, sentence level and abstract level co-occurrence based approaches, as well as sentence-level machine learning based methods, originally designed for extracting intra-species gene interactions, were utilized to extract the interactions among the identified host and Brucella genes. The extracted interactions were manually evaluated. A total of 46 host-Brucella gene interactions were identified and represented as an interaction network. Twenty four of these interactions were identified from sentence-level processing. Twenty two additional interactions were identified when abstract level processing was performed. The Interaction Network Ontology (INO) was used to represent the identified interaction types at a hierarchical ontology structure. Ontological modeling of specific gene-gene interactions demonstrates that host-pathogen gene-gene interactions occur at experimental conditions which can be ontologically represented. Our results show that the introduced literature mining and ontology-based modeling approach are effective in retrieving and analyzing host-pathogen gene-gene interaction networks.
引用
收藏
页数:10
相关论文
共 50 条
[41]   Gene Ontology and KEGG Pathway Enrichment Analysis of a Drug Target-Based Classification System [J].
Chen, Lei ;
Chu, Chen ;
Lu, Jing ;
Kong, Xiangyin ;
Huang, Tao ;
Cai, Yu-Dong .
PLOS ONE, 2015, 10 (05)
[42]   GO-Bayes: Gene Ontology-based overrepresentation analysis using a Bayesian approach [J].
Zhang, Song ;
Cao, Jing ;
Kong, Y. Megan ;
Scheuermann, Richard H. .
BIOINFORMATICS, 2010, 26 (07) :905-911
[43]   Literature mining, gene-set enrichment and pathway analysis for target identification in Behcet's disease [J].
Wilson, P. ;
Larminie, C. ;
Smith, R. .
CLINICAL AND EXPERIMENTAL RHEUMATOLOGY, 2016, 34 (06) :S101-S110
[44]   Structuring osteosarcoma knowledge: an osteosarcoma-gene association database based on literature mining and manual annotation [J].
Poos, Kathrin ;
Smida, Jan ;
Nathrath, Michaela ;
Maugg, Doris ;
Baumhoer, Daniel ;
Neumann, Anna ;
Korsching, Eberhard .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2014,
[45]   Network-Based Analysis of OMICs Data to Understand the HIV-Host Interaction [J].
Ivanov, Sergey ;
Lagunin, Alexey ;
Filimonov, Dmitry ;
Tarasova, Olga .
FRONTIERS IN MICROBIOLOGY, 2020, 11
[46]   DiCE: differential centrality-ensemble analysis based on gene expression profiles and protein-protein interaction network [J].
Pashaei, Elnaz ;
Liu, Sheng ;
Li, Kailing ;
Zang, Yong ;
Yang, Lei ;
Lautenschlaeger, Tim ;
Huang, Jun ;
Lu, Xin ;
Wan, Jun .
NUCLEIC ACIDS RESEARCH, 2025, 53 (13)
[47]   A network-based gene-weighting approach for pathway analysis [J].
Fang, Zhaoyuan ;
Tian, Weidong ;
Ji, Hongbin .
CELL RESEARCH, 2012, 22 (03) :565-580
[48]   Gene network-based cancer prognosis analysis with sparse boosting [J].
Ma, Shuangge ;
Huang, Yuan ;
Huang, Jian ;
Fang, Kuangnan .
GENETICS RESEARCH, 2012, 94 (04) :205-221
[49]   Literature-Based Discovery of IFN-γ and Vaccine-Mediated Gene Interaction Networks [J].
Ozgur, Arzucan ;
Xiang, Zuoshuang ;
Radev, Dragomir R. ;
He, Yongqun .
JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2010,
[50]   Expression and gene regulation network of INHBA in Head and neck squamous cell carcinoma based on data mining [J].
Wu, Zeng-hong ;
Tang, Yun ;
Niu, Xun ;
Cheng, Qing .
SCIENTIFIC REPORTS, 2019, 9 (1)