Literature Mining and Ontology based Analysis of Host-Brucella Gene-Gene Interaction Network

被引:6
作者
Karadeniz, Ilknur [2 ]
Hur, Junguk [1 ,3 ]
He, Yongqun [4 ,5 ,6 ]
Ozgur, Arzucan [2 ]
机构
[1] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
[2] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
[3] Univ N Dakota, Sch Med & Hlth Sci, Dept Basic Sci, Grand Forks, ND 58201 USA
[4] Univ Michigan, Dept Microbiol & Immunol, Unit Lab Anim Med, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Sch Med, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[6] Univ Michigan Hlth Syst, Ctr Comprehens Canc, Ann Arbor, MI USA
来源
FRONTIERS IN MICROBIOLOGY | 2015年 / 6卷
关键词
host-pathogen interaction extraction; Brucella; text mining; host and pathogen gene name recognition; SciMiner; support vector machines (SVM); Interaction Network Ontology (INO); INFORMATION; PROTECTION; PROTEIN; IDENTIFICATION; MELITENSIS; EXTRACTION; ABORTUS; MICE; CELL;
D O I
10.3389/fmicb.2015.01386
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Brucella is an intracellular bacterium that causes chronic brucellosis in humans and various mammals. The identification of host-Brucella interaction is crucial to understand host immunity against Brucella infection and Brucella pathogenesis against host immune responses. Most of the information about the inter-species interactions between host and Brucella genes is only available in the text of the scientific publications. Many text-mining systems for extracting gene and protein interactions have been proposed. However, only a few of them have been designed by considering the peculiarities of host-pathogen interactions. In this paper, we used a text mining approach for extracting host-Brucella gene gene interactions from the abstracts of articles in PubMed. The gene-gene interactions here represent the interactions between genes and/or gene products (e.g., proteins). The SciMiner tool, originally designed for detecting mammalian gene/protein names in text, was extended to identify host and Brucella gene/protein names in the abstracts. Next, sentence level and abstract level co-occurrence based approaches, as well as sentence-level machine learning based methods, originally designed for extracting intra-species gene interactions, were utilized to extract the interactions among the identified host and Brucella genes. The extracted interactions were manually evaluated. A total of 46 host-Brucella gene interactions were identified and represented as an interaction network. Twenty four of these interactions were identified from sentence-level processing. Twenty two additional interactions were identified when abstract level processing was performed. The Interaction Network Ontology (INO) was used to represent the identified interaction types at a hierarchical ontology structure. Ontological modeling of specific gene-gene interactions demonstrates that host-pathogen gene-gene interactions occur at experimental conditions which can be ontologically represented. Our results show that the introduced literature mining and ontology-based modeling approach are effective in retrieving and analyzing host-pathogen gene-gene interaction networks.
引用
收藏
页数:10
相关论文
共 50 条
[21]   RETRACTED: Gene-gene interaction network analysis of ovarian cancer using TCGA data (Retracted article. See vol. 8, 18, 2015) [J].
Ying, Huanchun ;
Lv, Jing ;
Ying, Tianshu ;
Jin, Shanshan ;
Shao, Jingru ;
Wang, Lili ;
Xu, Hongying ;
Yuan, Bin ;
Yang, Qing .
JOURNAL OF OVARIAN RESEARCH, 2013, 6
[22]   VisANT 3.5: multi-scale network visualization, analysis and inference based on the gene ontology [J].
Hu, Zhenjun ;
Hung, Jui-Hung ;
Wang, Yan ;
Chang, Yi-Chien ;
Huang, Chia-Ling ;
Huyck, Matt ;
DeLisi, Charles .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W115-W121
[23]   Dynamic and collective analysis of membrane protein interaction network based on gene regulatory network model [J].
Ding, Yong-Sheng ;
Shen, Yi-Zhen ;
Ren, Li-Hong ;
Cheng, Li-Jun .
NEUROCOMPUTING, 2012, 98 :151-158
[24]   ChIP-seq analysis of the LuxR-type regulator VjbR reveals novel insights into the Brucella virulence gene expression network [J].
Kleinman, Claudia L. ;
Sycz, Gabriela ;
Bonomi, Hernan R. ;
Rodriguez, Romina M. ;
Zorreguieta, Angeles ;
Sieira, Rodrigo .
NUCLEIC ACIDS RESEARCH, 2017, 45 (10) :5757-5769
[25]   Analysis of Effect of Schisandra in the Treatment of Myocardial Infarction Based on Three-Mode Gene Ontology Network [J].
Hu, Siyao ;
Zuo, Huali ;
Qi, Jin ;
Hu, Yuanjia ;
Yu, Boyang .
FRONTIERS IN PHARMACOLOGY, 2019, 10
[26]   Prediction of key gene function in spinal muscular atrophy using guilt by association method based on network and gene ontology [J].
Yang, Wenjiu ;
Han, Jing ;
Ma, Jinfeng ;
Feng, Yujie ;
Hou, Qingxian ;
Wang, Zhijie ;
Yu, Tengbo .
EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2019, 17 (04) :2561-2566
[27]   Automatic extraction of reference gene from literature in plants based on texting mining [J].
He Lin ;
Shen Gengyu ;
Li Fei ;
Huang Shuiqing .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 12 (04) :400-416
[28]   Literature Based Bayesian Analysis of Gene Expression Data [J].
Xu, Lijing ;
Homayouni, Ramin ;
George, E. Olusegun .
2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, :1032-1032
[29]   Constructing a molecular interaction network for thyroid cancer via large-scale text mining of gene and pathway events [J].
Wu, Chengkun ;
Schwartz, Jean-Marc ;
Brabant, Georg ;
Peng, Shao-Liang ;
Nenadic, Goran .
BMC SYSTEMS BIOLOGY, 2015, 9
[30]   Knowledge-Driven Analysis Identifies a Gene-Gene Interaction Affecting High-Density Lipoprotein Cholesterol Levels in Multi-Ethnic Populations [J].
Ma, Li ;
Brautbar, Ariel ;
Boerwinkle, Eric ;
Sing, Charles F. ;
Clark, Andrew G. ;
Keinan, Alon .
PLOS GENETICS, 2012, 8 (05)