Literature Mining and Ontology based Analysis of Host-Brucella Gene-Gene Interaction Network

被引:6
作者
Karadeniz, Ilknur [2 ]
Hur, Junguk [1 ,3 ]
He, Yongqun [4 ,5 ,6 ]
Ozgur, Arzucan [2 ]
机构
[1] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
[2] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
[3] Univ N Dakota, Sch Med & Hlth Sci, Dept Basic Sci, Grand Forks, ND 58201 USA
[4] Univ Michigan, Dept Microbiol & Immunol, Unit Lab Anim Med, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Sch Med, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[6] Univ Michigan Hlth Syst, Ctr Comprehens Canc, Ann Arbor, MI USA
来源
FRONTIERS IN MICROBIOLOGY | 2015年 / 6卷
关键词
host-pathogen interaction extraction; Brucella; text mining; host and pathogen gene name recognition; SciMiner; support vector machines (SVM); Interaction Network Ontology (INO); INFORMATION; PROTECTION; PROTEIN; IDENTIFICATION; MELITENSIS; EXTRACTION; ABORTUS; MICE; CELL;
D O I
10.3389/fmicb.2015.01386
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Brucella is an intracellular bacterium that causes chronic brucellosis in humans and various mammals. The identification of host-Brucella interaction is crucial to understand host immunity against Brucella infection and Brucella pathogenesis against host immune responses. Most of the information about the inter-species interactions between host and Brucella genes is only available in the text of the scientific publications. Many text-mining systems for extracting gene and protein interactions have been proposed. However, only a few of them have been designed by considering the peculiarities of host-pathogen interactions. In this paper, we used a text mining approach for extracting host-Brucella gene gene interactions from the abstracts of articles in PubMed. The gene-gene interactions here represent the interactions between genes and/or gene products (e.g., proteins). The SciMiner tool, originally designed for detecting mammalian gene/protein names in text, was extended to identify host and Brucella gene/protein names in the abstracts. Next, sentence level and abstract level co-occurrence based approaches, as well as sentence-level machine learning based methods, originally designed for extracting intra-species gene interactions, were utilized to extract the interactions among the identified host and Brucella genes. The extracted interactions were manually evaluated. A total of 46 host-Brucella gene interactions were identified and represented as an interaction network. Twenty four of these interactions were identified from sentence-level processing. Twenty two additional interactions were identified when abstract level processing was performed. The Interaction Network Ontology (INO) was used to represent the identified interaction types at a hierarchical ontology structure. Ontological modeling of specific gene-gene interactions demonstrates that host-pathogen gene-gene interactions occur at experimental conditions which can be ontologically represented. Our results show that the introduced literature mining and ontology-based modeling approach are effective in retrieving and analyzing host-pathogen gene-gene interaction networks.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Detecting Essential Proteins Based on Network Topology, Gene Expression Data, and Gene Ontology Information
    Zhang, Wei
    Xu, Jia
    Li, Yuanyuan
    Zou, Xiufen
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (01) : 109 - 116
  • [22] VisANT 3.5: multi-scale network visualization, analysis and inference based on the gene ontology
    Hu, Zhenjun
    Hung, Jui-Hung
    Wang, Yan
    Chang, Yi-Chien
    Huang, Chia-Ling
    Huyck, Matt
    DeLisi, Charles
    NUCLEIC ACIDS RESEARCH, 2009, 37 : W115 - W121
  • [23] Dynamic and collective analysis of membrane protein interaction network based on gene regulatory network model
    Ding, Yong-Sheng
    Shen, Yi-Zhen
    Ren, Li-Hong
    Cheng, Li-Jun
    NEUROCOMPUTING, 2012, 98 : 151 - 158
  • [24] ChIP-seq analysis of the LuxR-type regulator VjbR reveals novel insights into the Brucella virulence gene expression network
    Kleinman, Claudia L.
    Sycz, Gabriela
    Bonomi, Hernan R.
    Rodriguez, Romina M.
    Zorreguieta, Angeles
    Sieira, Rodrigo
    NUCLEIC ACIDS RESEARCH, 2017, 45 (10) : 5757 - 5769
  • [25] Analysis of Effect of Schisandra in the Treatment of Myocardial Infarction Based on Three-Mode Gene Ontology Network
    Hu, Siyao
    Zuo, Huali
    Qi, Jin
    Hu, Yuanjia
    Yu, Boyang
    FRONTIERS IN PHARMACOLOGY, 2019, 10
  • [26] Prediction of key gene function in spinal muscular atrophy using guilt by association method based on network and gene ontology
    Yang, Wenjiu
    Han, Jing
    Ma, Jinfeng
    Feng, Yujie
    Hou, Qingxian
    Wang, Zhijie
    Yu, Tengbo
    EXPERIMENTAL AND THERAPEUTIC MEDICINE, 2019, 17 (04) : 2561 - 2566
  • [27] Automatic extraction of reference gene from literature in plants based on texting mining
    He Lin
    Shen Gengyu
    Li Fei
    Huang Shuiqing
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 12 (04) : 400 - 416
  • [28] Literature Based Bayesian Analysis of Gene Expression Data
    Xu, Lijing
    Homayouni, Ramin
    George, E. Olusegun
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 1032 - 1032
  • [29] Constructing a molecular interaction network for thyroid cancer via large-scale text mining of gene and pathway events
    Wu, Chengkun
    Schwartz, Jean-Marc
    Brabant, Georg
    Peng, Shao-Liang
    Nenadic, Goran
    BMC SYSTEMS BIOLOGY, 2015, 9
  • [30] Knowledge-Driven Analysis Identifies a Gene-Gene Interaction Affecting High-Density Lipoprotein Cholesterol Levels in Multi-Ethnic Populations
    Ma, Li
    Brautbar, Ariel
    Boerwinkle, Eric
    Sing, Charles F.
    Clark, Andrew G.
    Keinan, Alon
    PLOS GENETICS, 2012, 8 (05):