Computing semantic similarity between biomedical concepts using new information content approach

Cited by: 34
Authors
Ben Aouicha, Mohamed [1 ]
Taieb, Mohamed Ali Hadj [1 ]
Affiliations
[1] Sfax Univ, Multimedia InfoRmat Syst & Adv Comp Lab, Sfax 3021, Tunisia
Keywords
Semantic similarity; Information content; DAG topological parameters; MeSH; Biomedicine; RELATEDNESS;
DOI
10.1016/j.jbi.2015.12.007
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
The exploitation of heterogeneous clinical sources and healthcare records is fundamental in clinical and translational research. The determination of semantic similarity between word pairs is an important component of text understanding that enables the processing and structuring of textual resources. Some semantic similarity measures have been adapted to the biomedical field by incorporating domain information extracted from clinical data or from medical ontologies such as MeSH. This study focuses on Information Content (IC) based measures that exploit the topological parameters of the taxonomy to express the semantics of a concept. A new intrinsic IC computing method, based on the taxonomical parameters of the ancestors' subgraph, is then proposed for assigning an IC value to a biomedical concept in the "is a" hierarchy. Moreover, we present a study of the topological parameters through the MeSH taxonomy. This study treats the semantic interpretation and the different ways of expressing the parameters of depth and the descendants' subgraph. Using MeSH as an input ontology, the accuracy of our proposal is evaluated and compared against other IC-based measures according to several widely used benchmarks of biomedical terms. The correlation between the results obtained for the evaluated measure using the proposed approach and the ratings of human experts shows that our proposal outperforms the previous measures. © 2015 Elsevier Inc. All rights reserved.
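To make the idea of an intrinsic, topology-based IC concrete, the following is a minimal sketch and not the method proposed in the paper. It assumes a tiny hypothetical "is a" hierarchy (not real MeSH data), uses a well-known descendant-count formulation of intrinsic IC (the form commonly attributed to Seco et al.) as a stand-in, and plugs it into Lin's IC-based similarity. All names (`PARENTS`, `intrinsic_ic`, `lin_similarity`) are illustrative assumptions.

```python
import math

# Hypothetical toy "is a" hierarchy (child -> list of parents); not MeSH data.
PARENTS = {
    "disease": [],
    "infection": ["disease"],
    "neoplasm": ["disease"],
    "viral_infection": ["infection"],
    "bacterial_infection": ["infection"],
}


def ancestors(concept):
    """Return the concept together with all of its ancestors in the DAG."""
    seen = {concept}
    stack = [concept]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def descendant_count(concept):
    """Count the concepts that have `concept` as a proper ancestor."""
    return sum(1 for c in PARENTS if c != concept and concept in ancestors(c))


def intrinsic_ic(concept):
    """Descendant-based intrinsic IC: 1 - log(desc + 1) / log(total concepts).

    This is a stand-in formulation, not the ancestors'-subgraph method
    described in the abstract.
    """
    return 1.0 - math.log(descendant_count(concept) + 1) / math.log(len(PARENTS))


def lin_similarity(a, b):
    """Lin similarity using the most informative common ancestor."""
    common = ancestors(a) & ancestors(b)
    ic_common = max(intrinsic_ic(c) for c in common)
    denom = intrinsic_ic(a) + intrinsic_ic(b)
    # Both concepts carry zero information (e.g. the root compared to itself).
    return 2.0 * ic_common / denom if denom > 0 else 0.0
```

In this toy hierarchy the root carries no information, leaves carry the maximum, and two sibling infections come out more similar to each other than either is to a neoplasm. A measure built on the ancestors' subgraph, as in the paper, would replace `intrinsic_ic` while leaving the similarity function unchanged.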
Pages: 258-275
Number of pages: 18
Related papers
50 in total
  • [21] An information Content-Based Approach for Measuring Concept Semantic Similarity in WordNet
    Zhang, Xiaogang
    Sun, Shouqian
    Zhang, Kejun
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 103 (01) : 117 - 132
  • [22] An information Content-Based Approach for Measuring Concept Semantic Similarity in WordNet
    Xiaogang Zhang
    Shouqian Sun
    Kejun Zhang
    Wireless Personal Communications, 2018, 103 : 117 - 132
  • [23] Assessment of Semantic Similarity between Proteins Using Information Content and Topological Properties of the Gene Ontology Graph
    Dutta, Pritha
    Basu, Subhadip
    Kundu, Mahantapas
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (03) : 839 - 849
  • [24] Information content measures of semantic similarity between documents based on Hadoop system
    Birjali, Marouane
    Beni-Hssane, Abderrahim
    Erritali, Mohammed
    Madani, Youness
    2016 INTERNATIONAL CONFERENCE ON WIRELESS NETWORKS AND MOBILE COMMUNICATIONS (WINCOM), 2016, : P187 - P192
  • [25] A new hybrid semantic similarity measure using information content and topological features of the Gene Ontology graph
    Dutta, Pritha
    Basu, Subhadip
    Kundu, Mahantapas
    2017 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2017,
  • [26] A New Approach for Calculating Semantic Similarity between Words Using WordNet and Set Theory
    Ezzikouri, Hanane
    Madani, Youness
    Erritali, Mohammed
    Oukessou, Mohamed
    10TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2019) / THE 2ND INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40 2019) / AFFILIATED WORKSHOPS, 2019, 151 : 1261 - 1265
  • [27] Assessing Semantic Similarity Between Concepts Using Wikipedia Based on Nonlinear Fitting
    Huang, Guangjian
    Jiang, Yuncheng
    Ma, Wenjun
    Liu, Weiru
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 159 - 171
  • [28] Decision of Semantic Similarity using Description Logic and Vector Weight between Concepts
    Kim, Su-Kyoung
    Choi, Ho-Jin
    NCM 2008: 4TH INTERNATIONAL CONFERENCE ON NETWORKED COMPUTING AND ADVANCED INFORMATION MANAGEMENT, VOL 2, PROCEEDINGS, 2008, : 345 - 350
  • [29] A Model for the Relationship between Semantic and Content Based Similarity using LIDC
    Dasovich, Grace
    Kim, Robert
    Raicu, Daniela S.
    Furst, Jacob D.
    MEDICAL IMAGING 2010: COMPUTER - AIDED DIAGNOSIS, 2010, 7624
  • [30] Analysis and Implementation Measurement of Semantic Similarity Using Content Management Information on WordNet
    Sagala, Tommy Wijaya
    Wati, Theresia
    Solikin
    Budi, Nur Fitriah Ayuning
    Hidayanto, Achmad Nizar
    2018 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2018, : 337 - 342