An Efficient and Asymmetrical Feature Mapping Model for Measuring Semantic Similarity

被引:0
作者
Zhu, Xinhua [1 ]
Xus, Qingting [1 ]
Zhang, Lanfang [2 ]
Guo, Xiaohua [1 ]
Chen, Hongchao [1 ]
Guo, Qingsong [1 ]
机构
[1] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Peoples R China
[2] Guangxi Normal Univ, Fac Educ, Guilin, Peoples R China
来源
2019 ELEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI 2019) | 2019年
基金
中国国家自然科学基金;
关键词
semantic similarity; feature mapping; WordNet; SNOMED CT; INFORMATION-CONTENT; EDGE;
D O I
10.1109/icaci.2019.8778521
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The semantic similarity estimation is an important basic research topic in conceptual modeling. This paper proposes a high-efficiency and asymmetrical feature mapping model for computing semantic similarity. Firstly, to improve efficiency, we take the edge as the main information source of mapped features for concepts. Then, to avoid the computed deviation with the human judgment in the low score range is more significant than that in the high score range, we map the commonalities between concepts only into the single information source of the weighted depth of their Least Common Subsumer (LCS), and map their differences into three types of information sources, including their weighted path, density and encoding distance, to improve the ability to distinguish between different concepts. To further improve efficiency, we consider only the direct hyponyms of LCS as its density, rather than all of its hyponyms. Moreover, the edge weight is introduced into the calculation of depths and paths to reduce the influence of high-layer edges. The experimental results show that the proposed model is an excellent semantic similarity method with high computational efficiency and high measurement accuracy on both the common ontology WordNet and the biomedical ontology SNOMED CT.
引用
收藏
页码:66 / 71
页数:6
相关论文
共 24 条
[1]   Semantic similarity assessment of words using weighted WordNet [J].
Ahsaee, Mostafa Ghazizadeh ;
Naghibzadeh, Mahmoud ;
Naeini, S. Ehsan Yasrebi .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2014, 5 (03) :479-490
[2]   Semantic similarity estimation from multiple ontologies [J].
Batet, Montserrat ;
Sanchez, David ;
Valls, Aida ;
Gibert, Karina .
APPLIED INTELLIGENCE, 2013, 38 (01) :29-44
[3]   LWCR: multi-Layered Wikipedia representation for Computing word Relatedness [J].
Ben Aouicha, Mohamed ;
Taieb, Mohamed Ali Hadj ;
Ben Hamadou, Abdelmajid .
NEUROCOMPUTING, 2016, 216 :816-843
[4]   A Word Net-based semantic similarity measurement combining edge-counting and information content theory [J].
Gao, Jian-Bo ;
Zhang, Bao-Wen ;
Chen, Xiao-Hua .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 39 :80-88
[5]   Wikipedia-based information content and semantic similarity computation [J].
Jiang, Yuncheng ;
Bai, Wen ;
Zhang, Xiaopei ;
Hu, Jiaojiao .
INFORMATION PROCESSING & MANAGEMENT, 2017, 53 (01) :248-265
[6]  
Li YH, 2003, IEEE T KNOWL DATA EN, V15, P871, DOI 10.1109/TKDE.2003.1209005
[7]  
Lin D, 1998, P 15 INT C MACH LEAR, V98, P296
[8]   CONTEXTUAL CORRELATES OF SEMANTIC SIMILARITY [J].
MILLER, GA ;
CHARLES, WG .
LANGUAGE AND COGNITIVE PROCESSES, 1991, 6 (01) :1-28
[9]   Measures of semantic similarity and relatedness in the biomedical domain [J].
Pedersen, Ted ;
Pakhomov, Serguei V. S. ;
Patwardhan, Siddharth ;
Chute, Christopher G. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (03) :288-299
[10]  
Petrakis E. G. M., 2006, Journal of Digital Information Management, V4, P233