An Efficient Approach for Semantic Relatedness Evaluation based on Semantic Neighborhood

被引:1
作者
Lopes, Alcides [1 ]
Alvarenga, Renata [2 ]
Carbonera, Joel [1 ]
Abel, Mara [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
[2] Univ Fed Rio Grande do Sul, Inst Geosci, Porto Alegre, RS, Brazil
来源
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019) | 2019年
关键词
Semantic relatedness; semantic neighbors; word sense disambiguation; INFORMATION-CONTENT; SIMILARITY;
D O I
10.1109/ICTAI.2019.00052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of natural language processing and information retrieval, ontologies can improve the results of the word sense disambiguation (WSD) techniques. By making explicit the semantics of the term, ontology-based semantic measures play a crucial role to determine how different ontology classes have a similar meaning. In this context, it is common to use semantic similarity as a basis for WSD. However, the measures generally consider only taxonomic relationships, which affects negatively the discrimination of two ontology classes that are related by the other relationship types. On the other hand, semantic relatedness measures consider diverse types of relationships to determine how much two classes on the ontology are related. However, these measures, especially the path-based approaches, has as the main drawback a high computational complexity to be calculated in query execution time. Also, for both types of semantic measures, it is unpractical to store all similarity or relatedness values between all ontology classes in memory, especially for large ontologies. In this work, we propose a novel approach based on semantic neighbors that aim to improve the query time in path-based semantic measures without losing their effectiveness in relatedness analysis. We also propose an efficient algorithm to calculate the semantic distance between two ontology classes. We evaluate our proposal in WSD using a pre-existent domain ontology for well-core description. This ontology contains 929 classes related to rock facies and a set of sentences from four different corpora about geology in the Oil&Gas domain. In the experiments, we compared our approach with state-of-the-art semantic relatedness measures, such as path-based, feature-based, information content, and hybrid methods regarding the F-score, query time and the total number of classes in memory. The experimental results show that the proposed method obtains F-score gains in WSD, as well as an improvement in the query time concerning the traditional path-based approaches. Also, we reduce the total number of classes stored in memory for each ontology class.
引用
收藏
页码:316 / 323
页数:8
相关论文
共 20 条
[1]   A hybrid approach for measuring semantic similarity based on IC-weighted path distance in WordNet [J].
Cai, Yuanyuan ;
Zhang, Qingchuan ;
Lu, Wei ;
Che, Xiaoping .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 51 (01) :23-47
[2]  
Diefenbach D., 2016, P 6 INT C WEB INT MI, P20, DOI [10.1145/2912845.2912864, DOI 10.1145/2912845.2912864]
[3]  
Dijkstra E. W., 1959, Numer. Math, V1, P269, DOI [DOI 10.1007/BF01386390, 10.1007/BF01386390]
[4]  
Dou Hao, 2011, Proceedings of the 2011 Second International Conference on Digital Manufacturing and Automation (ICDMA 2011), P177, DOI 10.1109/ICDMA.2011.50
[5]  
Hirst G, 1998, LANG SPEECH & COMMUN, P305
[6]  
Li YH, 2003, IEEE T KNOWL DATA EN, V15, P871, DOI 10.1109/TKDE.2003.1209005
[7]   Sigmoid similarity - a new feature-based similarity measure [J].
Likavec, Silvia ;
Lombardi, Ilaria ;
Cena, Federica .
INFORMATION SCIENCES, 2019, 481 :203-218
[8]  
Liu XY, 2007, PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, P3431
[9]  
Lorenzatti A, 2009, LECT NOTES COMPUT SC, V5833, P169, DOI 10.1007/978-3-642-04947-7_21
[10]   Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text [J].
McInnes, Bridget T. ;
Pedersen, Ted .
JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 (06) :1116-1124