Sigmoid similarity - a new feature-based similarity measure

被引:16
作者
Likavec, Silvia [1 ]
Lombardi, Ilaria [1 ]
Cena, Federica [1 ]
机构
[1] Univ Torino, Dipartimento Informat, Italy Corso Svizzera 185, I-10149 Turin, Italy
关键词
Similarity; Properties; Feature-based similarity; Hierarchy; Ontology; Instances; SEMANTIC SIMILARITY; INFORMATION-CONTENT;
D O I
10.1016/j.ins.2018.12.018
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Similarity is one of the most straightforward ways to relate objects and guide the human perception of the world. It has an important role in many areas, such as Information Retrieval, Natural Language Processing, Semantic Web and Recommender Systems. To help applications in these areas achieve satisfying results when finding similar concepts, it is important to simulate human perception of similarity and assess which similarity measure is the most adequate. We propose Sigmoid similarity, a feature-based semantic similarity measure on instances in a specific ontology, as an improvement of Dice measure. We performed two separate evaluations with real evaluators. The first evaluation includes 137 subjects and 25 pairs of concepts in the recipes domain and the second one includes 147 subjects and 30 pairs of concepts in the drinks domain. To the best of our knowledge these are some of the most extensive evaluations in the field. We also explored the performance of some hierarchy-based approaches and showed that feature-based approaches outperform them on two specific ontologies we tested. In addition, we tried to incorporate hierarchy-based information into our measures and concluded it is not worth complicating the measures only based on features with additional information since they perform comparably. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:203 / 218
页数:16
相关论文
共 36 条
[1]  
[Anonymous], 1998, WORDNET ELECT LEXICA, DOI DOI 10.7551/MITPRESS/7287.001.0001
[2]   An ontology-based measure to compute semantic similarity in biomedicine [J].
Batet, Montserrat ;
Sanchez, David ;
Valls, Aida .
JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (01) :118-125
[3]   How do we measure and improve the quality of a hierarchical ontology? [J].
Beydoun, Ghassan ;
Lopez-Lorca, Antonio A. ;
Garcia-Sanchez, Francisco ;
Martinez-Bejar, Rodrigo .
JOURNAL OF SYSTEMS AND SOFTWARE, 2011, 84 (12) :2363-2373
[4]   Linked Data - The Story So Far [J].
Bizer, Christian ;
Heath, Tom ;
Berners-Lee, Tim .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (03) :1-22
[5]  
Budanitsky A, 2006, COMPUT LINGUIST, V32, P13, DOI 10.1162/coli.2006.32.1.13
[6]   Fuzzy Weighted Attribute Combinations Based Similarity Measures [J].
Coletti, Giulianella ;
Petturiti, Davide ;
Vantaggi, Barbara .
SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2017, 2017, 10369 :364-374
[7]  
Cordier A, 2014, STUD COMPUT INTELL, V494, P121, DOI 10.1007/978-3-642-38736-4_7
[8]   Association rule ontology matching approach [J].
David, Jerome ;
Guillet, Fabrice ;
Briand, Henri .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2007, 3 (02) :27-49
[9]  
Devi M.Uma., 2015, EMERGING ICT BRIDGIN, V1, P443
[10]  
Di Noia T., 2012, Proceedings of the 8th International Conference on Semantic Systems, I-SEMANTICS 12, P1, DOI [10.1145/2362499.2362501, DOI 10.1145/2362499.2362501]