Evolution of Semantic Similarity-A Survey

Cited by: 195
Authors
Chandrasekaran, Dhivya [1]
Mago, Vijay [1]
Affiliations
[1] Lakehead Univ, 955 Oliver Rd, Thunder Bay, ON P7B 5E1, Canada
Keywords
Semantic similarity; linguistics; supervised and unsupervised methods; knowledge-based methods; word embeddings; corpus-based methods; INFORMATION-CONTENT; SENSE EMBEDDINGS; WORD; KNOWLEDGE; REPRESENTATION; MODELS; FRAMEWORK; KERNELS; WEB
DOI
10.1145/3440755
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Estimating the semantic similarity between text data is one of the challenging and open research problems in the field of Natural Language Processing (NLP). The versatility of natural language makes it difficult to define rule-based methods for determining semantic similarity measures. To address this issue, various semantic similarity methods have been proposed over the years. This survey article traces the evolution of such methods, beginning from traditional NLP techniques such as kernel-based methods to the most recent research work on transformer-based models, categorizing them based on their underlying principles as knowledge-based, corpus-based, deep neural network-based, and hybrid methods. Discussing the strengths and weaknesses of each method, this survey provides a comprehensive view of existing systems in place for new researchers to experiment and develop innovative ideas to address the issue of semantic similarity.
Pages: 37