Subjective Bayes Method for Word Semantic Similarity Measurement

被引:0
|
作者
Wang, Junhua [1 ]
Zuo, Xianglin [1 ]
Zuo, Wanli [1 ]
Peng, Tao [1 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130023, Peoples R China
关键词
Word Semantic Similarity; Scatter Plot; Piecewise linear interpolation; Subjective Bayes; Word Net; INFORMATION-CONTENT; CONTEXT;
D O I
10.1109/ICDMW.2013.47
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Measuring semantic similarity between words is a classical problem in nature language processing, the result of which can promote many applications such as machine translation, word sense disambiguation, ontology mapping, computational linguistics, etc. This paper combines knowledge-based methods with statistical methods in measuring words similarity, the novel aspect of which is that subjective Bayes method is employed. Firstly, extract evidences based on Word Net; secondly, analyze reasonableness of candidate evidence using scatter plot; thirdly, generate sufficiency measure by statistics and piecewise linear interpolation technique; fourthly, obtain comprehensive posteriori by integrating uncertainty reasoning with conclusion uncertainty synthetic strategy; finally, we quantify word semantic similarity. On data set R&G (65), we conducted experiment through 5-fold cross validation, and the correlation of our experimental results with human judgment is 0.912, with 0.4% improvements over existing best practice, which show that using subjective Bayes method to measure word semantic similarity is reasonable and effective.
引用
收藏
页码:971 / 977
页数:7
相关论文
共 50 条
  • [21] An Improved Semantic Similarity Measure for Word Pairs
    Cai, Songmei
    Lu, Zhao
    2010 INTERNATIONAL CONFERENCE ON E-EDUCATION, E-BUSINESS, E-MANAGEMENT AND E-LEARNING: IC4E 2010, PROCEEDINGS, 2010, : 212 - 216
  • [22] Applying a Naive Bayes Similarity Measure to Word Sense Disambiguation
    Wang, Tong
    Hirst, Graeme
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 531 - 537
  • [23] A novel method based on symbolic regression for interpretable semantic similarity measurement
    Martinez-Gil, Jorge
    Chaves-Gonzalez, Jose M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160 (160)
  • [24] Combining Word Embedding and Semantic Lexicon for Chinese Word Similarity Computation
    Pei, Jiahuan
    Zhang, Cong
    Huang, Degen
    Ma, Jianjun
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 766 - 777
  • [25] Text Similarity Measurement of Semantic Cognition Based on Word Vector Distance Decentralization With Clustering Analysis
    Zhou, Shenghan
    Xu, Xingxing
    Liu, Yinglai
    Chang, Runfeng
    Xiao, Yiyong
    IEEE ACCESS, 2019, 7 : 107247 - 107258
  • [26] Word Clustering based on Word2vec and Semantic Similarity
    Luo Jie
    Wang Qinglin
    Li Yuan
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 517 - 521
  • [27] Enhancing Short Text Semantic Similarity Measurement Using Pretrained Word Embeddings and Big Data
    Jinarat, Supakpong
    Pruengkarn, Ratchakoon
    2024 5TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS AND PRACTICES, IBDAP, 2024, : 63 - 66
  • [28] Word Semantic Similarity Calculation Based on Word2vec
    Jin, Xiaolin
    Zhang, Shuwu
    Liu, Jie
    2018 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2018, : 12 - 16
  • [29] Semantic text similarity using corpus-based word similarity and string similarity
    University of Ottawa
    不详
    ACM Transactions on Knowledge Discovery from Data, 2008, 2 (02)
  • [30] Measurement of word similarity based on Corpus
    Zhang Zhiling
    Yu Liqun
    Luo Haifei
    Shao Xiaomin
    Proceedings of the 24th Chinese Control Conference, Vols 1 and 2, 2005, : 1297 - 1301