Semantic Similarity Measurement Using Knowledge-Augmented Multiple-prototype Distributed Word Vector

被引:3
|
作者
Lu, Wei [1 ]
Shi, Kailun [2 ]
Cai, Yuanyuan [2 ]
Che, Xiaoping [2 ]
机构
[1] Beijing Jiaotong Univ, Sch Software Engn, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Distributed Word Vector; Knowledge-augmented; Multiple Prototype; Semantic Similarity;
D O I
10.4018/IJITN.2016040105
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Recent years, textual semantic similarity measurements play an important role in Natural Language Processing. The semantic similarity between concepts or terms can be measured by various resources like corpora, ontologies, taxonomies, etc. With the development of deep learning, distributed vector models are constructed for extracting the latent semantic information from corpora. Most of existing models create a single prototype vector to represent the meaning of a word such as CBOW. However, due to lexical ambiguity, encoding word meaning with a single vector is problematic. In this work, the authors propose a knowledge-augmented multiple-prototype model by using corpora and ontologies. Based on the distributed word vector learned by the CBOW model, the authors append the concept definition and the relational knowledge vector into the target word vector to enrich the semantic information of the word. Finally, the authors perform the experiments on well-known datasets to verify the efficiency of the authors' approach.
引用
收藏
页码:45 / 57
页数:13
相关论文
共 6 条
  • [1] Text Similarity Measurement of Semantic Cognition Based on Word Vector Distance Decentralization With Clustering Analysis
    Zhou, Shenghan
    Xu, Xingxing
    Liu, Yinglai
    Chang, Runfeng
    Xiao, Yiyong
    IEEE ACCESS, 2019, 7 : 107247 - 107258
  • [2] Towards Geological Knowledge Discovery Using Vector-Based Semantic Similarity
    Enkhsaikhan, Majigsuren
    Liu, Wei
    Holden, Eun-Jung
    Duuring, Paul
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2018, 2018, 11323 : 224 - 237
  • [3] Knowledge-enhanced Multi-semantic Fusion for Concept Similarity Measurement in Continuous Vector Space
    Cai, Yuanyuan
    Lu, Wei
    Che, Xiaoping
    Lu, Yuxun
    IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, : 415 - 423
  • [4] Enhancing Short Text Semantic Similarity Measurement Using Pretrained Word Embeddings and Big Data
    Jinarat, Supakpong
    Pruengkarn, Ratchakoon
    2024 5TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS AND PRACTICES, IBDAP, 2024, : 63 - 66
  • [5] Semantically Readable Distributed Representation Learning and Its Expandability Using a Word Semantic Vector Dictionary
    Keshi, Ikuo
    Suzuki, Yu
    Yoshino, Koichiro
    Nakamura, Satoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (04): : 1066 - 1078
  • [6] Evaluating keyphrase extraction algorithms for finding similar news articles using lexical similarity calculation and semantic relatedness measurement by word embedding
    Bin Sarwar, Talha
    Noor, Noorhuzaimi Mohd
    Miah, M. Saef Ullah
    PEERJ COMPUTER SCIENCE, 2022, 8