Enhancing Word Embeddings for Improved Semantic Alignment

Cited by: 0
Authors
Szymanski, Julian [1]
Operlejn, Maksymilian [1]
Weichbroth, Pawel [2]
Affiliations
[1] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Dept Comp Syst Architecture, PL-80233 Gdansk, Poland
[2] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Dept Software Engn, PL-80233 Gdansk, Poland
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 24
Keywords
natural language processing; semantic ambiguity; word vector representation; Word2vec; polysemous word embedding; word sense disambiguation;
DOI
10.3390/app142411519
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
This study introduces a method for improving word vectors that addresses the limitations of traditional approaches such as Word2Vec and GloVe by introducing richer semantic properties into the embeddings. Our approach uses supervised learning to shift vectors in the representation space, which improves the quality of the word embeddings and aligns them more closely with semantic reference resources such as WordNet. We demonstrate the effectiveness of the method by applying the modified embeddings to text classification and clustering. We also show how the method influences document class distributions, visualized through PCA projections. Comparison with state-of-the-art approaches, over which the method achieves better accuracy, confirms its effectiveness. The results underscore the potential of adaptive embeddings to improve both the accuracy and efficiency of semantic analysis across a range of NLP tasks.
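The general idea of shifting vectors toward a semantic reference resource can be illustrated with a minimal sketch. This is not the authors' supervised method, whose details the abstract does not give; it is a plain interpolation of each vector toward the centroid of its related words. The function names, the `alpha` parameter, the toy 2-D vectors, and the `related` dictionary (standing in for relations that might be drawn from a resource such as WordNet) are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def shift_toward_related(embeddings, related, alpha=0.25):
    """Move each vector a fraction `alpha` toward the centroid of its related words.

    Hypothetical helper: `related` maps a word to a list of semantically
    related words. Words with no related entries keep their original vector.
    """
    shifted = {}
    for word, vec in embeddings.items():
        neighbours = [embeddings[w] for w in related.get(word, []) if w in embeddings]
        if not neighbours:
            shifted[word] = list(vec)
            continue
        # Centroid of the related words' original vectors, per dimension.
        centroid = [sum(col) / len(neighbours) for col in zip(*neighbours)]
        shifted[word] = [(1 - alpha) * x + alpha * c for x, c in zip(vec, centroid)]
    return shifted


# Toy vectors with made-up values; real embeddings would come from a trained model.
embeddings = {
    "car": [1.0, 0.0],
    "automobile": [0.0, 1.0],
    "banana": [-1.0, 0.0],
}
# Assumed synonym relations, standing in for a lexical resource.
related = {"car": ["automobile"], "automobile": ["car"]}

shifted = shift_toward_related(embeddings, related, alpha=0.25)
before = cosine(embeddings["car"], embeddings["automobile"])
after = cosine(shifted["car"], shifted["automobile"])
print(f"similarity before: {before:.2f}, after: {after:.2f}")
```

In this toy setting the two synonyms end up closer in cosine similarity after the shift, while the unrelated word is left unchanged. A supervised approach such as the one the abstract describes would presumably learn the shifts rather than apply a fixed interpolation.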
Pages: 19