Enhancing Word Embeddings for Improved Semantic Alignment

被引:0
|
作者
Szymanski, Julian [1 ]
Operlejn, Maksymilian [1 ]
Weichbroth, Pawel [2 ]
机构
[1] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Dept Comp Syst Architecture, PL-80233 Gdansk, Poland
[2] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Dept Software Engn, PL-80233 Gdansk, Poland
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 24期
关键词
natural language processing; semantic ambiguity; word vector representation; Word2vec; polysemous word embedding; word sense disambiguation;
D O I
10.3390/app142411519
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This study introduces a method for the improvement of word vectors, addressing the limitations of traditional approaches like Word2Vec or GloVe through introducing into embeddings richer semantic properties. Our approach leverages supervised learning methods, with shifts in vectors in the representation space enhancing the quality of word embeddings. This ensures better alignment with semantic reference resources, such as WordNet. The effectiveness of the method has been demonstrated through the application of modified embeddings to text classification and clustering. We also show how our method influences document class distributions, visualized through PCA projections. By comparing our results with state-of-the-art approaches and achieving better accuracy, we confirm the effectiveness of the proposed method. The results underscore the potential of adaptive embeddings to improve both the accuracy and efficiency of semantic analysis across a range of NLP.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Semantic Visualization for Short Texts with Word Embeddings
    Le, Tuan M., V
    Lauw, Hady W.
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2074 - 2080
  • [22] Semantic Author Name Disambiguation with Word Embeddings
    Mueller, Mark-Christoph
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES (TPDL 2017), 2017, 10450 : 300 - 311
  • [23] Improving semantic similarity retrieval with word embeddings
    Yan, Fengqi
    Fan, Qiaoqing
    Lu, Mingming
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (23):
  • [24] Semantic Change Detection With Gaussian Word Embeddings
    Yuksel, Arda
    Ugurlu, Berke
    Koc, Aykut
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3349 - 3361
  • [25] Enhancing Backchannel Prediction Using Word Embeddings
    Ruede, Robin
    Mueller, Markus
    Stueker, Sebastian
    Waibel, Alex
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 879 - 883
  • [26] Improving semantic change analysis by combining word embeddings and word frequencies
    Englhardt, Adrian
    Willkomm, Jens
    Schaeler, Martin
    Boehm, Klemens
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2020, 21 (03) : 247 - 264
  • [27] Improving semantic change analysis by combining word embeddings and word frequencies
    Adrian Englhardt
    Jens Willkomm
    Martin Schäler
    Klemens Böhm
    International Journal on Digital Libraries, 2020, 21 : 247 - 264
  • [28] WETA: Automatic taxonomy alignment via word embeddings
    Giabelli, Anna
    Malandri, Lorenzo
    Mercorio, Fabio
    Mezzanzanica, Mario
    COMPUTERS IN INDUSTRY, 2022, 138
  • [29] Visual Exploration of Semantic Relationships in Neural Word Embeddings
    Liu, Shusen
    Bremer, Peer-Timo
    Thiagarajan, Jayaraman J.
    Srikumar, Vivek
    Wang, Bei
    Livnat, Yarden
    Pascucci, Valerio
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) : 553 - 562
  • [30] Semantic Equivalence in Birth Stories: Application of Word Embeddings
    Bubenhofer, Noah
    ZEITSCHRIFT FUR GERMANISTISCHE LINGUISTIK, 2020, 48 (03): : 562 - 589