Enhancing Word Embeddings for Improved Semantic Alignment

被引:0
|
作者
Szymanski, Julian [1 ]
Operlejn, Maksymilian [1 ]
Weichbroth, Pawel [2 ]
机构
[1] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Dept Comp Syst Architecture, PL-80233 Gdansk, Poland
[2] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Dept Software Engn, PL-80233 Gdansk, Poland
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 24期
关键词
natural language processing; semantic ambiguity; word vector representation; Word2vec; polysemous word embedding; word sense disambiguation;
D O I
10.3390/app142411519
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This study introduces a method for the improvement of word vectors, addressing the limitations of traditional approaches like Word2Vec or GloVe through introducing into embeddings richer semantic properties. Our approach leverages supervised learning methods, with shifts in vectors in the representation space enhancing the quality of word embeddings. This ensures better alignment with semantic reference resources, such as WordNet. The effectiveness of the method has been demonstrated through the application of modified embeddings to text classification and clustering. We also show how our method influences document class distributions, visualized through PCA projections. By comparing our results with state-of-the-art approaches and achieving better accuracy, we confirm the effectiveness of the proposed method. The results underscore the potential of adaptive embeddings to improve both the accuracy and efficiency of semantic analysis across a range of NLP.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Emu: Enhancing Multilingual Sentence Embeddings with Semantic Specialization
    Hirota, Wataru
    Suhara, Yoshihiko
    Golshan, Behzad
    Tan, Wang-Chiew
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7935 - 7943
  • [42] Word Alignment by Fine-tuning Embeddings on Parallel Corpora
    Dou, Zi-Yi
    Neubig, Graham
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2112 - 2128
  • [43] Learning Diachronic Word Embeddings with Iterative Stable Information Alignment
    Lin, Zefeng
    Wan, Xiaojun
    Guo, Zongming
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 749 - 760
  • [44] Comparing Pretrained Multilingual Word Embeddings on an Ontology Alignment Task
    Gromann, Dagmar
    Declerck, Thierry
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 230 - 236
  • [45] Learning Chinese word embeddings from semantic and phonetic components
    Wang, Fu Lee
    Lu, Yuyin
    Cheng, Gary
    Xie, Haoran
    Rao, Yanghui
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42805 - 42820
  • [46] Learning Chinese word embeddings from semantic and phonetic components
    Fu Lee Wang
    Yuyin Lu
    Gary Cheng
    Haoran Xie
    Yanghui Rao
    Multimedia Tools and Applications, 2022, 81 : 42805 - 42820
  • [47] Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints
    Liu, Quan
    Jiang, Hui
    Wei, Si
    Ling, Zhen-Hua
    Hu, Yu
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 1501 - 1511
  • [48] En-Ar Bilingual word Embeddings without Word Alignment: Factors Effects
    Alqaisi, Taghreed
    O'Keefe, Simon
    FOURTH ARABIC NATURAL LANGUAGE PROCESSING WORKSHOP (WANLP 2019), 2019, : 97 - 107
  • [49] THE JOINT EFFECT OF SEMANTIC AND SYNTACTIC WORD EMBEDDINGS ON SENTIMENT ANALYSIS
    Chen, Shu
    Chen, Guang
    Wang, Wei
    PROCEEDINGS OF 2016 5TH IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2016), 2016, : 366 - 370
  • [50] Leveraging Multilingual Transfer for Unsupervised Semantic Acoustic Word Embeddings
    Jacobs, Christiaan
    Kamper, Herman
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 311 - 315