A survey of word embeddings based on deep learning

被引:112
|
作者
Wang, Shirui [1 ]
Zhou, Wenan [1 ]
Jiang, Chao [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Dept Comp Sci, Beijing 100876, Peoples R China
关键词
Word embeddings; Neural networks; Distributed hypothesis; Multi-source data; PERFORMANCE;
D O I
10.1007/s00607-019-00768-7
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The representational basis for downstream natural language processing tasks is word embeddings, which capture lexical semantics in numerical form to handle the abstract semantic concept of words. Recently, the word embeddings approaches, represented by deep learning, has attracted extensive attention and widely used in many tasks, such as text classification, knowledge mining, question-answering, smart Internet of Things systems and so on. These neural networks-based models are based on the distributed hypothesis while the semantic association between words can be efficiently calculated in low-dimensional space. However, the expressed semantics of most models are constrained by the context distribution of each word in the corpus while the logic and common knowledge are not better utilized. Therefore, how to use the massive multi-source data to better represent natural language and world knowledge still need to be explored. In this paper, we introduce the recent advances of neural networks-based word embeddings with their technical features, summarizing the key challenges and existing solutions, and further give a future outlook on the research and application.
引用
收藏
页码:717 / 740
页数:24
相关论文
共 50 条
  • [1] A survey of word embeddings based on deep learning
    Shirui Wang
    Wenan Zhou
    Chao Jiang
    Computing, 2020, 102 : 717 - 740
  • [2] Arabic Sentiment Analysis Based on Word Embeddings and Deep Learning
    Elhassan, Nasrin
    Varone, Giuseppe
    Ahmed, Rami
    Gogate, Mandar
    Dashtipour, Kia
    Almoamari, Hani
    El-Affendi, Mohammed A.
    Al-Tamimi, Bassam Naji
    Albalwy, Faisal
    Hussain, Amir
    COMPUTERS, 2023, 12 (06)
  • [3] Genre Classification using Word Embeddings and Deep Learning
    Kumar, Akshi
    Rajpal, Arjun
    Rathore, Dushyant
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 2142 - 2146
  • [4] Fall Detection in EHR using Word Embeddings and Deep Learning
    dos Santos, Henrique D. P.
    Silva, Amanda P.
    Maciel, Maria Carolina O.
    Burin, Haline Maria V.
    Urbanetto, Janete S.
    Vieira, Renata
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 265 - 268
  • [5] Lexical Function Identification Using Word Embeddings and Deep Learning
    Hernandez-Miranda, Arturo
    Gelbukh, Alexander
    Kolesnikova, Olga
    ADVANCES IN SOFT COMPUTING, MICAI 2019, 2019, 11835 : 77 - 86
  • [6] Beyond word embeddings: A survey
    Incitti, Francesca
    Urli, Federico
    Snidaro, Lauro
    INFORMATION FUSION, 2023, 89 : 418 - 436
  • [7] Word Embeddings: A Comprehensive Survey
    Pak, Alexandr
    Ziyaden, Atabay
    Saparov, Timur
    Akhmetov, Iskander
    Gelbukh, Alexander
    COMPUTACION Y SISTEMAS, 2024, 28 (04): : 2005 - 2029
  • [8] An optimized hybrid deep learning model based on word embeddings and statistical features for extractive summarization
    Wazery, Yaser M.
    Saleh, Marwa E.
    Ali, Abdelmgeid A.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (07)
  • [9] Deep learning with word embeddings improves biomedical named entity recognition
    Habibi, Maryam
    Weber, Leon
    Neves, Mariana
    Wiegandt, David Luis
    Leser, Ulf
    BIOINFORMATICS, 2017, 33 (14) : I37 - I48
  • [10] Training Word Embeddings for Deep Learning in Biomedical Text Mining Tasks
    Jiang, Zhenchao
    Li, Lishuang
    Huang, Degen
    Jin, Liuke
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 625 - 628