A survey of word embeddings based on deep learning

Cited by: 112
Authors
Wang, Shirui [1]
Zhou, Wenan [1]
Jiang, Chao [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Dept Comp Sci, Beijing 100876, Peoples R China
Keywords
Word embeddings; Neural networks; Distributed hypothesis; Multi-source data; PERFORMANCE;
DOI
10.1007/s00607-019-00768-7
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Word embeddings are the representational basis for downstream natural language processing tasks: they capture lexical semantics in numerical form to handle the abstract semantic concepts of words. Recently, word embedding approaches based on deep learning have attracted extensive attention and have been widely used in many tasks, such as text classification, knowledge mining, question answering, and smart Internet of Things systems. These neural network-based models rest on the distributed hypothesis, and the semantic association between words can be calculated efficiently in a low-dimensional space. However, the semantics expressed by most models are constrained by the context distribution of each word in the corpus, while logic and common knowledge are not well utilized. Therefore, how to use massive multi-source data to better represent natural language and world knowledge still needs to be explored. In this paper, we introduce recent advances in neural network-based word embeddings along with their technical features, summarize the key challenges and existing solutions, and give a future outlook on research and application.
Pages: 717-740
Page count: 24
Related papers
50 in total
  • [31] Learning Word Meta-Embeddings
    Yin, Wenpeng
    Schuetze, Hinrich
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 1351 - 1360
  • [32] Classification of Arabic healthcare questions based on word embeddings learned from massive consultations: a deep learning approach
    Faris, Hossam
    Habib, Maria
    Faris, Mohammad
    Alomari, Alaa
    Castillo, Pedro A.
    Alomari, Manal
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (04) : 1811 - 1827
  • [33] Sentiment classification and aspect-based sentiment analysis on yelp reviews using deep learning and word embeddings
    Alamoudi, Eman Saeed
    Alghamdi, Norah Saleh
    JOURNAL OF DECISION SYSTEMS, 2021, 30 (2-3) : 259 - 281
  • [34] Classification of Arabic healthcare questions based on word embeddings learned from massive consultations: a deep learning approach
    Hossam Faris
    Maria Habib
    Mohammad Faris
    Alaa Alomari
    Pedro A. Castillo
    Manal Alomari
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 1811 - 1827
  • [35] DEEP WORD EMBEDDINGS FOR VISUAL SPEECH RECOGNITION
    Stafylakis, Themos
    Tzimiropoulos, Georgios
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4974 - 4978
  • [36] Adjusting Word Embeddings by Deep Neural Networks
    Gao, Xiaoyang
    Ichise, Ryutaro
    ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 398 - 406
  • [37] Deep learning in law: early adaptation and legal word embeddings trained on large corpora
    Ilias Chalkidis
    Dimitrios Kampas
    Artificial Intelligence and Law, 2019, 27 : 171 - 198
  • [38] Deep learning in law: early adaptation and legal word embeddings trained on large corpora
    Chalkidis, Ilias
    Kampas, Dimitrios
    ARTIFICIAL INTELLIGENCE AND LAW, 2019, 27 (02) : 171 - 198
  • [39] Enhanced classification of crisis related tweets using deep learning models and word embeddings
    Ramachandran D.
    Parvathi R.
    Ramachandran, Dharini (dharini.r2014@vit.ac.in), 1600, Inderscience Publishers (16): : 158 - 186
  • [40] Detecting Misinformation in COVID-19 Content: A Machine Learning and Deep Learning Approach with Word Embeddings
    Arati Chabukswar
    P. Deepa Shenoy
    K. R. Venugopal
    SN Computer Science, 6 (1)