A survey of word embeddings based on deep learning

被引:112
|
作者
Wang, Shirui [1 ]
Zhou, Wenan [1 ]
Jiang, Chao [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Dept Comp Sci, Beijing 100876, Peoples R China
关键词
Word embeddings; Neural networks; Distributed hypothesis; Multi-source data; PERFORMANCE;
D O I
10.1007/s00607-019-00768-7
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The representational basis for downstream natural language processing tasks is word embeddings, which capture lexical semantics in numerical form to handle the abstract semantic concept of words. Recently, the word embeddings approaches, represented by deep learning, has attracted extensive attention and widely used in many tasks, such as text classification, knowledge mining, question-answering, smart Internet of Things systems and so on. These neural networks-based models are based on the distributed hypothesis while the semantic association between words can be efficiently calculated in low-dimensional space. However, the expressed semantics of most models are constrained by the context distribution of each word in the corpus while the logic and common knowledge are not better utilized. Therefore, how to use the massive multi-source data to better represent natural language and world knowledge still need to be explored. In this paper, we introduce the recent advances of neural networks-based word embeddings with their technical features, summarizing the key challenges and existing solutions, and further give a future outlook on the research and application.
引用
收藏
页码:717 / 740
页数:24
相关论文
共 50 条
  • [21] A survey of word embeddings for clinical text
    Khattak F.K.
    Jeblee S.
    Pou-Prom C.
    Abdalla M.
    Meaney C.
    Rudzicz F.
    Journal of Biomedical Informatics: X, 2019, 4
  • [22] DeepD2V-Deep Learning and Domain Word Embeddings for DGA based Malware Detection
    Torrealba Aravena, Lucas
    Casas, Pedro
    Bustos-Jimenez, Javier
    Capdehourat, German
    Findrik, Mislav
    2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 164 - 170
  • [23] Using Word Embeddings and Deep Learning for Supervised Topic Detection in Social Networks
    Gutierrez-Batista, Karel
    Campana, Jesus R.
    Vila, Maria-Amparo
    Martin-Bautista, Maria J.
    FLEXIBLE QUERY ANSWERING SYSTEMS, 2019, 11529 : 155 - 165
  • [24] Deep Learning Architecture for Part-of-Speech Tagging with Word and Suffix Embeddings
    Popov, Alexander
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2016, 2016, 9883 : 68 - 77
  • [25] Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints
    Liu, Quan
    Jiang, Hui
    Wei, Si
    Ling, Zhen-Hua
    Hu, Yu
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 1501 - 1511
  • [26] Learning Word Embeddings for Aspect-Based Sentiment Analysis
    Duc-Hong Pham
    Anh-Cuong Le
    Thi-Kim-Chung Le
    COMPUTATIONAL LINGUISTICS, PACLING 2017, 2018, 781 : 28 - 40
  • [27] Integrating word embeddings and document topics with deep learning in a video classification framework
    Kastrati, Zenun
    Imran, Ali Shariq
    Kurti, Arianit
    PATTERN RECOGNITION LETTERS, 2019, 128 : 85 - 92
  • [28] Joint Learning of Character and Word Embeddings
    Chen, Xinxiong
    Xu, Lei
    Liu, Zhiyuan
    Sun, Maosong
    Luan, Huanbo
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1236 - 1242
  • [29] Joint Learning of Sense and Word Embeddings
    Alsuhaibani, Mohammed
    Bollegala, Danushka
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 223 - 229
  • [30] Learning Word Embeddings in Parallel by Alignment
    Zubair, Sahil
    Zubair, Mohammad
    2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 566 - 571