A survey of word embeddings based on deep learning

被引:112
|
作者
Wang, Shirui [1 ]
Zhou, Wenan [1 ]
Jiang, Chao [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Dept Comp Sci, Beijing 100876, Peoples R China
关键词
Word embeddings; Neural networks; Distributed hypothesis; Multi-source data; PERFORMANCE;
D O I
10.1007/s00607-019-00768-7
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The representational basis for downstream natural language processing tasks is word embeddings, which capture lexical semantics in numerical form to handle the abstract semantic concept of words. Recently, the word embeddings approaches, represented by deep learning, has attracted extensive attention and widely used in many tasks, such as text classification, knowledge mining, question-answering, smart Internet of Things systems and so on. These neural networks-based models are based on the distributed hypothesis while the semantic association between words can be efficiently calculated in low-dimensional space. However, the expressed semantics of most models are constrained by the context distribution of each word in the corpus while the logic and common knowledge are not better utilized. Therefore, how to use the massive multi-source data to better represent natural language and world knowledge still need to be explored. In this paper, we introduce the recent advances of neural networks-based word embeddings with their technical features, summarizing the key challenges and existing solutions, and further give a future outlook on the research and application.
引用
收藏
页码:717 / 740
页数:24
相关论文
共 50 条
  • [41] A distant supervision method based on paradigmatic relations for learning word embeddings
    Jianquan Li
    Renfen Hu
    Xiaokang Liu
    Prayag Tiwari
    Hari Mohan Pandey
    Wei Chen
    Benyou Wang
    Yaohong Jin
    Kaicheng Yang
    Neural Computing and Applications, 2020, 32 : 7759 - 7768
  • [42] A distant supervision method based on paradigmatic relations for learning word embeddings
    Li, Jianquan
    Hu, Renfen
    Liu, Xiaokang
    Tiwari, Prayag
    Pandey, Hari Mohan
    Chen, Wei
    Wang, Benyou
    Jin, Yaohong
    Yang, Kaicheng
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12): : 7759 - 7768
  • [43] Ontology Learning Based on Word Embeddings for Text Big Data Extraction
    Mahmoud, Nesma
    Elbeh, Heba
    Abdlkader, Hatem M.
    2018 14TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2018, : 183 - 188
  • [44] Detecting Malicious URLs Based on Machine Learning Algorithms and Word Embeddings
    Crisan, Andrei
    Florea, Gabriel
    Halasz, Lorand
    Lemnaru, Camelia
    Oprisa, Ciprian
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2020), 2020, : 187 - 193
  • [45] A Word Embeddings Based Clustering Approach for Collaborative Learning Group Formation
    Wu, Yongchao
    Nouri, Jalal
    Li, Xiu
    Weegar, Rebecka
    Afzaal, Muhammad
    ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2021), PT II, 2021, 12749 : 395 - 400
  • [46] Reproducibility dataset for a large experimental survey on word embeddings and ontology-based methods for word similarity
    Lastra-Diaz, Juan J.
    Goikoetxea, Josu
    Taieb, Mohamed Ali Hadj
    Garcia-Serrano, Ana
    Ben Aouicha, Mohamed
    Agirre, Eneko
    DATA IN BRIEF, 2019, 26
  • [47] Learning Word Sense Embeddings from Word Sense Definitions
    Li, Qi
    Li, Tianshi
    Chang, Baobao
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 224 - 235
  • [48] Improved Learning of Word Embeddings with Word Definitions and Semantic Injection
    Zhang, Yichi
    Dai, Yinpei
    Ou, Zhijian
    Wang, Huixin
    Feng, Junlan
    INTERSPEECH 2020, 2020, : 4253 - 4257
  • [49] A deep learning-based bilingual Hindi and Punjabi named entity recognition system using enhanced word embeddings
    Goyal, Archana
    Gupta, Vishal
    Kumar, Manish
    KNOWLEDGE-BASED SYSTEMS, 2021, 234
  • [50] Jointly learning bilingual word embeddings and alignments
    Song, Zhenqiao
    Zheng, Xiaoqing
    Huang, Xuanjing
    MACHINE TRANSLATION, 2021, 35 (04) : 551 - 569