An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

Cited by: 8
Authors
Marra, Giuseppe [1, 2]
Zugarini, Andrea [1, 2]
Melacci, Stefano [2]
Maggini, Marco [2]
Affiliations
[1] Univ Firenze, DINFO, Florence, Italy
[2] Univ Siena, DIISM, Siena, Italy
Source
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018, Vol. 11141
Keywords
Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning
DOI
10.1007/978-3-030-01424-7_13
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In the last few years, neural networks have been used intensively to develop meaningful distributed representations of words and of the contexts around them. When these representations, also known as "embeddings", are learned from large unlabeled corpora, they can be transferred to different tasks with positive effects on performance, especially when only a few labeled examples are available. In this work, we extend this concept further and present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to capture the regularities that are due to word morphology and avoids the need for a fixed-size input vocabulary of words. We show that we can learn compact encoders that, despite their relatively small number of parameters, achieve strong performance in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.
Pages: 126-136
Number of pages: 11