An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

被引:8
|
作者
Marra, Giuseppe [1 ,2 ]
Zugarini, Andrea [1 ,2 ]
Melacci, Stefano [2 ]
Maggini, Marco [2 ]
机构
[1] Univ Firenze, DINFO, Florence, Italy
[2] Univ Siena, DIISM, Siena, Italy
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018年 / 11141卷
关键词
Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning;
D O I
10.1007/978-3-030-01424-7_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised large corpora, they can be transferred to different tasks with positive effects in terms of performances, especially when only a few supervisions are available. In this work, we further extend this concept, and we present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to spot the regularities that are due to the word morphology, and to avoid the need of a fixed-sized input vocabulary of words. We show that we can learn compact encoders that, despite the relatively small number of parameters, reach high-level performances in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.
引用
收藏
页码:126 / 136
页数:11
相关论文
共 50 条
  • [21] Enhancing Sindhi Word Segmentation Using Subword Representation Learning and Position-Aware Self-Attention
    Ali, Wazir
    Kumar, Jay
    Tumani, Saifullah
    Nour, Redhwan
    Noor, Adeeb
    Xu, Zenglin
    IEEE ACCESS, 2024, 12 : 183133 - 183142
  • [22] Unsupervised representation learning with Hebbian synaptic and structural plasticity in brain-like feedforward neural networks
    Ravichandran, Naresh
    Lansner, Anders
    Herman, Pawel
    NEUROCOMPUTING, 2025, 626
  • [23] Unsupervised Learning for Identifying High Eigenvector Centrality Nodes: A Graph Neural Network Approach
    Rakaraddi, Appan
    Pratama, Mahardhika
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4945 - 4954
  • [24] A context aware-based deep neural network approach for simultaneous speech denoising and dereverberation
    Sidheswar Routray
    Qirong Mao
    Neural Computing and Applications, 2022, 34 : 9831 - 9845
  • [25] A context aware-based deep neural network approach for simultaneous speech denoising and dereverberation
    Routray, Sidheswar
    Mao, Qirong
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (12) : 9831 - 9845
  • [26] A subject-specific unsupervised deep learning method for quantitative susceptibility mapping using implicit neural representation
    Zhang, Ming
    Feng, Ruimin
    Li, Zhenghao
    Feng, Jie
    Wu, Qing
    Zhang, Zhiyong
    Ma, Chengxin
    Wu, Jinsong
    Yan, Fuhua
    Liu, Chunlei
    Zhang, Yuyao
    Wei, Hongjiang
    MEDICAL IMAGE ANALYSIS, 2024, 95
  • [27] Hyperparameter optimization through context-based meta-reinforcement learning with task-aware representation
    Wu, Jia
    Liu, Xiyuan
    Chen, Senpeng
    KNOWLEDGE-BASED SYSTEMS, 2023, 260
  • [28] Medical nearest-word embedding technique implemented using an unsupervised machine learning approach for Bengali language
    Mandal, Kailash Pati
    Mukherjee, Prasenjit
    Vishnu, Devraj
    Chakraborty, Baisakhi
    Choudhury, Tanupriya
    Arya, Pradeep Kumar
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2024, 17 (01):
  • [29] Mobile Deep Learning: Exploring Deep Neural Network for Predicting Context-Aware Smartphone Usage
    Sarker I.H.
    Abushark Y.B.
    Khan A.I.
    Alam M.M.
    Nowrozy R.
    SN Computer Science, 2021, 2 (3)
  • [30] A novel context-aware recommender system based on a deep sequential learning approach (CReS)
    Tipajin Thaipisutikul
    Timothy K. Shih
    Neural Computing and Applications, 2021, 33 : 11067 - 11090