An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

被引：8

作者：

Marra, Giuseppe ^{[1
,2
]}

Zugarini, Andrea ^{[1
,2
]}

Melacci, Stefano ^{[2
]}

Maggini, Marco ^{[2
]}

机构：

[1] Univ Firenze, DINFO, Florence, Italy

[2] Univ Siena, DIISM, Siena, Italy

来源：

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018年 / 11141卷

关键词：

Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning;

D O I：

10.1007/978-3-030-01424-7_13

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised large corpora, they can be transferred to different tasks with positive effects in terms of performances, especially when only a few supervisions are available. In this work, we further extend this concept, and we present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to spot the regularities that are due to the word morphology, and to avoid the need of a fixed-sized input vocabulary of words. We show that we can learn compact encoders that, despite the relatively small number of parameters, reach high-level performances in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.

引用

页码：126 / 136

页数：11

共 50 条

[21] Enhancing Sindhi Word Segmentation Using Subword Representation Learning and Position-Aware Self-Attention
Ali, Wazir
Kumar, Jay
Tumani, Saifullah
Nour, Redhwan
Noor, Adeeb
Xu, Zenglin
IEEE ACCESS, 2024, 12 : 183133 - 183142
[22] Unsupervised representation learning with Hebbian synaptic and structural plasticity in brain-like feedforward neural networks
Ravichandran, Naresh
Lansner, Anders
Herman, Pawel
NEUROCOMPUTING, 2025, 626
[23] Unsupervised Learning for Identifying High Eigenvector Centrality Nodes: A Graph Neural Network Approach
Rakaraddi, Appan
Pratama, Mahardhika
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4945 - 4954
[24] A context aware-based deep neural network approach for simultaneous speech denoising and dereverberation
Sidheswar Routray
Qirong Mao
Neural Computing and Applications, 2022, 34 : 9831 - 9845
[25] A context aware-based deep neural network approach for simultaneous speech denoising and dereverberation
Routray, Sidheswar
Mao, Qirong
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (12) : 9831 - 9845
[26] A subject-specific unsupervised deep learning method for quantitative susceptibility mapping using implicit neural representation
Zhang, Ming
Feng, Ruimin
Li, Zhenghao
Feng, Jie
Wu, Qing
Zhang, Zhiyong
Ma, Chengxin
Wu, Jinsong
Yan, Fuhua
Liu, Chunlei
Zhang, Yuyao
Wei, Hongjiang
MEDICAL IMAGE ANALYSIS, 2024, 95
[27] Hyperparameter optimization through context-based meta-reinforcement learning with task-aware representation
Wu, Jia
Liu, Xiyuan
Chen, Senpeng
KNOWLEDGE-BASED SYSTEMS, 2023, 260
[28] Medical nearest-word embedding technique implemented using an unsupervised machine learning approach for Bengali language
Mandal, Kailash Pati
Mukherjee, Prasenjit
Vishnu, Devraj
Chakraborty, Baisakhi
Choudhury, Tanupriya
Arya, Pradeep Kumar
INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2024, 17 (01):
[29] Mobile Deep Learning: Exploring Deep Neural Network for Predicting Context-Aware Smartphone Usage
Sarker I.H.
Abushark Y.B.
Khan A.I.
Alam M.M.
Nowrozy R.
SN Computer Science, 2021, 2 (3)
[30] A novel context-aware recommender system based on a deep sequential learning approach (CReS)
Tipajin Thaipisutikul
Timothy K. Shih
Neural Computing and Applications, 2021, 33 : 11067 - 11090

← 1 2 3 4 5 →