An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

被引：8

作者：

Marra, Giuseppe ^{[1
,2
]}

Zugarini, Andrea ^{[1
,2
]}

Melacci, Stefano ^{[2
]}

Maggini, Marco ^{[2
]}

机构：

[1] Univ Firenze, DINFO, Florence, Italy

[2] Univ Siena, DIISM, Siena, Italy

来源：

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018年 / 11141卷

关键词：

Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning;

D O I：

10.1007/978-3-030-01424-7_13

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised large corpora, they can be transferred to different tasks with positive effects in terms of performances, especially when only a few supervisions are available. In this work, we further extend this concept, and we present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to spot the regularities that are due to the word morphology, and to avoid the need of a fixed-sized input vocabulary of words. We show that we can learn compact encoders that, despite the relatively small number of parameters, reach high-level performances in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.

引用

页码：126 / 136

页数：11

共 50 条

[1] Gated Character-aware Convolutional Neural Network for Effective Automated Essay Scoring
Bai, Huanyu
Huang, Zhilin
Hao, Anran
Hiu, Siu Cheung
2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021), 2021, : 351 - 359
[2] Unsupervised Learning of Paragraph Embeddings for Context-Aware Recommendation
Xie, Jin
Zhu, Fuxi
Huang, Minxue
Xiong, Naixue
Huang, Sheng
Xiong, Wei
IEEE ACCESS, 2019, 7 : 43100 - 43109
[3] Cluster-aware multiplex InfoMax for unsupervised graph representation learning
Xu, Xin
Du, Junping
Song, Jie
Xue, Zhe
Li, Ang
Guan, Zeli
NEUROCOMPUTING, 2023, 532 : 94 - 105
[4] Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering
Mei, Guofeng
Saltori, Cristiano
Ricci, Elisa
Sebe, Nicu
Wu, Qiang
Zhang, Jian
Poiesi, Fabio
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (08) : 3251 - 3269
[5] A Trust Aware Unsupervised Learning Approach for Insider Threat Detection
Aldairi, Maryam
Karimi, Leila
Joshi, James
2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 89 - 98
[6] Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations
Noguchi, Atsuhiro
Sun, Xiao
Lin, Stephen
Harada, Tatsuya
COMPUTER VISION - ECCV 2022, PT XVII, 2022, 13677 : 597 - 614
[7] Recommendations with context aware framework using particle swarm optimization and unsupervised learning
Jain, Parul
Dixit, Veer Sain
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4479 - 4490
[8] Unsupervised Point Cloud Representation Learning With Deep Neural Networks: A Survey
Xiao, Aoran
Huang, Jiaxing
Guan, Dayan
Zhang, Xiaoqin
Lu, Shijian
Shao, Ling
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 11321 - 11339
[9] Variational approach to unsupervised learning algorithms of neural networks
Likhovidov, V
NEURAL NETWORKS, 1997, 10 (02) : 273 - 289
[10] Graph Representation Learning for Context-Aware Network Intrusion Detection
Premkumar, Augustine
Schneider, Madeleine
Spivey, Carlton
Pavlik, John A.
Bastian, Nathaniel D.
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538

← 1 2 3 4 5 →