An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

被引:8
|
作者
Marra, Giuseppe [1 ,2 ]
Zugarini, Andrea [1 ,2 ]
Melacci, Stefano [2 ]
Maggini, Marco [2 ]
机构
[1] Univ Firenze, DINFO, Florence, Italy
[2] Univ Siena, DIISM, Siena, Italy
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018年 / 11141卷
关键词
Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning;
D O I
10.1007/978-3-030-01424-7_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised large corpora, they can be transferred to different tasks with positive effects in terms of performances, especially when only a few supervisions are available. In this work, we further extend this concept, and we present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to spot the regularities that are due to the word morphology, and to avoid the need of a fixed-sized input vocabulary of words. We show that we can learn compact encoders that, despite the relatively small number of parameters, reach high-level performances in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.
引用
收藏
页码:126 / 136
页数:11
相关论文
共 50 条
  • [1] Gated Character-aware Convolutional Neural Network for Effective Automated Essay Scoring
    Bai, Huanyu
    Huang, Zhilin
    Hao, Anran
    Hiu, Siu Cheung
    2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021), 2021, : 351 - 359
  • [2] Unsupervised Learning of Paragraph Embeddings for Context-Aware Recommendation
    Xie, Jin
    Zhu, Fuxi
    Huang, Minxue
    Xiong, Naixue
    Huang, Sheng
    Xiong, Wei
    IEEE ACCESS, 2019, 7 : 43100 - 43109
  • [3] Cluster-aware multiplex InfoMax for unsupervised graph representation learning
    Xu, Xin
    Du, Junping
    Song, Jie
    Xue, Zhe
    Li, Ang
    Guan, Zeli
    NEUROCOMPUTING, 2023, 532 : 94 - 105
  • [4] Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering
    Mei, Guofeng
    Saltori, Cristiano
    Ricci, Elisa
    Sebe, Nicu
    Wu, Qiang
    Zhang, Jian
    Poiesi, Fabio
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (08) : 3251 - 3269
  • [5] A Trust Aware Unsupervised Learning Approach for Insider Threat Detection
    Aldairi, Maryam
    Karimi, Leila
    Joshi, James
    2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 89 - 98
  • [6] Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations
    Noguchi, Atsuhiro
    Sun, Xiao
    Lin, Stephen
    Harada, Tatsuya
    COMPUTER VISION - ECCV 2022, PT XVII, 2022, 13677 : 597 - 614
  • [7] Recommendations with context aware framework using particle swarm optimization and unsupervised learning
    Jain, Parul
    Dixit, Veer Sain
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4479 - 4490
  • [8] Unsupervised Point Cloud Representation Learning With Deep Neural Networks: A Survey
    Xiao, Aoran
    Huang, Jiaxing
    Guan, Dayan
    Zhang, Xiaoqin
    Lu, Shijian
    Shao, Ling
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 11321 - 11339
  • [9] Variational approach to unsupervised learning algorithms of neural networks
    Likhovidov, V
    NEURAL NETWORKS, 1997, 10 (02) : 273 - 289
  • [10] Graph Representation Learning for Context-Aware Network Intrusion Detection
    Premkumar, Augustine
    Schneider, Madeleine
    Spivey, Carlton
    Pavlik, John A.
    Bastian, Nathaniel D.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538