An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

被引:8
|
作者
Marra, Giuseppe [1 ,2 ]
Zugarini, Andrea [1 ,2 ]
Melacci, Stefano [2 ]
Maggini, Marco [2 ]
机构
[1] Univ Firenze, DINFO, Florence, Italy
[2] Univ Siena, DIISM, Siena, Italy
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018年 / 11141卷
关键词
Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning;
D O I
10.1007/978-3-030-01424-7_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised large corpora, they can be transferred to different tasks with positive effects in terms of performances, especially when only a few supervisions are available. In this work, we further extend this concept, and we present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to spot the regularities that are due to the word morphology, and to avoid the need of a fixed-sized input vocabulary of words. We show that we can learn compact encoders that, despite the relatively small number of parameters, reach high-level performances in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.
引用
收藏
页码:126 / 136
页数:11
相关论文
共 50 条
  • [41] Context-Aware Transfer Learning Approach to Detect Informative Social Media Content for Disaster Management
    Saleem, Saima
    Mehrotra, Monica
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 680 - 689
  • [42] An Optimal Approach to Enhance Context Aware Description Administration Service for Cloud Robots in a Deep Learning Environment
    Subha, R.
    Haldorai, Anandakumar
    Ramu, Arulmurugan
    WIRELESS PERSONAL COMMUNICATIONS, 2021, 117 (04) : 3343 - 3358
  • [43] An Optimal Approach to Enhance Context Aware Description Administration Service for Cloud Robots in a Deep Learning Environment
    R. Subha
    Anandakumar Haldorai
    Arulmurugan Ramu
    Wireless Personal Communications, 2021, 117 : 3343 - 3358
  • [44] An Unsupervised Deep Neural Network Approach Based on Ensemble Learning to Suppress Seismic Surface-Related Multiples
    Wang, Kunxi
    Hu, Tianyue
    Zhao, Bangliu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [45] A novel spatial-aware deep learning approach for exploring the environmental context of terrorist attacks and armed conflicts
    Zhao, Zhan'ao
    Liu, Kai
    Wang, Ming
    INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION, 2024, 114
  • [46] CWI: A multimodal deep learning approach for named entity recognition from social media using character, word and image features
    Asgari-Chenaghlu, Meysam
    Feizi-Derakhshi, M. Reza
    Farzinvash, Leili
    Balafar, M. A.
    Motamed, Cina
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (03) : 1905 - 1922
  • [47] CWI: A multimodal deep learning approach for named entity recognition from social media using character, word and image features
    Meysam Asgari-Chenaghlu
    M. Reza Feizi-Derakhshi
    Leili Farzinvash
    M. A. Balafar
    Cina Motamed
    Neural Computing and Applications, 2022, 34 : 1905 - 1922
  • [48] Improving Bug Detection via Context-Based Code Representation Learning and Attention-Based Neural Networks
    Li, Yi
    Wang, Shaohua
    Nguyen, Tien N.
    Son Van Nguyen
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2019, 3 (OOPSLA):
  • [49] A robust hybrid approach with product context-aware learning and explainable AI for sentiment analysis in Amazon user reviews
    Hashmi, Ehtesham
    Yayilgan, Sule Yildirim
    ELECTRONIC COMMERCE RESEARCH, 2024,
  • [50] Towards a Deep Learning-Driven Service Discovery Framework for the Social Internet of Things: a Context-Aware Approach
    Aljubairy, Abdulwahab
    Alhazmi, Ahoud
    Zhang, Wei Emma
    Sheng, Quan Z.
    Tran, Dai Hoang
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2021, PT II, 2021, 13081 : 480 - 488