Learning Word Representations from Scarce and Noisy Data with Embedding Sub-spaces

Cited by: 0

Authors:

Astudillo, Ramon F. [1]

Amir, Silvio [1]

Lin, Wang [1]

Silva, Mario [1]

Trancoso, Isabel [1]

Affiliations:

[1] Inst Engn Sistemas & Comp Invest & Desenvolviment, Rua Alves Redol 9, Lisbon, Portugal

Source:

PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 | 2015

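Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract: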

We investigate a technique to adapt unsupervised word embeddings to specific applications when only small and noisy labeled datasets are available. Current methods use pre-trained embeddings to initialize model parameters, and then use the labeled data to tailor them for the intended task. However, this approach is prone to overfitting when training is performed with scarce and noisy data. To overcome this issue, we use the supervised data to find an embedding subspace that fits the task complexity. All the word representations are adapted through a projection into this task-specific subspace, even if they do not occur in the labeled dataset. This approach was recently used in the SemEval 2015 Twitter sentiment analysis challenge, attaining state-of-the-art results. Here we report results that improve on those of the challenge, as well as additional experiments in a Twitter Part-Of-Speech tagging task.
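
The subspace idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `E`, `S`, `Y`, and `predict`, the dimensions, the sigmoid non-linearity, and the sum pooling are all assumptions made for illustration, and the parameters are random rather than trained. The sketch only shows that a single small projection matrix adapts every word vector, including words absent from the labeled data, while the number of task-specific parameters stays far below the size of the embedding table.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes chosen for illustration only.
vocab_size, embed_dim, sub_dim, n_classes = 1000, 50, 5, 3

# Stand-in for pre-trained unsupervised embeddings (kept frozen).
E = rng.normal(size=(embed_dim, vocab_size))

# Task-specific parameters that would be learned from the labeled data
# (randomly initialised here; no training is performed in this sketch).
S = rng.normal(scale=0.1, size=(sub_dim, embed_dim))  # projection into the subspace
Y = rng.normal(scale=0.1, size=(n_classes, sub_dim))  # classifier on the subspace


def predict(word_ids):
    """Class probabilities for a message given as a list of word indices."""
    projected = S @ E[:, word_ids]             # shape (sub_dim, n_words)
    hidden = 1.0 / (1.0 + np.exp(-projected))  # element-wise sigmoid (assumed)
    pooled = hidden.sum(axis=1)                # bag-of-words sum (assumed)
    scores = Y @ pooled
    scores = scores - scores.max()             # numerical stability
    probs = np.exp(scores)
    return probs / probs.sum()


# Every word is adapted through S, whether or not it occurred in the labeled data.
adapted = S @ E  # shape (sub_dim, vocab_size)

probs = predict([3, 17, 256])
n_trainable = S.size + Y.size  # parameters fitted to the task
n_full = E.size                # parameters touched if E itself were fine-tuned
```

With these assumed sizes, the task-specific parameters number in the hundreds, whereas fine-tuning the embedding table directly would update tens of thousands of values, which is the overfitting risk the abstract points to for scarce and noisy data.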


Pages: 1074-1084

Page count: 11
