Learning Word Representations from Scarce and Noisy Data with Embedding Sub-spaces

被引:0
|
作者
Astudillo, Ramon F. [1 ]
Amir, Silvio [1 ]
Lin, Wang [1 ]
Silva, Mario [1 ]
Trancoso, Isabel [1 ]
机构
[1] Inst Engn Sistemas & Comp Invest & Desenvolviment, Rua Alves Redol 9, Lisbon, Portugal
来源
PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 | 2015年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate a technique to adapt unsupervised word embeddings to specific applications, when only small and noisy labeled datasets are available. Current methods use pre-trained embeddings to initialize model parameters, and then use the labeled data to tailor them for the intended task. However, this approach is prone to overfitting when the training is performed with scarce and noisy data. To overcome this issue, we use the supervised data to find an embedding subspace that fits the task complexity. All the word representations are adapted through a projection into this task-specific subspace, even if they do not occur on the labeled dataset. This approach was recently used in the SemEval 2015 Twitter sentiment analysis challenge, attaining state-of-the-art results. Here we show results improving those of the challenge, as well as additional experiments in a Twitter Part-Of-Speech tagging task.
引用
收藏
页码:1074 / 1084
页数:11
相关论文
共 50 条
  • [31] Learning to Learn from Noisy Labeled Data
    Li, Junnan
    Wong, Yongkang
    Zhao, Qi
    Kankanhalli, Mohan S.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5046 - 5054
  • [32] Learning from Noisy Similar and Dissimilar Data
    Dan, Soham
    Bao, Han
    Sugiyama, Masashi
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 233 - 249
  • [33] Learning Tree Structures from Noisy Data
    Nikolakakis, Konstantinos E.
    Kalogerias, Dionysios S.
    Sarwate, Anand D.
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [34] A line-search optimization method for non-Gaussian data assimilation via random quasi-orthogonal sub-spaces
    Nino-Ruiz, Elias D.
    JOURNAL OF COMPUTATIONAL SCIENCE, 2021, 53
  • [35] Learning rule representations from data
    Apolloni, Bruno
    Brega, Andrea
    Malchiodi, Dario
    Palmas, Giorgio
    Zanaboni, Anna Maria
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2006, 36 (05): : 1010 - 1028
  • [36] NMF-BASED KEYWORD LEARNING FROM SCARCE DATA
    Ons, Bart
    Gemmeke, Jort F.
    Van Hamme, Hugo
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 392 - 397
  • [37] Batch IS NOT Heavy: Learning Word Representations From All Samples
    Xin, Xin
    Yuan, Fajie
    He, Xiangnan
    Jose, Joemon M.
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1853 - 1862
  • [38] Learning from Noisy Pairwise Similarity and Unlabeled Data
    Wu, Songhua
    Liu, Tongliang
    Han, Bo
    Yu, Jun
    Niu, Gang
    Sugiyama, Masashi
    Journal of Machine Learning Research, 2022, 23
  • [39] A TOPOLOGICAL VIEW OF UNSUPERVISED LEARNING FROM NOISY DATA
    Niyogi, P.
    Smale, S.
    Weinberger, S.
    SIAM JOURNAL ON COMPUTING, 2011, 40 (03) : 646 - 663
  • [40] LEARNING FROM NOISY DATA - AN EXACTLY SOLVABLE MODEL
    BIEHL, M
    RIEGLER, P
    STECHERT, M
    PHYSICAL REVIEW E, 1995, 52 (05) : R4624 - R4627