mcBERT: Momentum Contrastive Learning with BERT for Zero-Shot Slot Filling

Cited by: 3
Authors
Heo, Seong-Hwan [1]
Lee, WonKee [2]
Lee, Jong-Hyeok [1, 2]
Affiliations
[1] Pohang Univ Sci & Technol POSTECH, Grad Sch Artificial Intelligence, Pohang, South Korea
[2] Pohang Univ Sci & Technol POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
Source
INTERSPEECH 2022 | 2022
Keywords
slot filling; zero-shot learning; momentum contrastive learning; task-oriented dialogue
DOI
10.21437/Interspeech.2022-839
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Zero-shot slot filling has received considerable attention as a way to cope with the limited data available for the target domain. An important factor in zero-shot learning is making the model learn generalized and reliable representations. To this end, we present mcBERT, which stands for 'm'omentum 'c'ontrastive learning with BERT, to develop a robust zero-shot slot filling model. mcBERT uses BERT to initialize two encoders, the query encoder and the key encoder, and is trained with momentum contrastive learning. Our experimental results on the SNIPS benchmark show that mcBERT substantially outperforms previous models, setting a new state of the art. We also show that each component of mcBERT contributes to the performance improvement.
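The abstract names two ingredients, a query/key encoder pair and momentum contrastive training, without giving details. The following is a minimal sketch of the generic MoCo-style recipe, not the authors' implementation: it assumes the key encoder's parameters are an exponential moving average of the query encoder's, and that training uses an InfoNCE loss with one positive and several negative keys. The function names, the momentum value `m`, and the `temperature` are hypothetical choices for illustration; parameters and embeddings are plain Python lists rather than BERT weights.

```python
import math


def momentum_update(query_params, key_params, m=0.999):
    """Move each key parameter toward its query counterpart (EMA update).

    key <- m * key + (1 - m) * query. The value of m is an assumption.
    """
    return [m * k + (1.0 - m) * q for q, k in zip(query_params, key_params)]


def info_nce(query, positive_key, negative_keys, temperature=0.07):
    """InfoNCE loss for one query against 1 positive and N negative keys."""

    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    def normalize(v):
        norm = math.sqrt(dot(v, v))
        return [x / norm for x in v]

    q = normalize(query)
    keys = [positive_key] + list(negative_keys)
    # Cosine similarity scaled by temperature; index 0 is the positive key.
    logits = [dot(q, normalize(k)) / temperature for k in keys]
    # Numerically stable negative log-softmax at index 0.
    max_logit = max(logits)
    log_sum = max_logit + math.log(sum(math.exp(l - max_logit) for l in logits))
    return log_sum - logits[0]
```

In this sketch only the query side would receive gradients; the key side changes solely through `momentum_update`, which keeps the key representations slowly varying between training steps.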
Pages: 1243-1247
Page count: 5