Improving Limited Labeled Dialogue State Tracking with Self-Supervision

被引:0
作者
Wu, Chien-Sheng [1 ]
Hoi, Steven [1 ]
Xiong, Caiming [1 ]
机构
[1] Salesforce Res, San Francisco, CA 94105 USA
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020 | 2020年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing dialogue state tracking (DST) models require plenty of labeled data. However, collecting high-quality labels is costly, especially when the number of domains increases. In this paper, we address a practical DST problem that is rarely discussed, i.e., learning efficiently with limited labeled data. We present and investigate two self-supervised objectives: preserving latent consistency and modeling conversational behavior. We encourage a DST model to have consistent latent distributions given a perturbed input, making it more robust to an unseen scenario. We also add an auxiliary utterance generation task, modeling a potential correlation between conversational behavior and dialogue states. The experimental results show that our proposed self-supervised signals can improve joint goal accuracy by 8.95% when only 1% labeled data is used on the MultiWOZ dataset. We can achieve an additional 1.76% improvement if some unlabeled data is jointly trained as semi-supervised learning. We analyze and visualize how our proposed self-supervised signals help the DST task and hope to stimulate future data-efficient DST research.
引用
收藏
页码:4462 / 4472
页数:11
相关论文
共 50 条
[31]   Improving Open-Set Semi-Supervised Learning with Self-Supervision [J].
Wallini, Erik ;
Svensson, Lennart ;
Kahl, Fredrik ;
Hammarstrand, Lars .
2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, 2024, :2345-2354
[32]   Robust Dialogue State Tracking with Weak Supervision and Sparse Data [J].
Heck, Michael ;
Lubis, Nurul ;
van Niekerk, Carel ;
Feng, Shutong ;
Geishauser, Christian ;
Lin, Hsien-Chin ;
Gasic, Milica .
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 :1175-1192
[33]   Disentangled Self-Supervision in Sequential Recommenders [J].
Ma, Jianxin ;
Zhou, Chang ;
Yang, Hongxia ;
Cui, Peng ;
Wang, Xin ;
Zhu, Wenwu .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :483-491
[34]   PITCH ESTIMATION VIA SELF-SUPERVISION [J].
Gfeller, Beat ;
Frank, Christian ;
Roblek, Dominik ;
Sharifi, Matt ;
Tagliasacchi, Marco ;
Velimirovic, Mihajlo .
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, :3527-3531
[35]   Hyperspherically regularized networks for self-supervision [J].
Durrant, Aiden ;
Leontidis, Georgios .
IMAGE AND VISION COMPUTING, 2022, 124
[36]   TRASS: Time Reversal as Self-Supervision [J].
Nair, Suraj ;
Babaeizadeh, Mohammad ;
Finn, Chelsea ;
Levine, Sergey ;
Kumar, Vikash .
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, :115-121
[37]   Knowledge Distillation Meets Self-supervision [J].
Xu, Guodong ;
Liu, Ziwei ;
Li, Xiaoxiao ;
Loy, Chen Change .
COMPUTER VISION - ECCV 2020, PT IX, 2020, 12354 :588-604
[38]   Extending limited datasets with GAN-like self-supervision for SMS spam detection [J].
Anidjar, Or Haim ;
Marbel, Revital ;
Dubin, Ran ;
Dvir, Amit ;
Hajaj, Chen .
COMPUTERS & SECURITY, 2024, 145
[39]   Hyperspherically regularized networks for self-supervision [J].
Durrant, Aiden ;
Leontidis, Georgios .
Image and Vision Computing, 2022, 124
[40]   The IRMA dream, self-analysis, and self-supervision [J].
Blum, H .
JOURNAL OF THE AMERICAN PSYCHOANALYTIC ASSOCIATION, 1996, 44 (02) :511-532