Improving Limited Labeled Dialogue State Tracking with Self-Supervision

被引：0

作者：

Wu, Chien-Sheng ^{[1
]}

Hoi, Steven ^{[1
]}

Xiong, Caiming ^{[1
]}

机构：

[1] Salesforce Res, San Francisco, CA 94105 USA

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020 | 2020年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing dialogue state tracking (DST) models require plenty of labeled data. However, collecting high-quality labels is costly, especially when the number of domains increases. In this paper, we address a practical DST problem that is rarely discussed, i.e., learning efficiently with limited labeled data. We present and investigate two self-supervised objectives: preserving latent consistency and modeling conversational behavior. We encourage a DST model to have consistent latent distributions given a perturbed input, making it more robust to an unseen scenario. We also add an auxiliary utterance generation task, modeling a potential correlation between conversational behavior and dialogue states. The experimental results show that our proposed self-supervised signals can improve joint goal accuracy by 8.95% when only 1% labeled data is used on the MultiWOZ dataset. We can achieve an additional 1.76% improvement if some unlabeled data is jointly trained as semi-supervised learning. We analyze and visualize how our proposed self-supervised signals help the DST task and hope to stimulate future data-efficient DST research.

引用

页码：4462 / 4472

页数：11

共 50 条

[31] Improving Open-Set Semi-Supervised Learning with Self-Supervision [J].

Wallini, Erik ;

Svensson, Lennart ;

Kahl, Fredrik ;

Hammarstrand, Lars .

2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, 2024, :2345-2354

[32] Robust Dialogue State Tracking with Weak Supervision and Sparse Data [J].

Heck, Michael ;

Lubis, Nurul ;

van Niekerk, Carel ;

Feng, Shutong ;

Geishauser, Christian ;

Lin, Hsien-Chin ;

Gasic, Milica .

TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 :1175-1192

[33] Disentangled Self-Supervision in Sequential Recommenders [J].

Ma, Jianxin ;

Zhou, Chang ;

Yang, Hongxia ;

Cui, Peng ;

Wang, Xin ;

Zhu, Wenwu .

KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :483-491

[34] PITCH ESTIMATION VIA SELF-SUPERVISION [J].

Gfeller, Beat ;

Frank, Christian ;

Roblek, Dominik ;

Sharifi, Matt ;

Tagliasacchi, Marco ;

Velimirovic, Mihajlo .

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, :3527-3531

[35] Hyperspherically regularized networks for self-supervision [J].

Durrant, Aiden ;

Leontidis, Georgios .

IMAGE AND VISION COMPUTING, 2022, 124

[36] TRASS: Time Reversal as Self-Supervision [J].

Nair, Suraj ;

Babaeizadeh, Mohammad ;

Finn, Chelsea ;

Levine, Sergey ;

Kumar, Vikash .

2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, :115-121

[37] Knowledge Distillation Meets Self-supervision [J].

Xu, Guodong ;

Liu, Ziwei ;

Li, Xiaoxiao ;

Loy, Chen Change .

COMPUTER VISION - ECCV 2020, PT IX, 2020, 12354 :588-604

[38] Extending limited datasets with GAN-like self-supervision for SMS spam detection [J].

Anidjar, Or Haim ;

Marbel, Revital ;

Dubin, Ran ;

Dvir, Amit ;

Hajaj, Chen .

COMPUTERS & SECURITY, 2024, 145

[39] Hyperspherically regularized networks for self-supervision [J].

Durrant, Aiden ;

Leontidis, Georgios .

Image and Vision Computing, 2022, 124

[40] The IRMA dream, self-analysis, and self-supervision [J].

Blum, H .

JOURNAL OF THE AMERICAN PSYCHOANALYTIC ASSOCIATION, 1996, 44 (02) :511-532

← 1 2 3 4 5 →