A Segment Augmentation and Prediction Consistency Framework for Multi-label Unknown Intent Detection

Times cited: 0
Authors
Yang, Jiacheng [1, 2]
Chen, Miaoxin [1]
Liu, Cao [3]
Dai, Boqi [1]
Zheng, Hai-Tao [1, 2]
Wang, Hui [2]
Xie, Rui [3]
Kim, Hong-Gee [4]
Affiliations
[1] Tsinghua University, Shenzhen International Graduate School, Shenzhen, People's Republic of China
[2] Peng Cheng Laboratory, Shenzhen, People's Republic of China
[3] Meituan, Beijing, People's Republic of China
[4] Seoul National University, Seoul, South Korea
Funding
National Natural Science Foundation of China
Keywords
Intent detection; Multi-label; Unknown intents; Dialogue system; Natural language understanding; Out-of-domain detection
DOI
10.1145/3680286
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Multi-label unknown intent detection is a challenging task in which each utterance may contain not only multiple known intents but also unknown ones. To tackle this challenge, earlier work proposed first predicting the number of intents in the utterance, then comparing that number with the results of known intent matching to decide whether the utterance contains unknown intent(s). Although these methods have made remarkable progress on the task, they still suffer from two important issues: (1) utterance encoding alone is inadequate for extracting multiple intents; (2) optimizing the two sub-tasks (intent number prediction and known intent matching) independently leads to inconsistent predictions. In this article, we propose incorporating segment augmentation, rather than relying on utterance encoding alone, to better detect multiple intents. We also design a prediction consistency module to bridge the gap between the two sub-tasks. Empirical results on the MultiWOZ2.3 and MixSNIPS datasets show that our method achieves state-of-the-art performance and significantly improves over the best baseline.
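The count-versus-match comparison described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `detect_unknown`, the `match_threshold` parameter, the score dictionary, and the intent names are all hypothetical, and the two inputs are assumed to come from separately trained sub-task models.

```python
from typing import Dict, List, Tuple


def detect_unknown(
    predicted_intent_count: int,
    known_scores: Dict[str, float],
    match_threshold: float = 0.5,  # assumed cut-off, not taken from the article
) -> Tuple[List[str], bool]:
    """Compare a predicted intent count with the set of matched known intents.

    predicted_intent_count: output of a (hypothetical) intent number predictor.
    known_scores: per-intent matching scores from a (hypothetical) known
        intent matcher, keyed by intent name.
    """
    # Known intents whose matching score clears the threshold.
    matched = sorted(
        name for name, score in known_scores.items() if score >= match_threshold
    )
    # If the utterance is predicted to hold more intents than were matched,
    # the surplus is attributed to unknown intent(s).
    has_unknown = predicted_intent_count > len(matched)
    return matched, has_unknown


scores = {"book_hotel": 0.9, "find_taxi": 0.8, "play_music": 0.1}
print(detect_unknown(3, scores))
print(detect_unknown(2, scores))
```

Because the decision depends on two independently produced predictions, a count that disagrees with the matcher (for example, a predicted count of 1 with two matched intents) has no clean interpretation; this is the kind of inconsistency the abstract's second issue refers to.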
Pages: 18