Learning Discriminative Representations and Decision Boundaries for Open Intent Detection

被引:8
|
作者
Zhang, Hanlei [1 ]
Xu, Hua [1 ]
Zhao, Shaojie [1 ,2 ]
Zhou, Qianrui [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China
基金
中国国家自然科学基金;
关键词
Intent detection; open classification; natural language understanding; representation learning; deep neural network; OF-DOMAIN DETECTION; CLASSIFICATION;
D O I
10.1109/TASLP.2023.3265203
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Open intent detection is a significant problem in natural language understanding, which aims to identify the unseen open intent while ensuring known intent identification performance. However, current methods face two major challenges. Firstly, they struggle to learn friendly representations to detect the open intent with prior knowledge of only known intents. Secondly, there is a lack of an effective approach to obtaining specific and compact decision boundaries for known intents. To address these issues, this article presents an original framework called DA-ADB, which successively learns distance-aware intent representations and adaptive decision boundaries for open intent detection. Specifically, we first leverage distance information to enhance the distinguishing capability of the intent representations. Then, we design a novel loss function to obtain appropriate decision boundaries by balancing both empirical and open space risks. Extensive experiments demonstrate the effectiveness of the proposed distance-aware and boundary learning strategies. Compared to state-of-the-art methods, our framework achieves substantial improvements on three benchmark datasets. Furthermore, it yields robust performance with varying proportions of labeled data and known categories.
引用
收藏
页码:1611 / 1623
页数:13
相关论文
共 50 条
  • [41] Neural representations for quality-related kernel learning and fault detection
    Shifu Yan
    Lihua Lv
    Xuefeng Yan
    Soft Computing, 2023, 27 : 13543 - 13551
  • [42] Neural representations for quality-related kernel learning and fault detection
    Yan, Shifu
    Lv, Lihua
    Yan, Xuefeng
    SOFT COMPUTING, 2023, 27 (18) : 13543 - 13551
  • [43] Decision Fusion of Deep Learning and Shallow Learning for Marine Oil Spill Detection
    Yang, Junfang
    Ma, Yi
    Hu, Yabin
    Jiang, Zongchen
    Zhang, Jie
    Wan, Jianhua
    Li, Zhongwei
    REMOTE SENSING, 2022, 14 (03)
  • [44] Model-Agnostic Zero-Shot Intent Detection via Contrastive Transfer Learning
    Maqbool, M. H.
    Fereidouni, Moghis
    Siddique, A. B.
    Foroosh, Hassan
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2024, 18 (01) : 5 - 24
  • [45] Lymph node detection and segmentation in chest CT data using discriminative learning and a spatial prior
    Feulner, Johannes
    Zhou, S. Kevin
    Hammon, Matthias
    Hornegger, Joachim
    Comaniciu, Dorin
    MEDICAL IMAGE ANALYSIS, 2013, 17 (02) : 254 - 270
  • [46] Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making
    Ruan, Jingqing
    Wang, Kaishen
    Zhang, Qingyang
    Xing, Dengpeng
    Xu, Bo
    MACHINE INTELLIGENCE RESEARCH, 2024, 21 (04) : 782 - 800
  • [47] Discriminative Feature Learning for Unsupervised Change Detection in Heterogeneous Images Based on a Coupled Neural Network
    Zhao, Wei
    Wang, Zhirui
    Gong, Maoguo
    Liu, Jia
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (12): : 7066 - 7080
  • [48] A Weighted Discriminative Extreme Learning Machine Design for Lung Cancer Detection by an Electronic Nose System
    Zhao, Leilei
    Qian, Junhui
    Tian, Fengchun
    Liu, Ran
    Liu, Bei
    Zhang, Shuya
    Lu, Mengchen
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [49] Learning Representations from Healthcare Time Series Data for Unsupervised Anomaly Detection
    Pereira, Joao
    Silveira, Margarida
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 84 - 90
  • [50] API2Vec: Learning Representations of API Sequences for Malware Detection
    Cui, Lei
    Cui, Jiancong
    Ji, Yuede
    Hao, Zhiyu
    Li, Lun
    Ding, Zhenquan
    PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 261 - 273