Attention-Based Joint Learning for Intent Detection and Slot Filling Using Bidirectional Long Short-Term Memory and Convolutional Neural Networks (AJLISBC)

被引:0
|
作者
Muhammad, Yusuf Idris [1 ]
Salim, Naomie [1 ]
Huspi, Sharin Hazlin [1 ]
Zainal, Anazida [1 ]
机构
[1] Univ Teknol Malaysia, Fac Comp, Skudai 81310, Malaysia
关键词
-Joint learning; intent detection; slot filling; multichannel;
D O I
10.14569/IJACSA.2024.0150890
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Effective natural language understanding is crucial for dialogue systems, requiring precise intent detection and slot filling to facilitate interactions. Traditionally, these subtasks have been addressed separately, but their interconnection suggests that joint solutions yield better results. Recent neural network-based approaches have shown significant performance in joint intent detection and slot filling tasks. The two primary neural network structures used are recurrent neural networks (RNNs) and convolutional neural networks (CNNs). RNNs capture long-term dependencies and store previous information semantics in a fixed- size vector, but their ability to extract global semantics is limited. CNNs can capture n-gram features using convolutional filters, but their performance is constrained by filter width. To leverage the strengths and mitigate the weaknesses of both networks, this paper proposes an attention-based joint learning classification for intent detection and slot filling using BiLSTM and CNNs (AJLISBC). The BiLSTM encodes input sequences in both forward and backward directions, producing high-dimensional representations. It applies scalar and vectorial attention to obtain multichannel representations, with scalar attention calculating word-level importance and vectorial attention assessing feature- level importance. For classification, AJLISBC employs a CNN structure to capture word relations in the representations generated by the attention mechanism, effectively extracting ngram features. Experimental results on the benchmark Airline Travel Information System (ATIS) dataset demonstrate that AJLISBC outperforms state-of-the-art methods.
引用
收藏
页码:915 / 922
页数:8
相关论文
共 50 条
  • [1] Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification
    Zhou, Peng
    Shi, Wei
    Tian, Jun
    Qi, Zhenyu
    Li, Bingchen
    Hao, Hongwei
    Xu, Bo
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 207 - 212
  • [2] Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling
    Liu, Bing
    Lane, Ian
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 685 - 689
  • [3] Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks
    Canlin Zhang
    Daniel Biś
    Xiuwen Liu
    Zhe He
    BMC Bioinformatics, 20
  • [4] Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks
    Zhang, Canlin
    Bis, Daniel
    Liu, Xiuwen
    He, Zhe
    BMC BIOINFORMATICS, 2019, 20 (Suppl 16)
  • [5] Urban Road Traffic Flow Prediction with Attention-Based Convolutional Bidirectional Long Short-Term Memory Networks
    Liu, Zhiquan
    Hu, Yao
    Ding, Xiangying
    TRANSPORTATION RESEARCH RECORD, 2023, 2677 (07) : 449 - 458
  • [6] Attention-based convolutional neural network and long short-term memory for short-term detection of mood disorders based on elicited speech responses
    Huang, Kun-Yi
    Wu, Chung-Hsien
    Su, Ming-Hsiang
    PATTERN RECOGNITION, 2019, 88 : 668 - 678
  • [7] Effective Attention-based Neural Architectures for Sentence Compression with Bidirectional Long Short-Term Memory
    Nhi-Thao Tran
    Viet-Thang Luong
    Ngan Luu-Thuy Nguyen
    Minh-Quoc Nghiem
    PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 123 - 130
  • [8] Speech emotion recognition based on convolutional neural network with attention-based bidirectional long short-term memory network and multi-task learning
    Liu, Zhen-Tao
    Han, Meng-Ting
    Wu, Bao-Han
    Rehman, Abdul
    APPLIED ACOUSTICS, 2023, 202
  • [9] Attention-based bidirectional-long short-term memory for abnormal human activity detection
    Manoj Kumar
    Anoop Kumar Patel
    Mantosh Biswas
    S. Shitharth
    Scientific Reports, 13
  • [10] Modeling citation worthiness by using attention-based bidirectional long short-term memory networks and interpretable models
    Tong Zeng
    Daniel E. Acuna
    Scientometrics, 2020, 124 : 399 - 428