Attention-Based Joint Learning for Intent Detection and Slot Filling Using Bidirectional Long Short-Term Memory and Convolutional Neural Networks (AJLISBC)

被引:0
|
作者
Muhammad, Yusuf Idris [1 ]
Salim, Naomie [1 ]
Huspi, Sharin Hazlin [1 ]
Zainal, Anazida [1 ]
机构
[1] Univ Teknol Malaysia, Fac Comp, Skudai 81310, Malaysia
关键词
Joint learning; intent detection; slot filling; multichannel; LSTM; CNN;
D O I
10.14569/IJACSA.2024.0150890
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Effective natural language understanding is crucial for dialogue systems, requiring precise intent detection and slot filling to facilitate interactions. Traditionally, these subtasks have been addressed separately, but their interconnection suggests that joint solutions yield better results. Recent neural network-based approaches have shown significant performance in joint intent detection and slot filling tasks. The two primary neural network structures used are recurrent neural networks (RNNs) and convolutional neural networks (CNNs). RNNs capture long-term dependencies and store previous information semantics in a fixed-size vector, but their ability to extract global semantics is limited. CNNs can capture n-gram features using convolutional filters, but their performance is constrained by filter width. To leverage the strengths and mitigate the weaknesses of both networks, this paper proposes an attention-based joint learning classification for intent detection and slot filling using BiLSTM and CNNs (AJLISBC). The BiLSTM encodes input sequences in both forward and backward directions, producing high-dimensional representations. It applies scalar and vectorial attention to obtain multichannel representations, with scalar attention calculating word-level importance and vectorial attention assessing feature-level importance. For classification, AJLISBC employs a CNN structure to capture word relations in the representations generated by the attention mechanism, effectively extracting n-gram features. Experimental results on the benchmark Airline Travel Information System (ATIS) dataset demonstrate that AJLISBC outperforms state-of-the-art methods.
引用
收藏
页码:915 / 922
页数:8
相关论文
共 50 条
  • [1] Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling
    Liu, Bing
    Lane, Ian
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 685 - 689
  • [2] Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks
    Canlin Zhang
    Daniel Biś
    Xiuwen Liu
    Zhe He
    BMC Bioinformatics, 20
  • [3] Speech emotion recognition based on convolutional neural network with attention-based bidirectional long short-term memory network and multi-task learning
    Liu, Zhen-Tao
    Han, Meng-Ting
    Wu, Bao-Han
    Rehman, Abdul
    APPLIED ACOUSTICS, 2023, 202
  • [4] Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks
    Zhang, Canlin
    Bis, Daniel
    Liu, Xiuwen
    He, Zhe
    BMC BIOINFORMATICS, 2019, 20 (Suppl 16)
  • [5] Effective Attention-based Neural Architectures for Sentence Compression with Bidirectional Long Short-Term Memory
    Nhi-Thao Tran
    Viet-Thang Luong
    Ngan Luu-Thuy Nguyen
    Minh-Quoc Nghiem
    PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 123 - 130
  • [6] Attention-based bidirectional-long short-term memory for abnormal human activity detection
    Kumar, Manoj
    Patel, Anoop Kumar
    Biswas, Mantosh
    Shitharth, S.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [7] CONVOLUTIONAL NEURAL NETWORK BASED TRIANGULAR CRF FOR JOINT INTENT DETECTION AND SLOT FILLING
    Xu, Puyang
    Sarikaya, Ruhi
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 78 - 83
  • [8] Short-Term Traffic Congestion Forecasting Using Attention-Based Long Short-Term Memory Recurrent Neural Network
    Zhang, Tianlin
    Liu, Ying
    Cui, Zhenyu
    Leng, Jiaxu
    Xie, Weihong
    Zhang, Liang
    COMPUTATIONAL SCIENCE - ICCS 2019, PT III, 2019, 11538 : 304 - 314
  • [9] Hybrid Deep Learning Network Intrusion Detection System Based on Convolutional Neural Network and Bidirectional Long Short-Term Memory
    Jihado, Anindra Ageng
    Girsang, Abba Suganda
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (02) : 219 - 232
  • [10] Attention-based convolutional long short-term memory neural network for detection of patient-ventilator asynchrony from mechanical ventilation
    Chen, Dingfu
    Lin, Kangwei
    Deng, Ziheng
    Li, Dayu
    Deng, Qingxu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 78