Attention-Based Joint Learning for Intent Detection and Slot Filling Using Bidirectional Long Short-Term Memory and Convolutional Neural Networks (AJLISBC)

被引:0
|
作者
Muhammad, Yusuf Idris [1 ]
Salim, Naomie [1 ]
Huspi, Sharin Hazlin [1 ]
Zainal, Anazida [1 ]
机构
[1] Univ Teknol Malaysia, Fac Comp, Skudai 81310, Malaysia
关键词
-Joint learning; intent detection; slot filling; multichannel;
D O I
10.14569/IJACSA.2024.0150890
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Effective natural language understanding is crucial for dialogue systems, requiring precise intent detection and slot filling to facilitate interactions. Traditionally, these subtasks have been addressed separately, but their interconnection suggests that joint solutions yield better results. Recent neural network-based approaches have shown significant performance in joint intent detection and slot filling tasks. The two primary neural network structures used are recurrent neural networks (RNNs) and convolutional neural networks (CNNs). RNNs capture long-term dependencies and store previous information semantics in a fixed- size vector, but their ability to extract global semantics is limited. CNNs can capture n-gram features using convolutional filters, but their performance is constrained by filter width. To leverage the strengths and mitigate the weaknesses of both networks, this paper proposes an attention-based joint learning classification for intent detection and slot filling using BiLSTM and CNNs (AJLISBC). The BiLSTM encodes input sequences in both forward and backward directions, producing high-dimensional representations. It applies scalar and vectorial attention to obtain multichannel representations, with scalar attention calculating word-level importance and vectorial attention assessing feature- level importance. For classification, AJLISBC employs a CNN structure to capture word relations in the representations generated by the attention mechanism, effectively extracting ngram features. Experimental results on the benchmark Airline Travel Information System (ATIS) dataset demonstrate that AJLISBC outperforms state-of-the-art methods.
引用
收藏
页码:915 / 922
页数:8
相关论文
共 50 条
  • [31] Attention-based bidirectional long short-term memory networks for extracting temporal relationships from clinical discharge summaries
    Alfattni, Ghada
    Peek, Niels
    Nenadic, Goran
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 123
  • [32] Improving Mandarin Tone Recognition using Convolutional Bidirectional Long Short-Term Memory with Attention
    Yang, Longfei
    Xie, Yanlu
    Zhang, Jinsong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 352 - 356
  • [33] Twitter Bot Detection Using Bidirectional Long Short-term Memory Neural Networks and Word Embeddings
    Wei, Feng
    Uyen Trang Nguyen
    2019 FIRST IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS (TPS-ISA 2019), 2019, : 101 - 109
  • [34] Automatic Lip-Reading System Based on Deep Convolutional Neural Network and Attention-Based Long Short-Term Memory
    Lu, Yuanyao
    Li, Hongbo
    APPLIED SCIENCES-BASEL, 2019, 9 (08):
  • [35] Knowledge Tracing with Contrastive Learning and Attention-Based Long Short-Term Memory Network
    Xu, Liancheng
    Guo, Lihua
    Wu, Xiaoqi
    Wang, Xinhua
    Guo, Lei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14865 : 25 - 36
  • [36] Research on bridge structural damage detection based on convolutional and long short-term memory neural networks
    Yang, Jianxi
    Zhang, Likai
    Li, Ren
    He, Yingying
    Jiang, Shixin
    Zou, Junzhi
    Journal of Railway Science and Engineering, 2020, 17 (08) : 1893 - 1902
  • [37] Attention-Based Convolution Skip Bidirectional Long Short-Term Memory Network for Speech Emotion Recognition
    Zhang, Huiyun
    Huang, Heming
    Han, Henry
    IEEE ACCESS, 2021, 9 : 5332 - 5342
  • [38] Convolutional Neural Network and Bidirectional Long Short-Term Memory (CNN-BiLSTM)-Attention-Based Prediction of the Amount of Silica Powder Moving in and out of a Warehouse
    Guo, Dudu
    Duan, Pengbin
    Yang, Zhen
    Zhang, Xiaojiang
    Su, Yinuo
    ENERGIES, 2024, 17 (15)
  • [39] Aspect-Based Sentiment Analysis Using Convolutional Neural Network and Bidirectional Long Short-Term Memory
    Cahyadi, Alson
    Khodra, Masayu Leylia
    2018 5TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS (ICAICTA 2018), 2018, : 124 - 129
  • [40] Deep Learning with Convolutional Neural Network and Long Short-Term Memory for Phishing Detection
    Adebowale, M. A.
    Lwin, K. T.
    Hossain, M. A.
    2019 13TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA), 2019,