Attention-Based Joint Learning for Intent Detection and Slot Filling Using Bidirectional Long Short-Term Memory and Convolutional Neural Networks (AJLISBC)

被引:0
|
作者
Muhammad, Yusuf Idris [1 ]
Salim, Naomie [1 ]
Huspi, Sharin Hazlin [1 ]
Zainal, Anazida [1 ]
机构
[1] Univ Teknol Malaysia, Fac Comp, Skudai 81310, Malaysia
关键词
-Joint learning; intent detection; slot filling; multichannel;
D O I
10.14569/IJACSA.2024.0150890
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Effective natural language understanding is crucial for dialogue systems, requiring precise intent detection and slot filling to facilitate interactions. Traditionally, these subtasks have been addressed separately, but their interconnection suggests that joint solutions yield better results. Recent neural network-based approaches have shown significant performance in joint intent detection and slot filling tasks. The two primary neural network structures used are recurrent neural networks (RNNs) and convolutional neural networks (CNNs). RNNs capture long-term dependencies and store previous information semantics in a fixed- size vector, but their ability to extract global semantics is limited. CNNs can capture n-gram features using convolutional filters, but their performance is constrained by filter width. To leverage the strengths and mitigate the weaknesses of both networks, this paper proposes an attention-based joint learning classification for intent detection and slot filling using BiLSTM and CNNs (AJLISBC). The BiLSTM encodes input sequences in both forward and backward directions, producing high-dimensional representations. It applies scalar and vectorial attention to obtain multichannel representations, with scalar attention calculating word-level importance and vectorial attention assessing feature- level importance. For classification, AJLISBC employs a CNN structure to capture word relations in the representations generated by the attention mechanism, effectively extracting ngram features. Experimental results on the benchmark Airline Travel Information System (ATIS) dataset demonstrate that AJLISBC outperforms state-of-the-art methods.
引用
收藏
页码:915 / 922
页数:8
相关论文
共 50 条
  • [41] Classification of causes of speech recognition errors using attention-based bidirectional long short-term memory and modulation spectrum
    Santoso, Jennifer
    Yamada, Takeshi
    Makino, Shoji
    2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, 2019, : 302 - 306
  • [42] Classification of causes of speech recognition errors using attention-based bidirectional long short-term memory and modulation spectrum
    Santoso, Jennifer
    Yamada, Takeshi
    Makino, Shoji
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 302 - 306
  • [43] Email Spam Detection using Bidirectional Long Short Term Memory with Convolutional Neural Network
    Rahman, Sefat E.
    Ullah, Shofi
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 1307 - 1311
  • [44] Relation extraction in Chinese using attention-based bidirectional long short- term networks
    Zhang, Yanzi
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [45] Transient electromagnetic inversion to image the shallow subsurface based on convolutional bidirectional long short-term memory neural networks
    Shi, Yu
    Zhang, Jifeng
    You, Xiran
    Ma, Ziben
    Li, Jiachen
    GEOPHYSICAL JOURNAL INTERNATIONAL, 2024, 239 (01) : 173 - 191
  • [46] VOICE CONVERSION USING DEEP BIDIRECTIONAL LONG SHORT-TERM MEMORY BASED RECURRENT NEURAL NETWORKS
    Sun, Lifa
    Kang, Shiyin
    Li, Kun
    Meng, Helen
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4869 - 4873
  • [47] Securing Networks with Convolutional Long Short-term Memory Based Traffic Prediction and Attention Mechanism for Intrusion Detection
    Tiwari, A.
    Kumar, D.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2025, 38 (08): : 1922 - 1931
  • [48] Convolutional Bidirectional Long Short-Term Memory for Deception Detection With Acoustic Features
    Xie, Yue
    Liang, Ruiyu
    Tao, Huawei
    Zhu, Yue
    Zhao, Li
    IEEE ACCESS, 2018, 6 : 76527 - 76534
  • [49] Attention-based long short-term memory fully convolutional network for chemical process fault diagnosis
    Xiong, Shanwei
    Zhou, Li
    Dai, Yiyang
    Ji, Xu
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2023, 56 : 1 - 14
  • [50] Towards Attention-Based Convolutional Long Short-Term Memory for Travel Time Prediction of Bus Journeys
    Wu, Jianqing
    Wu, Qiang
    Shen, Jun
    Cai, Chen
    SENSORS, 2020, 20 (12) : 1 - 13