End-to-End Dialogue Generation Using a Single Encoder and a Decoder Cascade With a Multidimension Attention Mechanism

被引:5
|
作者
Belainine, Billal [1 ]
Sadat, Fatiha [1 ]
Boukadoum, Mounir [1 ]
机构
[1] Univ Quebec Montreal, Dept Comp Sci, Montreal, PQ H3X 2Y7, Canada
关键词
Decoding; History; Context modeling; Computer architecture; Visualization; Transformers; Predictive models; Attention mechanism; dialogue generation; hierarchical recurrent attention network (HRAN); neural machine; relevant context with self-attention (ReCoSa); sequence transduction;
D O I
10.1109/TNNLS.2022.3151347
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human dialogues often show underlying dependencies between turns, with each interlocutor influencing the queries/responses of the other. This article follows this by proposing a neural architecture for conversation modeling that looks at the dialogue history of both sides. It consists of a generative model where one encoder feeds three decoders to process three successive turns of dialogue for predicting the next utterance, with a multidimension attention mechanism aggregating the past and current contexts for a cascade effect on each decoder. As a result, a more comprehensive account of the dialogue evolution is obtained than by focusing on a single turn or the last encoder context, or on the user side alone. The response generation performance of the model is evaluated on three corpora of different sizes and topics, and a comparison is made with six recent generative neural architectures, using both automatic metrics and human judgments. Our results show that the proposed architecture equals or improves the state-of-the-art for adequacy and fluency, particularly when large open-domain corpora are used in the training. Moreover, it allows better tracking of the dialogue state evolution for response explainability.
引用
收藏
页码:8482 / 8492
页数:11
相关论文
共 50 条
  • [41] Joint CTC-Attention End-to-End Speech Recognition with a Triangle Recurrent Neural Network Encoder
    Zhu T.
    Cheng C.
    Journal of Shanghai Jiaotong University (Science), 2020, 25 (01) : 70 - 75
  • [42] An End-to-End Speech Enhancement Method Combining Attention Mechanism to Improve GAN
    Chen, Wei
    Cai, Yichao
    Yang, Qingyu
    Wang, Ge
    Liu, Taian
    Liu, Xinying
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 538 - 542
  • [43] LARGE CONTEXT END-TO-END AUTOMATIC SPEECH RECOGNITION VIA EXTENSION OF HIERARCHICAL RECURRENT ENCODER-DECODER MODELS
    Masumura, Ryo
    Tanaka, Tomohiro
    Moriya, Takafumi
    Shinohara, Yusuke
    Oba, Takanobu
    Aono, Yushi
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5661 - 5665
  • [44] End-to-End Optical Music Recognition with Attention Mechanism and Memory Units Optimization
    He, Ruichen
    Yao, Junfeng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT II, 2024, 14426 : 400 - 411
  • [45] End-to-end Parking Behavior Recognition Based on Self-attention Mechanism
    Li, Penghua
    Zhu, Dechen
    Mou, Qiyun
    Tu, Yushan
    Wu, Jinfeng
    2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023, : 371 - 376
  • [46] A new end-to-end image dehazing algorithm based on residual attention mechanism
    Yang Z.
    Shang J.
    Zhang Z.
    Zhang Y.
    Liu S.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2021, 39 (04): : 901 - 908
  • [47] A novel end-to-end chromosome classification approach using deep neural network with triple attention mechanism
    Chang, Ling
    Wu, Kaijie
    Gu, Chaocheng
    Chen, Cailian
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 91
  • [48] Learning to localize image forgery using end-to-end attention network
    Ganapathi, Iyyakutti Iyappan
    Javed, Sajid
    Ali, Syed Sadaf
    Mahmood, Arif
    Vu, Ngoc-Son
    Werghi, Naoufel
    NEUROCOMPUTING, 2022, 512 : 25 - 39
  • [49] Streaming End-to-End ASR Using CTC Decoder and DRA for Linguistic Information Substitution
    Takagi, Tatsunari
    Ogawa, Atsunori
    Kitaoka, Norihide
    Wakabayashi, Yukoh
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1779 - 1783
  • [50] An End-to-End Multimodal Voice Activity Detection Using WaveNet Encoder and Residual Networks
    Ariav, Ido
    Cohen, Israel
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (02) : 265 - 274