End-to-End Dialogue Generation Using a Single Encoder and a Decoder Cascade With a Multidimension Attention Mechanism

Cited by: 5
Authors
Belainine, Billal [1]
Sadat, Fatiha [1]
Boukadoum, Mounir [1]
Affiliations
[1] Univ Quebec Montreal, Dept Comp Sci, Montreal, PQ H3X 2Y7, Canada
Keywords
Decoding; History; Context modeling; Computer architecture; Visualization; Transformers; Predictive models; Attention mechanism; dialogue generation; hierarchical recurrent attention network (HRAN); neural machine; relevant context with self-attention (ReCoSa); sequence transduction
DOI
10.1109/TNNLS.2022.3151347
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human dialogues often show underlying dependencies between turns, with each interlocutor influencing the queries and responses of the other. Building on this observation, this article proposes a neural architecture for conversation modeling that takes into account the dialogue history of both sides. It is a generative model in which one encoder feeds three decoders that process three successive turns of dialogue to predict the next utterance, with a multidimension attention mechanism aggregating the past and current contexts to produce a cascade effect on each decoder. As a result, the model obtains a more comprehensive account of the dialogue evolution than approaches that focus on a single turn, on the last encoder context, or on the user side alone. The response generation performance of the model is evaluated on three corpora of different sizes and topics and compared with six recent generative neural architectures, using both automatic metrics and human judgments. The results show that the proposed architecture matches or improves on the state of the art for adequacy and fluency, particularly when large open-domain corpora are used for training. Moreover, it allows better tracking of the dialogue state evolution for response explainability.
Pages: 8482-8492
Page count: 11
Related papers
50 in total
  • [1] GPS Trajectory Completion Using End-to-End Bidirectional Convolutional Recurrent Encoder-Decoder Architecture with Attention Mechanism
    Nawaz, Asif
    Huang, Zhiqiu
    Wang, Senzhang
    Akbar, Azeem
    AlSalman, Hussain
    Gumaei, Abdu
    SENSORS, 2020, 20 (18) : 1 - 16
  • [2] Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer
    Chen, Zhengyang
    Han, Bing
    Wang, Shuai
    Qian, Yanmin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1636 - 1649
  • [3] Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network
    Bhunia, Ayan Kumar
    Bhowmick, Abir
    Bhunia, Ankan Kumar
    Konwer, Aishik
    Banerjee, Prithaj
    Roy, Partha Pratim
    Pal, Umapada
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3639 - 3644
  • [4] A NEURAL PROSODY ENCODER FOR END-TO-END DIALOGUE ACT CLASSIFICATION
    Wei, Kai
    Knox, Dillon
    Radfar, Martin
    Tran, Thanh
    Muller, Markus
    Strimel, Grant P.
    Susanj, Nathan
    Mouchtaris, Athanasios
    Omologo, Maurizio
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7047 - 7051
  • [5] Encoder-Decoder Based Attractors for End-to-End Neural Diarization
    Horiguchi, Shota
    Fujita, Yusuke
    Watanabe, Shinji
    Xue, Yawen
    Garcia, Paola
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1493 - 1507
  • [6] End-to-End Deep Background Subtraction based on Encoder-Decoder Network
    Le, Duy H.
    Pham, Tuan V.
    PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 381 - 386
  • [7] End-to-End Trained CNN Encoder-Decoder Networks for Image Steganography
    Rehman, Atique ur
    Rahim, Rafia
    Nadeem, Shahroz
    ul Hussain, Sibt
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 723 - 729
  • [8] Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor
    Chen, Zhengyang
    Han, Bing
    Wang, Shuai
    Qian, Yanmin
    INTERSPEECH 2023, 2023, : 3552 - 3556
  • [9] SEQUENCE TRAINING OF ENCODER-DECODER MODEL USING POLICY GRADIENT FOR END-TO-END SPEECH RECOGNITION
    Karita, Shigeki
    Ogawa, Atsunori
    Delcroix, Marc
    Nakatani, Tomohiro
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5839 - 5843
  • [10] End-to-End Recurrent Cross-Modality Attention for Video Dialogue
    Chu, Yun-Wei
    Lin, Kuan-Yen
    Hsu, Chao-Chun
    Ku, Lun-Wei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2456 - 2464