End-to-End Dialogue Generation Using a Single Encoder and a Decoder Cascade With a Multidimension Attention Mechanism

被引：5

作者：

Belainine, Billal ^{[1
]}

Sadat, Fatiha ^{[1
]}

Boukadoum, Mounir ^{[1
]}

机构：

[1] Univ Quebec Montreal, Dept Comp Sci, Montreal, PQ H3X 2Y7, Canada

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023年 / 34卷 / 11期

关键词：

Decoding; History; Context modeling; Computer architecture; Visualization; Transformers; Predictive models; Attention mechanism; dialogue generation; hierarchical recurrent attention network (HRAN); neural machine; relevant context with self-attention (ReCoSa); sequence transduction;

D O I：

10.1109/TNNLS.2022.3151347

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human dialogues often show underlying dependencies between turns, with each interlocutor influencing the queries/responses of the other. This article follows this by proposing a neural architecture for conversation modeling that looks at the dialogue history of both sides. It consists of a generative model where one encoder feeds three decoders to process three successive turns of dialogue for predicting the next utterance, with a multidimension attention mechanism aggregating the past and current contexts for a cascade effect on each decoder. As a result, a more comprehensive account of the dialogue evolution is obtained than by focusing on a single turn or the last encoder context, or on the user side alone. The response generation performance of the model is evaluated on three corpora of different sizes and topics, and a comparison is made with six recent generative neural architectures, using both automatic metrics and human judgments. Our results show that the proposed architecture equals or improves the state-of-the-art for adequacy and fluency, particularly when large open-domain corpora are used in the training. Moreover, it allows better tracking of the dialogue state evolution for response explainability.

引用

页码：8482 / 8492

页数：11

共 50 条

[1] GPS Trajectory Completion Using End-to-End Bidirectional Convolutional Recurrent Encoder-Decoder Architecture with Attention Mechanism
Nawaz, Asif
Huang, Zhiqiu
Wang, Senzhang
Akbar, Azeem
AlSalman, Hussain
Gumaei, Abdu
SENSORS, 2020, 20 (18) : 1 - 16
[2] Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer
Chen, Zhengyang
Han, Bing
Wang, Shuai
Qian, Yanmin
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1636 - 1649
[3] Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network
Bhunia, Ayan Kumar
Bhowmick, Abir
Bhunia, Ankan Kumar
Konwer, Aishik
Banerjee, Prithaj
Roy, Partha Pratim
Pal, Umapada
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3639 - 3644
[4] A NEURAL PROSODY ENCODER FOR END-TO-END DIALOGUE ACT CLASSIFICATION
Wei, Kai
Knox, Dillon
Radfar, Martin
Tran, Thanh
Muller, Markus
Strimel, Grant P.
Susanj, Nathan
Mouchtaris, Athanasios
Omologo, Maurizio
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7047 - 7051
[5] Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Horiguchi, Shota
Fujita, Yusuke
Watanabe, Shinji
Xue, Yawen
Garcia, Paola
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1493 - 1507
[6] End-to-End Deep Background Subtraction based on Encoder-Decoder Network
Le, Duy H.
Pham, Tuan, V
PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 381 - 386
[7] End-to-End Trained CNN Encoder-Decoder Networks for Image Steganography
Rehman, Atique ur
Rahim, Rafia
Nadeem, Shahroz
ul Hussain, Sibt
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 723 - 729
[8] Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor
Chen, Zhengyang
Han, Bing
Wang, Shuai
Qian, Yanmin
INTERSPEECH 2023, 2023, : 3552 - 3556
[9] SEQUENCE TRAINING OF ENCODER-DECODER MODEL USING POLICY GRADIENT FOR END-TO-END SPEECH RECOGNITION
Karita, Shigeki
Ogawa, Atsunori
Delcroix, Marc
Nakatani, Tomohiro
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5839 - 5843
[10] End-to-End Recurrent Cross-Modality Attention for Video Dialogue
Chu, Yun-Wei
Lin, Kuan-Yen
Hsu, Chao-Chun
Ku, Lun-Wei
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2456 - 2464

← 1 2 3 4 5 →