SEQ2SEQ++: A Multitasking-Based Seq2seq Model to Generate Meaningful and Relevant Answers

被引:6
|
作者
Palasundram, Kulothunkan [1 ]
Sharef, Nurfadhlina Mohd [1 ]
Kasmiran, Khairul Azhar [1 ]
Azman, Azreen [1 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Intelligent Comp Res Grp, Seri Kembangan 43400, Selangor, Malaysia
来源
IEEE ACCESS | 2021年 / 9卷 / 09期
关键词
Task analysis; Chatbots; Computational modeling; Decoding; Training; Transformers; Benchmark testing; Sequence to sequence learning; natural answer generation; multitask learning; attention mechanism; ATTENTION; ENCODER;
D O I
10.1109/ACCESS.2021.3133495
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Question-answering chatbots have tremendous potential to complement humans in various fields. They are implemented using either rule-based or machine learning-based systems. Unlike the former, machine learning-based chatbots are more scalable. Sequence-to-sequence (Seq2Seq) learning is one of the most popular approaches in machine learning-based chatbots and has shown remarkable progress since its introduction in 2014. However, chatbots based on Seq2Seq learning show a weakness in that it tends to generate answers that can be generic and inconsistent with the questions, thereby becoming meaningless and, therefore, may lower the chatbot adoption rate. This weakness can be attributed to three issues: question encoder overfit, answer generation overfit, and language model influence. Several recent methods utilize multitask learning (MTL) to address this weakness. However, the existing MTL models show very little improvement over single-task learning, wherein they still generate generic and inconsistent answers. This paper presents a novel approach to MTL for the Seq2Seq learning model called SEQ2SEQ++, which comprises a multifunctional encoder, an answer decoder, an answer encoder, and a ternary classifier. Additionally, SEQ2SEQ++ utilizes a dynamic tasks loss weight mechanism for MTL loss calculation and a novel attention mechanism called the comprehensive attention mechanism. Experiments on NarrativeQA and SQuAD datasets were conducted to gauge the performance of the proposed model in comparison with two recently proposed models. The experimental results show that SEQ2SEQ++ yields noteworthy improvements over the two models on bilingual evaluation understudy, word error rate, and Distinct-2 metrics.
引用
收藏
页码:164949 / 164975
页数:27
相关论文
共 50 条
  • [1] Keyphrase Generation Based on Deep Seq2seq Model
    Zhang, Yong
    Xiao, Weidong
    IEEE ACCESS, 2018, 6 : 46047 - 46057
  • [2] A Hierarchical Attention Based Seq2Seq Model for Chinese Lyrics Generation
    Fan, Haoshen
    Wang, Jie
    Zhuang, Bojin
    Wang, Shaojun
    Xiao, Jing
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 279 - 288
  • [3] Application of Seq2Seq Models on Code Correction
    Huang, Shan
    Zhou, Xiao
    Chin, Sang
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2021, 4
  • [4] Knowledge-based Questions Generation with Seq2Seq Learning
    Tang, Xiangru
    Gao, Hanning
    Gao, Junjie
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2018, : 180 - 184
  • [5] Research On Human-computer Dialogue Based On Improved Seq2seq Model
    Shang, Wenqian
    Zhu, Sunyu
    Xiao, Dong
    2021 IEEE/ACIS 21ST INTERNATIONAL FALL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2021-FALL), 2021, : 204 - 209
  • [6] Learning Transductions and Alignments with RNN Seq2seq Models
    Wang, Zhengxiang
    INTERNATIONAL CONFERENCE ON GRAMMATICAL INFERENCE, VOL 217, 2023, 217 : 223 - 249
  • [7] Adaptive Multistep Prediction With Sequence-to-Sequence (Seq2Seq) Models
    Kelley, Joseph
    Hagan, Martin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
  • [8] Fine Grained Named Entity Recognition via Seq2seq Framework
    Zhu, Huiming
    He, Chunhui
    Fang, Yang
    Xiao, Weidong
    IEEE ACCESS, 2020, 8 : 53953 - 53961
  • [9] Evaluating Performance of Conversational Bot Using Seq2Seq Model and Attention Mechanism
    Saluja, Karandeep
    Agrawal, Shashwat
    Kumar, Sanjeev
    Choudhury, Tanupriya
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (06): : 1 - 11
  • [10] ST-Seq2Seq: A Spatio-Temporal Feature-Optimized Seq2Seq Model for Short-Term Vessel Trajectory Prediction
    You, Lan
    Xiao, Siyu
    Peng, Qingxi
    Claramunt, Christophe
    Han, Xuewei
    Guan, Zhengyi
    Zhang, Jiahe
    IEEE ACCESS, 2020, 8 : 218565 - 218574