Multi-source inverse-curriculum-based training for low-resource dialogue generation

被引:0
|
作者
Fuwei Cui
Hui Di
Hui Huang
Hongjie Ren
Kazushige Ouchi
Ze Liu
Jinan Xu
机构
[1] Beijing Jiaotong University,Institute of Advanced Control System, School of Electronic Information Engineering
[2] Toshiba (China) Co.,School of Computer Information Technology
[3] Ltd,undefined
[4] Beijing Jiaotong University,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Dialogue generation; Low-resource dialogue generation; Data augmentation; Curriculum learning;
D O I
暂无
中图分类号
学科分类号
摘要
An effective dialogue system needs amount of training data, but the existing training data is insufficient. Although the pre-trained model has made great progress in recent years, which can alleviate the problem of low resource dialogue to a certain extent, the pre-trained model is large and difficult to deploy. How to improve the performance of dialogue model without additional annotation data and decreasing the model volume has become a new challenge. We propose a multi-source data augmentation method for low-resource dialogue generation by utilizing inverse curriculum learning (inverse CL). Firstly, we adopt three data augmentation methods, including round-trip translation, paraphrasing and pre-trained model, to generate augmentation data. Next, we propose a new training strategy based on inverse CL to utilize different augmentation data. Comparing with the baselines, our method comprehensively outperform the baselines on all evaluation metrics, which shows the effectiveness of our proposed training strategy for dialogue generation. To the best of our knowledge, this is the first systematic investigation of data augmentation in the dialogue generation.
引用
收藏
页码:13665 / 13676
页数:11
相关论文
共 50 条
  • [21] More is Better: Enhancing Open-Domain Dialogue Generation via Multi-Source Heterogeneous Knowledge
    Wu, Sixing
    Li, Ying
    Wang, Minghui
    Zhang, Dawei
    Zhou, Yang
    Wu, Zhonghai
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2286 - 2300
  • [22] Attention-based Fusion for Multi-source Human Image Generation
    Lathuiliere, Stephane
    Sangineto, Enver
    Siarohin, Aliaksandr
    Sebe, Nicu
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 428 - 437
  • [23] BioAug: Conditional Generation based Data Augmentation for Low-Resource Biomedical NER
    Ghosh, Sreyan
    Tyagi, Utkarsh
    Kumar, Sonal
    Manocha, Dinesh
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1853 - 1858
  • [24] Digital Resource Recommendation Based on Multi-Source Data and Scene Similarity Calculation
    Yanwen, Wu
    Qiuting, Cai
    Zhi, Liu
    Yunze, Deng
    Data Analysis and Knowledge Discovery, 2021, 5 (11) : 114 - 123
  • [25] Neural network training method for materials science based on multi-source databases
    Jialong Guo
    Ziyi Chen
    Zhiwei Liu
    Xianwei Li
    Zhiyuan Xie
    Zongguo Wang
    Yangang Wang
    Scientific Reports, 12
  • [26] Adversarial Training Based Multi-Source Unsupervised Domain Adaptation for Sentiment Analysis
    Dai, Yong
    Liu, Jian
    Ren, Xiancong
    Xu, Zenglin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7618 - 7625
  • [27] Neural network training method for materials science based on multi-source databases
    Guo, Jialong
    Chen, Ziyi
    Liu, Zhiwei
    Li, Xianwei
    Xie, Zhiyuan
    Wang, Zongguo
    Wang, Yangang
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [28] A SIMULATION-BASED APPROACH TO CRITICAL CARE TRAINING FOR PEDIATRICIANS IN LOW-RESOURCE SETTINGS
    Akingbola, Olugbenga
    Akindolire, Abimbola
    Nzegwu, Barbara
    Srivastav, Apurv
    CRITICAL CARE MEDICINE, 2025, 53 (01)
  • [29] Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation
    Jing, Liqiang
    Song, Xuemeng
    Ouyang, Kun
    Jia, Mengzhao
    Nie, Liqiang
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 11349 - 11361
  • [30] UniDE: A multi-level and low-resource framework for automatic dialogue evaluation via LLM-based data augmentation and multitask learning
    Ye, Guanghui
    Zhao, Huan
    Zhang, Zixing
    Jiang, Zhihua
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)