Multi-source inverse-curriculum-based training for low-resource dialogue generation

被引:0
|
作者
Fuwei Cui
Hui Di
Hui Huang
Hongjie Ren
Kazushige Ouchi
Ze Liu
Jinan Xu
机构
[1] Beijing Jiaotong University,Institute of Advanced Control System, School of Electronic Information Engineering
[2] Toshiba (China) Co.,School of Computer Information Technology
[3] Ltd,undefined
[4] Beijing Jiaotong University,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Dialogue generation; Low-resource dialogue generation; Data augmentation; Curriculum learning;
D O I
暂无
中图分类号
学科分类号
摘要
An effective dialogue system needs amount of training data, but the existing training data is insufficient. Although the pre-trained model has made great progress in recent years, which can alleviate the problem of low resource dialogue to a certain extent, the pre-trained model is large and difficult to deploy. How to improve the performance of dialogue model without additional annotation data and decreasing the model volume has become a new challenge. We propose a multi-source data augmentation method for low-resource dialogue generation by utilizing inverse curriculum learning (inverse CL). Firstly, we adopt three data augmentation methods, including round-trip translation, paraphrasing and pre-trained model, to generate augmentation data. Next, we propose a new training strategy based on inverse CL to utilize different augmentation data. Comparing with the baselines, our method comprehensively outperform the baselines on all evaluation metrics, which shows the effectiveness of our proposed training strategy for dialogue generation. To the best of our knowledge, this is the first systematic investigation of data augmentation in the dialogue generation.
引用
收藏
页码:13665 / 13676
页数:11
相关论文
共 50 条
  • [31] Linguistically Driven Multi-Task Pre-Training for Low-Resource Neural Machine Translation
    Mao, Zhuoyuan
    Chu, Chenhui
    Kurohashi, Sadao
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (04)
  • [32] Cross-Lingual Summarization Method Based on Joint Training and Self-Training in Low-Resource Scenarios
    Cheng, Shaohuan
    Tang, Yujia
    Liu, Qiao
    Chen, Wenyu
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2024, 53 (05): : 762 - 770
  • [33] A surface water resource asset accounting method based on multi-source remote sensing data
    Kang, Hui
    Dou, Wenzhang
    Chen, Li
    Han, Lingyi
    Sui, Xinxin
    Ding, Ziyue
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2024, 12
  • [34] EM-based Phoneme Confusion Matrix Generation for Low-resource Spoken Term Detection
    Xu, Di
    Wang, Yun
    Metze, Florian
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 424 - 429
  • [35] Few-shot Low-resource Knowledge Graph Completion with Multi-view Task Representation Generation
    Pei, Shichao
    Kou, Ziyi
    Zhang, Qiannan
    Zhang, Xiangliang
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1862 - 1871
  • [36] A multi-source domain adaption intelligent fault diagnosis method based on asymmetric adversarial training
    Yang, Dan
    Ma, Tianyu
    Li, Zhipeng
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (03)
  • [37] Low-resource multi-granularity academic function recognition based on multiple prompt knowledge
    Liu, Jiawei
    Xiong, Zi
    Jiang, Yi
    Ma, Yongqiang
    Lu, Wei
    Huang, Yong
    Cheng, Qikai
    ELECTRONIC LIBRARY, 2024, 42 (06): : 879 - 904
  • [38] Multi-source Domain Adaptation Intelligent Fault Diagnosis Method Based on Asymmetric Adversarial Training
    Li, Zhipeng
    Ma, Tianyu
    Liu, Jinping
    Xiang, Qingsong
    Tang, Junjie
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2024, 60 (18): : 76 - 88
  • [39] Factors Affecting Implementation of Simulation-Based Education After Faculty Training in a Low-Resource Setting
    Seethamraju, Rajasri R.
    Stone, Kimberly P.
    Shepherd, Michael
    SIMULATION IN HEALTHCARE-JOURNAL OF THE SOCIETY FOR SIMULATION IN HEALTHCARE, 2022, 17 (01): : E113 - E121
  • [40] AutoQGS: Auto-Prompt for Low-Resource Knowledge-based Question Generation from SPARQL
    Xiong, Guanming
    Bao, Junwei
    Zhao, Wen
    Wu, Youzheng
    He, Xiaodong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 2250 - 2259