SIDECONTROL: Controlled Open-domain Dialogue Generation via Additive Side Networks

被引:0
|
作者
Du, Wanyu [1 ]
Ji, Yangfeng [1 ]
机构
[1] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22904 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer-based pre-trained language models boost the performance of open-domain dialogue systems. Prior works leverage Transformer-based pre-trained language models to generate texts with desired attributes in two general approaches: (1) gradient-based methods: updating all latent representations of pre-trained models with gradients from attribute models; (2) weighted-decoding methods: re-ranking beam candidates from pretrained models with attribute functions. However, gradient-based methods lead to high computation cost and can easily get overfitted on small training sets, while weighted-decoding methods are inherently constrained by the lowvariance high-bias pre-trained model. In this work, we propose a novel approach to control the generation of Transformer-based pretrained language models: the SIDECONTROL framework, which leverages a novel control attributes loss to incorporate useful control signals, and is shown to perform well with very limited training samples. We evaluate our proposed method on two benchmark open-domain dialogue datasets, and results show that the SIDECONTROL framework has better controllability, higher generation quality and better sample-efficiency than existing gradient-based and weighted-decoding baselines.
引用
收藏
页码:2175 / 2194
页数:20
相关论文
共 50 条
  • [1] Adversarial Evaluation for Open-Domain Dialogue Generation
    Bruni, Elia
    Fernandez, Raquel
    18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), 2017, : 284 - 288
  • [2] Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
    Li, Yiwei
    Mi, Fei
    Li, Yitong
    Wang, Yasheng
    Bin Sun
    Feng, Shaoxiong
    Li, Kan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 11585 - 11596
  • [3] Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation
    Pang, Bo
    Nijkamp, Erik
    Han, Wenjuan
    Zhou, Linqi
    Liu, Yixian
    Tu, Kewei
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3619 - 3629
  • [4] A Randomized Link Transformer for Diverse Open-Domain Dialogue Generation
    Lee, Jing Yang
    Lee, Kong Aik
    Gan, Woon Seng
    PROCEEDINGS OF THE 4TH WORKSHOP ON NLP FOR CONVERSATIONAL AI, 2022, : 1 - 11
  • [5] Towards Diverse, Relevant and Coherent Open-Domain Dialogue Generation via Hybrid Latent Variables
    Sun, Bin
    Li, Yitong
    Mi, Fei
    Wang, Weichao
    Li, Yiwei
    Li, Kan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13600 - 13608
  • [6] Leveraging Context for Neural Question Generation in Open-domain Dialogue Systems
    Ling, Yanxiang
    Cai, Fei
    Chen, Honghui
    de Rijke, Maarten
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2486 - 2492
  • [7] Interpreting open-domain dialogue generation by disentangling latent feature representations
    Ye Wang
    Jingbo Liao
    Hong Yu
    Guoyin Wang
    Xiaoxia Zhang
    Li Liu
    Neural Computing and Applications, 2023, 35 : 20855 - 20867
  • [8] Interpreting open-domain dialogue generation by disentangling latent feature representations
    Wang, Ye
    Liao, Jingbo
    Yu, Hong
    Wang, Guoyin
    Zhang, Xiaoxia
    Liu, Li
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (28): : 20855 - 20867
  • [9] Text is NOT Enough: Integrating Visual Impressions into Open-domain Dialogue Generation
    Shen, Lei
    Zhan, Haolan
    Shen, Xin
    Song, Yonghao
    Zhao, Xiaofang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4287 - 4296
  • [10] More is Better: Enhancing Open-Domain Dialogue Generation via Multi-Source Heterogeneous Knowledge
    Wu, Sixing
    Li, Ying
    Wang, Minghui
    Zhang, Dawei
    Zhou, Yang
    Wu, Zhonghai
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2286 - 2300