SIDECONTROL: Controlled Open-domain Dialogue Generation via Additive Side Networks

被引:0
|
作者
Du, Wanyu [1 ]
Ji, Yangfeng [1 ]
机构
[1] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22904 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer-based pre-trained language models boost the performance of open-domain dialogue systems. Prior works leverage Transformer-based pre-trained language models to generate texts with desired attributes in two general approaches: (1) gradient-based methods: updating all latent representations of pre-trained models with gradients from attribute models; (2) weighted-decoding methods: re-ranking beam candidates from pretrained models with attribute functions. However, gradient-based methods lead to high computation cost and can easily get overfitted on small training sets, while weighted-decoding methods are inherently constrained by the lowvariance high-bias pre-trained model. In this work, we propose a novel approach to control the generation of Transformer-based pretrained language models: the SIDECONTROL framework, which leverages a novel control attributes loss to incorporate useful control signals, and is shown to perform well with very limited training samples. We evaluate our proposed method on two benchmark open-domain dialogue datasets, and results show that the SIDECONTROL framework has better controllability, higher generation quality and better sample-efficiency than existing gradient-based and weighted-decoding baselines.
引用
收藏
页码:2175 / 2194
页数:20
相关论文
共 50 条
  • [31] Improving Open-Domain Dialogue Systems via Multi-Turn Incomplete Utterance Restoration
    Pan, Zhufeng
    Bai, Kun
    Wang, Yan
    Zhou, Lianqiang
    Liu, Xiaojiang
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1824 - 1833
  • [32] Selecting Stickers in Open-Domain Dialogue through Multitask Learning
    Zhang, Zhexin
    Zhu, Yeshuang
    Fei, Zhengcong
    Zhang, Jinchao
    Zhou, Jie
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3053 - 3060
  • [33] Contextual Dialogue Act Classification for Open-Domain Conversational Agents
    Ahmadvand, Ali
    Choi, Jason Ingyu
    Agichtein, Eugene
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1273 - 1276
  • [34] Achieving Reliable Human Assessment of Open-Domain Dialogue Systems
    Ji, Tianbo
    Graham, Yvette
    Jones, Gareth
    Lyu, Chenyang
    Liu, Qun
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6416 - 6437
  • [35] Generating Responses Expressing Emotion in an Open-Domain Dialogue System
    Huang, Chenyang
    Zaiane, Osmar R.
    INTERNET SCIENCE, 2019, 11551 : 100 - 112
  • [36] An Automatic Evaluation Method for Open-domain Dialogue Based on BLEURT
    Wu, Shih-Hung
    Lee, Jia-Jun
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2022), 2022, : 83 - 89
  • [37] Open-domain Dialogue Generation: What We Can Do, Cannot Do, And Should Do Next
    Kann, Katharina
    Ebrahimi, Abteen
    Koh, Joewie J.
    Dudy, Shiran
    Roncone, Alessandro
    PROCEEDINGS OF THE 4TH WORKSHOP ON NLP FOR CONVERSATIONAL AI, 2022, : 148 - 165
  • [38] Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation
    Liu, Lei
    Huang, Jimmy Xiangji
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2287 - 2292
  • [39] Less is More: Mitigate Spurious Correlations for Open-Domain Dialogue Response Generation Models by Causal Discovery
    Feng, Tao
    Qu, Lizhen
    Haffari, Gholamreza
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 511 - 530
  • [40] A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems
    Kim, San
    Jang, Jin Yea
    Jung, Minyoung
    Shin, Saim
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 352 - 365