SIDECONTROL: Controlled Open-domain Dialogue Generation via Additive Side Networks

Cited by: 0
Authors
Du, Wanyu [1]
Ji, Yangfeng [1]
Affiliations
[1] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22904 USA
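Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract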
Transformer-based pre-trained language models boost the performance of open-domain dialogue systems. Prior work uses these models to generate text with desired attributes through two general approaches: (1) gradient-based methods, which update all latent representations of the pre-trained model with gradients from attribute models; and (2) weighted-decoding methods, which re-rank beam candidates from the pre-trained model with attribute functions. However, gradient-based methods incur high computation cost and easily overfit small training sets, while weighted-decoding methods are inherently constrained by the low-variance, high-bias pre-trained model. In this work, we propose a novel approach to controlling the generation of Transformer-based pre-trained language models: the SIDECONTROL framework, which leverages a novel control attributes loss to incorporate useful control signals and is shown to perform well with very limited training samples. We evaluate the proposed method on two benchmark open-domain dialogue datasets, and the results show that the SIDECONTROL framework has better controllability, higher generation quality, and better sample efficiency than existing gradient-based and weighted-decoding baselines.
Pages: 2175-2194
Page count: 20