Group-wise Contrastive Learning for Neural Dialogue Generation

被引:0
|
作者
Cai, Hengyi [1 ,2 ,3 ]
Chen, Hongshen [3 ]
Song, Yonghao [1 ]
Ding, Zhuoye [3 ]
Bao, Yongjun [3 ]
Yan, Weipeng [3 ]
Zhao, Xiaofang [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] JD Com, Beijing, Peoples R China
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020 | 2020年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural dialogue response generation has gained much popularity in recent years. Maximum Likelihood Estimation (MLE) objective is widely adopted in existing dialogue model learning. However, models trained with MLE objective function are plagued by the lowdiversity issue when it comes to the opendomain conversational setting. Inspired by the observation that humans not only learn from the positive signals but also benefit from correcting behaviors of undesirable actions, in this work, we introduce contrastive learning into dialogue generation, where the model explicitly perceives the difference between the well-chosen positive and negative utterances. Specifically, we employ a pretrained baseline model as a reference. During contrastive learning, the target dialogue model is trained to give higher conditional probabilities for the positive samples, and lower conditional probabilities for those negative samples, compared to the reference model. To manage the multimapping relations prevalent in human conversation, we augment contrastive dialogue learning with group-wise dual sampling. Extensive experimental results show that the proposed group-wise contrastive learning framework is suited for training a wide range of neural dialogue generation models with very favorable performance over the baseline training approaches.
引用
收藏
页码:793 / 802
页数:10
相关论文
共 50 条
  • [1] GROUP-WISE FEATURE SELECTION FOR SUPERVISED LEARNING
    Xiao, Qi
    Li, Hebi
    Tian, Jin
    Wang, Zhengdao
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3149 - 3153
  • [2] GWQ: Group-Wise Quantization Framework for Neural Networks
    Yang, Jiaming
    Tang, Chenwei
    Yu, Caiyang
    Lv, Jiancheng
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [3] Group-Wise Learning for Weakly Supervised Semantic Segmentation
    Zhou, Tianfei
    Li, Liulei
    Li, Xueyi
    Feng, Chun-Mei
    Li, Jianwu
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 799 - 811
  • [4] MGCNet: Multiple group-wise correlation network with hierarchical contrastive learning for co-salient object detection
    Fang, Xian
    Wang, Xin
    Zhu, Jinchao
    Chen, Qiaohong
    Chen, Zuofan
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [5] Group-Wise Learning for Aurora Image Classification With Multiple Representations
    Zhang, Jun
    Liu, Mingxia
    Lu, Ke
    Gao, Yue
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (08) : 4112 - 4124
  • [6] Bilateral personalized dialogue generation with contrastive learning
    Bin Li
    Hanjun Deng
    Soft Computing, 2023, 27 : 3115 - 3132
  • [7] Bilateral personalized dialogue generation with contrastive learning
    Li, Bin
    Deng, Hanjun
    SOFT COMPUTING, 2023, 27 (06) : 3115 - 3132
  • [8] A Neural Group-wise Sentiment Analysis Model with Data Sparsity Awareness
    Zhou, Deyu
    Zhang, Meng
    Zhang, Linhai
    He, Yulan
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14594 - 14601
  • [9] Personalized Filled-pause Generation with Group-wise Prediction Models
    Matsunaga, Yuta
    Saeki, Takaaki
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 385 - 392
  • [10] CL-CSP: Contrastive Learning with Continuous Semantic Perturbations for Neural Dialogue Generation
    Liang, Zhiping
    Zhan, Haolan
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,