Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots

被引:83
作者
Gu, Jia-Chen [1 ]
Li, Tianda [2 ]
Liu, Quan [1 ,3 ]
Ling, Zhen-Hua [1 ]
Su, Zhiming [3 ]
Wei, Si [3 ]
Zhu, Xiaodan [2 ]
机构
[1] Univ Sci & Technol China, Natl Engn Lab Speech & Language Informat Proc, Hefei, Peoples R China
[2] Queens Univ, ECE & Ingenu Labs, Kingston, ON, Canada
[3] iFLYTEK Res, State Key Lab Cognit Intelligence, Hefei, Peoples R China
来源
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT | 2020年
关键词
Speaker-aware BERT; multi-turn response selection; retrieval-based chatbot;
D O I
10.1145/3340531.3412330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study the problem of employing pre-trained language models for multi-turn response selection in retrieval-based chatbots. A new model, named Speaker-Aware BERT (SA-BERT), is proposed in order to make the model aware of the speaker change information, which is an important and intrinsic property of multi-turn dialogues. Furthermore, a speaker-aware disentanglement strategy is proposed to tackle the entangled dialogues. This strategy selects a small number of most important utterances as the filtered context according to the speakers' information in them. Finally, domain adaptation is performed to incorporate the in-domain knowledge into pre-trained language models. Experiments on five public datasets show that our proposed model outperforms the present models on all metrics by large margins and achieves new state-of-the-art performances for multi-turn response selection.
引用
收藏
页码:2041 / 2044
页数:4
相关论文
共 13 条
  • [1] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [2] Dong J., 2018, Enhance word representation for out-of-vocabulary on ubuntu dialogue corpus
  • [3] Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots
    Gu, Jia-Chen
    Ling, Zhen-Hua
    Liu, Quan
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2321 - 2324
  • [4] Henderson M, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P5392
  • [5] Kim Seokhwan, 2019, ARXIV191106394
  • [6] Lowe R., 2015, SIGDIAL, P285, DOI [10.18653/V1/W15-4640, 10., DOI 10.18653/V1/W15-4640, 10.18653/v1/W15-4640]
  • [7] Lowe R, 2017, ORIGINS OF HIGHER LEARNING: KNOWLEDGE NETWORKS AND THE EARLY DEVELOPMENT OF UNIVERSITIES, P31
  • [8] Tao CY, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1
  • [9] Multi-Representation Fusion Network for Multi-turn Response Selection in Retrieval-based Chatbots
    Tao, Chongyang
    Wu, Wei
    Xu, Can
    Hu, Wenpeng
    Zhao, Dongyan
    Yan, Rui
    [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 267 - 275
  • [10] Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots
    Wu, Yu
    Wu, Wei
    Xing, Chen
    Li, Zhoujun
    Zhou, Ming
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 496 - 505