Improving Matching Models with Hierarchical Contextualized Representations for Multi-turn Response Selection

被引:6
作者
Tao, Chongyang [1 ]
Wu, Wei [2 ]
Feng, Yansong [1 ]
Zhao, Dongyan [1 ,3 ]
Yan, Rui [1 ,4 ]
机构
[1] Peking Univ, Wangxun Inst Comp Technol, Beijing, Peoples R China
[2] Microsoft Corp, Redmond, WA 98052 USA
[3] Peking Univ, Ctr Data Sci, Beijing, Peoples R China
[4] Beijing Acad Artificial Intelligence, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20) | 2020年
基金
美国国家科学基金会;
关键词
Contextualized word vectors; deep neural network; matching; multi-turn response selection; retrieval-based chatbot;
D O I
10.1145/3397271.3401290
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study context-response matching with pre-trained contextualized representations for multi-turn response selection in retrieval-based chatbots. Existing models, such as Cove and ELMo, are trained with limited context (often a single sentence or paragraph), and may not work well on multi-turn conversations, due to the hierarchical nature, informal language, and domain-specific words. To address the challenges, we propose pre-training hierarchical contextualized representations, including contextual word-level and sentence-level representations, by learning a dialogue generation model from large-scale conversations with a hierarchical encoder-decoder architecture. Then the two levels of representations are blended into the input and output layer of a matching model respectively. Experimental results on two benchmark conversation datasets indicate that the proposed hierarchical contextualized representations can bring significantly and consistently improvement to existing matching models for response selection.
引用
收藏
页码:1865 / 1868
页数:4
相关论文
共 10 条
[1]  
[Anonymous], 2013, NIPS
[2]  
[Anonymous], 2016, P 2016 C EMP METH NA, DOI DOI 10.18653/V1/D16-1036
[3]  
Cho K., 2014, P SSST 8 8 WORKSH SY
[4]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5]  
Lowe Ryan, 2015, P 16 ANN M SPECIAL I, P285
[6]  
McCann B, 2017, ADV NEUR IN, V30
[7]  
Pennington J., 2014, Glove: Global Vectors for Word Representation, P1532, DOI DOI 10.3115/V1/D14-1162
[8]  
Peters M. E., 2018, P 2018 C N AM CHAPT, V1, P2227, DOI [10.18653/V1/N18-1202, DOI 10.18653/V1/N18-1202]
[9]  
Serban IV, 2016, AAAI CONF ARTIF INTE, P3776
[10]   Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots [J].
Wu, Yu ;
Wu, Wei ;
Xing, Chen ;
Li, Zhoujun ;
Zhou, Ming .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :496-505