HCA: Hierarchical Compare Aggregate model for question retrieval in community question answering

被引:17
作者
Zahedi, Mohammad Sadegh [1 ]
Rahgozar, Maseud [1 ]
Zoroofi, Reza Aghaeizadeh [1 ]
机构
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
关键词
Community question answering; Question retrieval; Hierarchical compare-aggregate model; Transfer learning; Deep learning;
D O I
10.1016/j.ipm.2020.102318
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address the problem of finding similar historical questions that are semantically equivalent or relevant to an input query question in community question-answering (CQA) sites. One of the main challenges for this task is that questions are usually too long and often contain peripheral information in addition to the main goals of the question. To address this problem, we propose an end-to-end Hierarchical Compare Aggregate (HCA) model that can handle this problem without using any task-specific features. We first split questions into sentences and compare every sentence pair of the two questions using a proposed Word-Level-Compare-Aggregate model called WLCA-model and then the comparison results are aggregated with a proposed Sentence-Level-Compare-Aggregate model to make the final decision. To handle the insufficient training data problem, we propose a sequential transfer learning approach to pre-train the WLCA-model on a large paraphrase detection dataset. Our experiments on two editions of the Semeval benchmark datasets and the domain-specific AskUbuntu dataset show that our model outperforms the state-of-the-art models.
引用
收藏
页数:23
相关论文
共 80 条
[61]   Learning Semantic Representations Using Convolutional Neural Networks for Web Search [J].
Shen, Yelong ;
He, Xiaodong ;
Gao, Jianfeng ;
Deng, Li ;
Mesnil, Gregoire .
WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, :373-374
[62]  
Shtok Anna., 2012, Proceedings of the 21st international conference on World Wide Web, P759, DOI DOI 10.1145/2187836.2187939
[63]  
Singh A., 2012, P 2012 JOINT C EMP M, P1266
[64]   A Comprehensive Survey and Classification of Approaches for Community Question Answering [J].
Srba, Ivan ;
Bielikova, Maria .
ACM TRANSACTIONS ON THE WEB, 2016, 10 (03)
[65]  
Srivastava R. K., 2015, ARXIV150500387, P1
[66]  
Tan CQ, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P4411
[67]   Concept and Attention-Based CNN for Question Retrieval in Multi-View Learning [J].
Wang, Pengwei ;
Ji, Lei ;
Yan, Jun ;
Dou, Dejing ;
De Silva, Nisansa ;
Zhang, Yong ;
Jin, Lianwen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2018, 9 (04)
[68]  
Wang S., 2017, P ICLR
[69]  
Wang Z, 2017, IEEE IJCNN, P1411, DOI 10.1109/IJCNN.2017.7966018
[70]  
Wu G., 2016, P 10 INT WORKSH SEM, P872, DOI 10.18653/v1/S16-1135