Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search

被引:208
作者
Dai, Zhuyun [1 ]
Xiong, Chenyan [1 ]
Callan, Jamie [1 ]
Liu, Zhiyuan [2 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
来源
WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING | 2018年
基金
美国国家科学基金会;
关键词
D O I
10.1145/3159652.3159659
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents Conv-KNRM, a Convolutional Kernel-based Neural Ranking Model that models n-gram soft matches for ad-hoc search. Instead of exact matching query and document n-grams, Conv-KNRM uses Convolutional Neural Networks to represent n-grams of various lengths and soft matches them in a unified embedding space. The n-gram soft matches are then utilized by the kernel pooling and learning-to-rank layers to generate the final ranking score. Conv-KNRM can be learned end-to-end and fully optimized from user feedback. The learned model's generalizability is investigated by testing how well it performs in a related domain with small amounts of training data. Experiments on English search logs, Chinese search logs, and TREC Web track tasks demonstrated consistent advantages of Conv-KNRM over prior neural IR methods and feature-based methods.
引用
收藏
页码:126 / 134
页数:9
相关论文
共 30 条
[1]  
[Anonymous], 2010, Search engines: Information retrieval in practice
[2]  
[Anonymous], 2016, P 25 ACM INT C INF K
[3]  
[Anonymous], 2014, P 23 INT C WORLD WID
[4]  
[Anonymous], 2002, P ACM SIGKDD KDD 200
[5]  
[Anonymous], P 54 ANN M ASS COMP
[6]  
[Anonymous], 2017, P 40 INT ACM SIGIR C
[7]  
[Anonymous], 2017, P 40 INT ACM SIGIR C
[8]  
[Anonymous], 2014, THESIS U WATERLOO
[9]  
[Anonymous], 2014, ADV NEURAL INFORM PR
[10]  
[Anonymous], 2010, INFORM RETRIEVAL