Modeling Intent Graph for Search Result Diversification

被引:27
作者
Su, Zhan [2 ]
Dou, Zhicheng [1 ]
Zhu, Yutao [3 ]
Qin, Xubo [2 ]
Wen, Ji-Rong [4 ,5 ]
机构
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
[2] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[3] Univ Montreal, Montreal, PQ, Canada
[4] Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
[5] MOE, Key Lab Data Engn & Knowledge Engn, Beijing, Peoples R China
来源
SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 2021年
基金
中国国家自然科学基金;
关键词
Intent Graph; Search Result Diversification; Graph Neural Network;
D O I
10.1145/3404835.3462872
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Search result diversification aims to offer diverse documents that cover as many intents as possible. Most existing implicit diversification approaches model diversity through the similarity of document representation, which is indirect and unnatural. To handle the diversity more precisely, we measure the similarity of documents by their similarity of the intent coverage. Specifically, we build a classifier to judge whether two different documents contain the same intent based on the document's content. Then we construct an intent graph to present the complicated relationship of documents and the query. On the intent graph, documents are connected if they are similar, while the query and the document are gradually connected based on the document selection result. Then we employ graph convolutional networks (GCNs) to update the representation of the query and each document by aggregating its neighbors. By this means, we can obtain the context-aware query representation and the intent-aware document representations through the dynamic intent graph during the document selection process. Furthermore, these representations and intent graph features are fused into diversity features. Combined with the traditional relevance features, we obtain the final ranking score that balances the relevance and the diversity. Experimental results show that this implicit diversification model significantly outperforms all existing implicit diversification methods, and it can even beat the state-of-the-art explicit models.
引用
收藏
页码:736 / 746
页数:11
相关论文
共 49 条
[1]  
Agrawal R., 2009, WSDM, P5, DOI DOI 10.1145/1498759.1498766
[2]  
[Anonymous], 2008, SIGIR, DOI [DOI 10.1145/1390334.1390446, 10.1145/, DOI 10.1145/1390334]
[3]  
Benyu Zhang, 2005, SIGIR 2005. Proceedings of the Twenty-Eighth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P504, DOI 10.1145/1076034.1076120
[4]  
Callan Jamie, 2009, Clueweb09 data set
[5]  
Carbonell J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P335, DOI 10.1145/290941.291025
[6]  
Chapelle Olivier, 2009, P 18 ACM C INF KNOWL, P621, DOI DOI 10.1145/1645953.1646033
[7]  
Chen DL, 2020, AAAI CONF ARTIF INTE, V34, P3438
[8]  
Clark Kevin, 2019, ARXIV190704829
[9]  
Clarke CLA, 2009, LECT NOTES COMPUT SC, V5766, P188, DOI 10.1007/978-3-642-04417-5_17
[10]  
Dang V, 2013, SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, P603