Session-Aware Query Auto-completion using Extreme Multi-Label Ranking

被引：7

作者：

Yadav, Nishant ^{[1
]}

Sen, Rajat ^{[2
]}

Hill, Daniel N. ^{[3
]}

Mazumdar, Arya ^{[4
]}

Dhillon, Inderjit S. ^{[3
,5
]}

机构：

[1] Univ Massachusetts, Amherst, MA 01003 USA

[2] Google Res, San Jose, CA USA

[3] Amazon, Seattle, WA USA

[4] Univ Calif San Diego, San Diego, CA USA

[5] Univ Texas Austin, Austin, TX 78712 USA

来源：

KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2021年

关键词：

eXtreme multi-label ranking; auto-complete; session-aware;

D O I：

10.1145/3447548.3467087

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Query auto-completion (QAC) is a fundamental feature in search engines where the task is to suggest plausible completions of a prefix typed in the search bar. Previous queries in the user session can provide useful context for the user's intent and can be leveraged to suggest auto-completions that are more relevant while adhering to the user's prefix. Such session-aware QACs can be generated by recent sequence-to-sequence deep learning models; however, these generative approaches often do not meet the stringent latency requirements of responding to each user keystroke. Moreover, these generative approaches pose the risk of showing nonsensical queries. One can pre-compute a relatively small subset of relevant queries for common prefixes and rank them based on the context. However, such an approach fails when no relevant queries for the current context are present in the pre-computed set. In this paper, we provide a solution to this problem: we take the novel approach of modeling session-aware QAC as an eXtreme Multi-Label Ranking (XMR) problem where the input is the previous query in the session and the user's current prefix, while the output space is the set of tens of millions of queries entered by users in the recent past. We adapt a popular XMR algorithm for this purpose by proposing several modifications to the key steps in the algorithm. The proposed modifications yield a 10x improvement in terms of Mean Reciprocal Rank (MRR) over the baseline XMR approach on a public search logs dataset. We are able to maintain an inference latency of less than 10 ms while still using session context. When compared against baseline models of acceptable latency, we observed a 33% improvement in MRR for short prefixes of up to 3 characters. Moreover, our model yielded a statistically significant improvement of 2.81% over a production QAC system in terms of suggestion acceptance rate, when deployed on the search bar of an online shopping store as part of an A/B test.

引用

页码：3835 / 3844

页数：10

共 37 条

[1] Personalizing Query Auto-completion for Multi-session Tasks
Jiang, Danyang
Cai, Fei
Chen, Honghui
2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY (CCET), 2018, : 203 - 207
[2] EXOS: EXpansion On Session for Enhancing Effectiveness of Query Auto-Completion
Su, Fang-Hsiang
Somaiya, Manas
Mishra, Shrish
Mukherjee, Rajyashree
PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1154 - 1163
[3] Exploring Query Auto-Completion and Click Logs for Contextual-Aware Web Search and Query Suggestion
Li, Liangda
Deng, Hongbo
Dong, Anlei
Chang, Yi
Baeza-Yates, Ricardo
Zha, Hongyuan
PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 539 - 548
[4] Towards Mobile Query Auto-Completion: An Efficient Mobile Application-Aware Approach
Zhang, Aston
Goyal, Amit
Baeza-Yates, Ricardo
Chang, Yi
Han, Jiawei
Gunter, Carl A.
Deng, Hongbo
PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 579 - 590
[5] TIPS: Time-aware Personalised Semantic-based query auto-completion
Tahery, Saedeh
Farzi, Saeed
JOURNAL OF INFORMATION SCIENCE, 2022, 48 (04) : 524 - 543
[6] Learning Parametric Models for Context-Aware Query Auto-Completion via Hawkes Processes
Li, Liangda
Deng, Hongbo
Chen, Jianhui
Chang, Yi
WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 131 - 139
[7] Ranking-Based Autoencoder for Extreme Multi-label Classification
Wang, Bingyu
Chen, Li
Sun, Wei
Qin, Kechen
Li, Kefeng
Zhou, Hui
2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2820 - 2830
[8] A Framework for Multi-Label Learning Using Label Ranking and Correlation
Shaukat, Malik Irfan
Usman, Muhammad
ADVANCES IN DIGITAL TECHNOLOGIES, 2015, 275 : 296 - 303
[9] Extreme Multi-label Loss Functions for Recommendation, Tagging, Ranking & Other Missing Label Applications
Jain, Himanshu
Prabhu, Yashoteja
Varma, Manik
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 935 - 944
[10] Multi-label Learning of Kernel Extreme Learning Machine with Non-Equilibrium Label Completion
Cheng Y.-S.
Zhao D.-W.
Wang Y.-B.
Pei G.-S.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (03): : 719 - 725

← 1 2 3 4 →