Session-Aware Query Auto-completion using Extreme Multi-Label Ranking

被引:7
|
作者
Yadav, Nishant [1 ]
Sen, Rajat [2 ]
Hill, Daniel N. [3 ]
Mazumdar, Arya [4 ]
Dhillon, Inderjit S. [3 ,5 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
[2] Google Res, San Jose, CA USA
[3] Amazon, Seattle, WA USA
[4] Univ Calif San Diego, San Diego, CA USA
[5] Univ Texas Austin, Austin, TX 78712 USA
来源
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2021年
关键词
eXtreme multi-label ranking; auto-complete; session-aware;
D O I
10.1145/3447548.3467087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Query auto-completion (QAC) is a fundamental feature in search engines where the task is to suggest plausible completions of a prefix typed in the search bar. Previous queries in the user session can provide useful context for the user's intent and can be leveraged to suggest auto-completions that are more relevant while adhering to the user's prefix. Such session-aware QACs can be generated by recent sequence-to-sequence deep learning models; however, these generative approaches often do not meet the stringent latency requirements of responding to each user keystroke. Moreover, these generative approaches pose the risk of showing nonsensical queries. One can pre-compute a relatively small subset of relevant queries for common prefixes and rank them based on the context. However, such an approach fails when no relevant queries for the current context are present in the pre-computed set. In this paper, we provide a solution to this problem: we take the novel approach of modeling session-aware QAC as an eXtreme Multi-Label Ranking (XMR) problem where the input is the previous query in the session and the user's current prefix, while the output space is the set of tens of millions of queries entered by users in the recent past. We adapt a popular XMR algorithm for this purpose by proposing several modifications to the key steps in the algorithm. The proposed modifications yield a 10x improvement in terms of Mean Reciprocal Rank (MRR) over the baseline XMR approach on a public search logs dataset. We are able to maintain an inference latency of less than 10 ms while still using session context. When compared against baseline models of acceptable latency, we observed a 33% improvement in MRR for short prefixes of up to 3 characters. Moreover, our model yielded a statistically significant improvement of 2.81% over a production QAC system in terms of suggestion acceptance rate, when deployed on the search bar of an online shopping store as part of an A/B test.
引用
收藏
页码:3835 / 3844
页数:10
相关论文
共 37 条
  • [1] Personalizing Query Auto-completion for Multi-session Tasks
    Jiang, Danyang
    Cai, Fei
    Chen, Honghui
    2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY (CCET), 2018, : 203 - 207
  • [2] EXOS: EXpansion On Session for Enhancing Effectiveness of Query Auto-Completion
    Su, Fang-Hsiang
    Somaiya, Manas
    Mishra, Shrish
    Mukherjee, Rajyashree
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1154 - 1163
  • [3] Exploring Query Auto-Completion and Click Logs for Contextual-Aware Web Search and Query Suggestion
    Li, Liangda
    Deng, Hongbo
    Dong, Anlei
    Chang, Yi
    Baeza-Yates, Ricardo
    Zha, Hongyuan
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, : 539 - 548
  • [4] Towards Mobile Query Auto-Completion: An Efficient Mobile Application-Aware Approach
    Zhang, Aston
    Goyal, Amit
    Baeza-Yates, Ricardo
    Chang, Yi
    Han, Jiawei
    Gunter, Carl A.
    Deng, Hongbo
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, : 579 - 590
  • [5] TIPS: Time-aware Personalised Semantic-based query auto-completion
    Tahery, Saedeh
    Farzi, Saeed
    JOURNAL OF INFORMATION SCIENCE, 2022, 48 (04) : 524 - 543
  • [6] Learning Parametric Models for Context-Aware Query Auto-Completion via Hawkes Processes
    Li, Liangda
    Deng, Hongbo
    Chen, Jianhui
    Chang, Yi
    WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 131 - 139
  • [7] Ranking-Based Autoencoder for Extreme Multi-label Classification
    Wang, Bingyu
    Chen, Li
    Sun, Wei
    Qin, Kechen
    Li, Kefeng
    Zhou, Hui
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2820 - 2830
  • [8] A Framework for Multi-Label Learning Using Label Ranking and Correlation
    Shaukat, Malik Irfan
    Usman, Muhammad
    ADVANCES IN DIGITAL TECHNOLOGIES, 2015, 275 : 296 - 303
  • [9] Extreme Multi-label Loss Functions for Recommendation, Tagging, Ranking & Other Missing Label Applications
    Jain, Himanshu
    Prabhu, Yashoteja
    Varma, Manik
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 935 - 944
  • [10] Multi-label Learning of Kernel Extreme Learning Machine with Non-Equilibrium Label Completion
    Cheng Y.-S.
    Zhao D.-W.
    Wang Y.-B.
    Pei G.-S.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (03): : 719 - 725