Entity-aware Multi-task Learning for Query Understanding at Walmart

被引:0
作者
Peng, Zhiyuan [1 ]
Dave, Vachik [2 ]
McNabb, Nicole [2 ]
Sharnagat, Rahul [2 ]
Magnani, Alessandro [2 ]
Liao, Ciya [2 ]
Fang, Yi [1 ]
Rajanala, Sravanthi [2 ]
机构
[1] Santa Clara Univ, Santa Clara, CA 95053 USA
[2] Walmart Global Tech, Sunnyvale, CA 94086 USA
来源
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023 | 2023年
关键词
Multi-task learning; Semi-supervised learning; Query Understanding; E-commerce;
D O I
10.1145/3580305.3599816
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query Understanding (QU) is a fundamental process in E-commerce search engines by extracting the shopping intents of customers. It usually includes a set of different tasks such as named entity recognization and query classification. Traditional approaches often tackle each task separately by its own network, which leads to excessive workload for development and maintenance as well as increased latency and resource usage in large-scale E-commerce platforms. To tackle these challenges, this paper presents a multitask learning approach to query understanding at Walmart. We experimented with several state-of-the-art multi-task learning architectures including MTDNN, MMoE, and PLE. Furthermore, we propose a novel large-scale entity-aware multi-task learning model (EAMT)1 by retrieving entities from engagement data as query context to augment the query representation. To the best of our knowledge, there exists no prior work on multi-task learning for E-commerce query understanding. Comprehensive offline experiments are conducted on industry-scale datasets (up to 965M queries) to illustrate the effectiveness of our approach. The results from online experiments show substantial gains in key accuracy and latency metrics.
引用
收藏
页码:4733 / 4742
页数:10
相关论文
共 27 条
  • [1] Cao H, 2007, PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 4, P129
  • [2] Deep Neural Query Understanding System at Expedia Group
    Chandrasekaran, Ramji
    Pathak, Harsh Nilesh
    Yano, Tae
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1476 - 1484
  • [3] Denk Timo I, 2020, ARXIV201015778
  • [4] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [5] Fedus W., 2021, J. Mach. Learn. Res
  • [6] Gupta Shashank, 2022, ARXIV220407689
  • [7] Embracing Change: Continual Learning in Deep Neural Networks
    Hadsell, Raia
    Rao, Dushyant
    Rusu, Andrei A.
    Pascanu, Razvan
    [J]. TRENDS IN COGNITIVE SCIENCES, 2020, 24 (12) : 1028 - 1040
  • [8] Jiang Haoming, 2022, ARXIV221003915
  • [9] Billion-Scale Similarity Search with GPUs
    Johnson, Jeff
    Douze, Matthijs
    Jegou, Herve
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2021, 7 (03) : 535 - 547
  • [10] Li Juanhui, 2022, ARXIV220406522