Web-Scale Semantic Product Search with Large Language Models

被引:3
作者
Muhamed, Aashiq [1 ]
Srinivasan, Sriram [1 ]
Teo, Choon-Hui [1 ]
Cui, Qingjun [1 ]
Zeng, Belinda [2 ]
Chilimbi, Trishul [2 ]
Vishwanathan, S. V. N. [1 ]
机构
[1] Amazon, Palo Alto, CA 94303 USA
[2] Amazon, Seattle, WA USA
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III | 2023年 / 13937卷
关键词
Matching; Retrieval; Search; Pretrained Language Models;
D O I
10.1007/978-3-031-33380-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dense embedding-based semantic matching is widely used in e-commerce product search to address the shortcomings of lexical matching such as sensitivity to spelling variants. The recent advances in BERT-like language model encoders, have however, not found their way to realtime search due to the strict inference latency requirement imposed on e-commerce websites. While bi-encoder BERT architectures enable fast approximate nearest neighbor search, training them effectively on query-product data remains a challenge due to training instabilities and the persistent generalization gap with cross-encoders. In this work, we propose a four-stage training procedure to leverage large BERT-like models for product search while preserving low inference latency. We introduce query-product interaction pre-finetuning to effectively pretrain BERT bi-encoders for matching and improve generalization. Through offline experiments on an e-commerce product dataset, we show that a distilled small BERT-based model (75M params) trained using our approach improves the search relevance metric by up to 23% over a baseline DSSM-based model with similar inference latency. The small model only suffers a 3% drop in relevance metric compared to the 20x larger teacher. We also show using online A/B tests at scale, that our approach improves over the production model in exact and substitute products retrieved.
引用
收藏
页码:73 / 85
页数:13
相关论文
共 50 条
  • [31] CPM-2: Large-scale cost-effective pre-trained language models
    Zhang, Zhengyan
    Gu, Yuxian
    Han, Xu
    Chen, Shengqi
    Xiao, Chaojun
    Sun, Zhenbo
    Yao, Yuan
    Qi, Fanchao
    Guan, Jian
    Ke, Pei
    Cai, Yanzheng
    Zeng, Guoyang
    Tan, Zhixing
    Liu, Zhiyuan
    Huang, Minlie
    Han, Wentao
    Liu, Yang
    Zhu, Xiaoyan
    Sun, Maosong
    AI OPEN, 2021, 2 : 216 - 224
  • [32] KNOWLEDGE TRANSFER FROM LARGE-SCALE PRETRAINED LANGUAGE MODELS TO END-TO-END SPEECH RECOGNIZERS
    Kubo, Yotaro
    Karita, Shigeki
    Bacchiani, Michiel
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8512 - 8516
  • [33] A Large-Scale Analysis of Variance in Written Language
    Johns, Brendan T.
    Jamieson, Randall K.
    COGNITIVE SCIENCE, 2018, 42 (04) : 1360 - 1374
  • [34] Assessing Phrase Break of ESL Speech with Pre-trained Language Models and Large Language Models
    Wang, Zhiyi
    Mao, Shaoguang
    Wu, Wenshan
    Xia, Yan
    Deng, Yan
    Tien, Jonathan
    INTERSPEECH 2023, 2023, : 4194 - 4198
  • [35] A distributed framework for large-scale semantic trajectory similarity join
    Tian, Ruijie
    Li, Jiajun
    Zhang, Weishi
    Wang, Fei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) : 16205 - 16229
  • [36] A MultikeyRank Model Based on Ontology for Large-Scale Semantic Data
    Jiang Yang
    Feng Zhiyong
    Wang Xin
    CHINESE JOURNAL OF ELECTRONICS, 2014, 23 (01) : 119 - 123
  • [37] Large language models for causal hypothesis generation in science
    Cohrs, Kai-Hendrik
    Diaz, Emiliano
    Sitokonstantinou, Vasileios
    Varando, Gherardo
    Camps-Valls, Gustau
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2025, 6 (01):
  • [38] Large Scale Nearest Neighbors Search Based on Neighborhood Graph
    Zhou, Wenhui
    Yuan, Chunfeng
    Gu, Rong
    Huang, Yihua
    2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 181 - 186
  • [39] Figure search by text in large scale digital document collections
    Yurtsever, M. Mucahit Enes
    Ozcan, Muhammet
    Taruz, Zubeyir
    Eken, Suleyman
    Sayar, Ahmet
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (01)
  • [40] Candidate Selection for Large Scale Personalized Search and Recommender Systems
    Arya, Dhruv
    Venkataraman, Ganesh
    Grover, Aman
    Kenthapadi, Krishnaram
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1391 - 1393