Web-Scale Semantic Product Search with Large Language Models

被引:3
作者
Muhamed, Aashiq [1 ]
Srinivasan, Sriram [1 ]
Teo, Choon-Hui [1 ]
Cui, Qingjun [1 ]
Zeng, Belinda [2 ]
Chilimbi, Trishul [2 ]
Vishwanathan, S. V. N. [1 ]
机构
[1] Amazon, Palo Alto, CA 94303 USA
[2] Amazon, Seattle, WA USA
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III | 2023年 / 13937卷
关键词
Matching; Retrieval; Search; Pretrained Language Models;
D O I
10.1007/978-3-031-33380-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dense embedding-based semantic matching is widely used in e-commerce product search to address the shortcomings of lexical matching such as sensitivity to spelling variants. The recent advances in BERT-like language model encoders, have however, not found their way to realtime search due to the strict inference latency requirement imposed on e-commerce websites. While bi-encoder BERT architectures enable fast approximate nearest neighbor search, training them effectively on query-product data remains a challenge due to training instabilities and the persistent generalization gap with cross-encoders. In this work, we propose a four-stage training procedure to leverage large BERT-like models for product search while preserving low inference latency. We introduce query-product interaction pre-finetuning to effectively pretrain BERT bi-encoders for matching and improve generalization. Through offline experiments on an e-commerce product dataset, we show that a distilled small BERT-based model (75M params) trained using our approach improves the search relevance metric by up to 23% over a baseline DSSM-based model with similar inference latency. The small model only suffers a 3% drop in relevance metric compared to the 20x larger teacher. We also show using online A/B tests at scale, that our approach improves over the production model in exact and substitute products retrieved.
引用
收藏
页码:73 / 85
页数:13
相关论文
共 50 条
  • [41] Semantic Service Search Engine (S3E): An Approach for Finding Services on the Web
    Giantsiou, Lemonia
    Loutas, Nikolaos
    Peristeras, Vassilios
    Tarabanis, Konstantinos
    VISIONING AND ENGINEERING THE KNOWLEDGE SOCIETY: A WEB SCIENCE PERSPECTIVE, PROCEEDINGS, 2009, 5736 : 316 - +
  • [42] Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding
    He, Mutian
    Garner, Philip N.
    INTERSPEECH 2023, 2023, : 1109 - 1113
  • [43] The Semantic Librarian: A search engine built from vector-space models of semantics
    Aujla, Harinder
    Crump, Matthew J. C.
    Cook, Matthew T.
    Jamieson, Randall K.
    BEHAVIOR RESEARCH METHODS, 2019, 51 (06) : 2405 - 2418
  • [44] Large Language Models for Software Engineering: Survey and Open Problems
    Fan, Angela
    Gokkaya, Beliz
    Harman, Mark
    Lyubarskiy, Mitya
    Sengupta, Shubho
    Yoo, Shin
    Zhang, Jie M.
    2023 IEEE/ACM INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: FUTURE OF SOFTWARE ENGINEERING, ICSE-FOSE, 2023, : 31 - 53
  • [45] How large language models can reshape collective intelligence
    Burton, Jason W.
    Lopez-Lopez, Ezequiel
    Hechtlinger, Shahar
    Rahwan, Zoe
    Aeschbach, Samuel
    Bakker, Michiel A.
    Becker, Joshua A.
    Berditchevskaia, Aleks
    Berger, Julian
    Brinkmann, Levin
    Flek, Lucie
    Herzog, Stefan M.
    Huang, Saffron
    Kapoor, Sayash
    Narayanan, Arvind
    Nussberger, Anne-Marie
    Yasseri, Taha
    Nickl, Pietro
    Almaatouq, Abdullah
    Hahn, Ulrike
    Kurvers, Ralf H. J. M.
    Leavy, Susan
    Rahwan, Iyad
    Siddarth, Divya
    Siu, Alice
    Woolley, Anita W.
    Wulff, Dirk U.
    Hertwig, Ralph
    NATURE HUMAN BEHAVIOUR, 2024, 8 (09): : 1643 - 1655
  • [46] Symbolic Execution with Test Cases Generated by Large Language Models
    Xu, Jiahe
    Xu, Jingwei
    Chen, Taolue
    Ma, Xiaoxing
    2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2024, : 228 - 237
  • [47] Large Scale Medical Image Search via Unsupervised PCA Hashing
    Yu, Xiang
    Zhang, Shaoting
    Liu, Bo
    Zhong, Lin
    Metaxas, Dimitris N.
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 393 - 398
  • [48] On very large scale test collection for landmark image search benchmarking
    Cheng, Zhiyong
    Shen, Jialie
    SIGNAL PROCESSING, 2016, 124 : 13 - 26
  • [49] Large scale similarity search across digital reconstructions of neural morphology
    Ljungquist, Bengt
    Akram, Masood A.
    Ascoli, Giorgio A.
    NEUROSCIENCE RESEARCH, 2022, 181 : 39 - 45
  • [50] Efficient Nearest Neighbors Search for Large-Scale Landmark Recognition
    Magliani, Federico
    Fontanini, Tomaso
    Prati, Andrea
    ADVANCES IN VISUAL COMPUTING, ISVC 2018, 2018, 11241 : 541 - 551