Web-Scale Semantic Product Search with Large Language Models

Cited by: 3
Authors
Muhamed, Aashiq [1]
Srinivasan, Sriram [1]
Teo, Choon-Hui [1]
Cui, Qingjun [1]
Zeng, Belinda [2]
Chilimbi, Trishul [2]
Vishwanathan, S. V. N. [1]
Affiliations
[1] Amazon, Palo Alto, CA 94303 USA
[2] Amazon, Seattle, WA USA
Source
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III | 2023 / Vol. 13937
Keywords
Matching; Retrieval; Search; Pretrained Language Models;
DOI
10.1007/978-3-031-33380-4_6
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Dense embedding-based semantic matching is widely used in e-commerce product search to address the shortcomings of lexical matching, such as sensitivity to spelling variants. Recent advances in BERT-like language model encoders have not, however, found their way into real-time search because of the strict inference latency requirements imposed on e-commerce websites. While bi-encoder BERT architectures enable fast approximate nearest neighbor search, training them effectively on query-product data remains a challenge due to training instabilities and the persistent generalization gap with cross-encoders. In this work, we propose a four-stage training procedure to leverage large BERT-like models for product search while preserving low inference latency. We introduce query-product interaction pre-finetuning to effectively pretrain BERT bi-encoders for matching and improve generalization. Through offline experiments on an e-commerce product dataset, we show that a distilled small BERT-based model (75M params) trained using our approach improves the search relevance metric by up to 23% over a baseline DSSM-based model with similar inference latency. The small model suffers only a 3% drop in the relevance metric compared to the 20x larger teacher. We also show, using online A/B tests at scale, that our approach improves over the production model in exact and substitute products retrieved.
Pages: 73-85
Number of pages: 13