Web-Scale Semantic Product Search with Large Language Models

Cited by: 3

Authors:

Muhamed, Aashiq [1]

Srinivasan, Sriram [1]

Teo, Choon-Hui [1]

Cui, Qingjun [1]

Zeng, Belinda [2]

Chilimbi, Trishul [2]

Vishwanathan, S. V. N. [1]

Affiliations:

[1] Amazon, Palo Alto, CA 94303 USA

[2] Amazon, Seattle, WA USA

Source:

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT III | 2023 / Vol. 13937

Keywords:

Matching; Retrieval; Search; Pretrained Language Models

DOI:

10.1007/978-3-031-33380-4_6

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Dense embedding-based semantic matching is widely used in e-commerce product search to address the shortcomings of lexical matching, such as sensitivity to spelling variants. Recent advances in BERT-like language model encoders have, however, not found their way to real-time search because of the strict inference latency requirements imposed on e-commerce websites. While bi-encoder BERT architectures enable fast approximate nearest neighbor search, training them effectively on query-product data remains a challenge due to training instabilities and the persistent generalization gap with cross-encoders. In this work, we propose a four-stage training procedure to leverage large BERT-like models for product search while preserving low inference latency. We introduce query-product interaction pre-finetuning to effectively pretrain BERT bi-encoders for matching and improve generalization. Through offline experiments on an e-commerce product dataset, we show that a distilled small BERT-based model (75M params) trained using our approach improves the search relevance metric by up to 23% over a baseline DSSM-based model with similar inference latency. The small model suffers only a 3% drop in the relevance metric compared to the 20x larger teacher. We also show, using online A/B tests at scale, that our approach improves over the production model in exact and substitute products retrieved.
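The latency argument in the abstract rests on the bi-encoder layout: products are embedded once offline, and only the query is encoded at request time. The sketch below illustrates that split and nothing more. It is not the paper's model or training procedure: the `embed` function is a toy stand-in (hashed character trigrams) for a learned encoder, the names `embed`, `build_index`, `search`, and `DIM` are hypothetical, and the exhaustive scan stands in for an approximate nearest neighbor index.

```python
import math
import zlib

DIM = 64  # embedding size; an arbitrary choice for this toy example


def embed(text: str) -> list[float]:
    """Toy stand-in for a learned encoder: hashed character trigrams, L2-normalised."""
    vec = [0.0] * DIM
    padded = f"#{text.lower()}#"
    for i in range(len(padded) - 2):
        bucket = zlib.crc32(padded[i:i + 3].encode()) % DIM
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def build_index(products: list[str]) -> list[tuple[str, list[float]]]:
    """Offline step: encode every product once, independently of any query."""
    return [(p, embed(p)) for p in products]


def search(query: str, index: list[tuple[str, list[float]]], k: int = 2) -> list[tuple[str, float]]:
    """Online step: encode only the query, then rank products by dot product.

    This scans every product; a real system would use an approximate
    nearest neighbor index instead of an exhaustive scan.
    """
    q = embed(query)
    scored = [(p, sum(a * b for a, b in zip(q, v))) for p, v in index]
    return sorted(scored, key=lambda s: s[1], reverse=True)[:k]


index = build_index(["wireless mouse", "running shoes", "usb keyboard"])
results = search("wireles mouse", index)  # misspelled query on purpose
```

Because the query and product sides never interact before the dot product, the product vectors can be precomputed and indexed. A cross-encoder, by contrast, would have to run the model on every query-product pair at request time.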

Citation

Pages: 73-85

Page count: 13


