Language-Agnostic Representation Learning for Product Search on E-Commerce Platforms

被引:17
作者
Ahuja, Aman [1 ,2 ]
Rao, Nikhil [2 ]
Katariya, Sumeet [2 ]
Subbian, Karthik [2 ]
Reddy, Chandan K. [1 ]
机构
[1] Virginia Tech, Arlington, VA 22203 USA
[2] Amazon, Palo Alto, CA USA
来源
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20) | 2020年
关键词
Product search; deep learning; E-commerce; multi-task learning; cross-lingual models;
D O I
10.1145/3336191.3371852
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Product search forms an indispensable component of any e-commerce service, and helps customers find products of their interest from a large catalog on these websites. When products that are irrelevant to the search query are surfaced, it leads to a poor customer experience, thus reducing user trust and increasing the likelihood of churn. While identifying and removing such results from product search is crucial, doing so is a burdensome task that requires large amounts of human annotated data to train accurate models. This problem is exacerbated when products are cross-listed across countries that speak multiple languages, and customers specify queries in multiple languages and from different cultural contexts. In this work, we propose a novel multi-lingual multi-task learning framework, to jointly train product search models on multiple languages, with limited amount of training data from each language. By aligning the query and product representations from different languages into a language-independent vector space of queries and products, respectively, the proposed model improves the performance over baseline search models in any given language. We evaluate the performance of our model on real data collected from a leading e-commerce service. Our experimental evaluation demonstrates up to 23% relative improvement in the classification F1-score compared to the state-of-the-art baseline models.
引用
收藏
页码:7 / 15
页数:9
相关论文
共 35 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2018, NAACL HLT SHORT PAPE
[3]  
[Anonymous], 2018, P EMNLP SYST DEM, DOI DOI 10.18653/V1/D18-2012
[4]  
[Anonymous], ARXIV181205774
[5]  
[Anonymous], LEVERAGE IMPLICIT FE
[6]  
[Anonymous], INT C LERN REPR ICLR
[7]  
[Anonymous], 2016, 30 AAAI C ART INT
[8]  
[Anonymous], INT C LERN REPR ICLR
[9]  
[Anonymous], 2012, P 21 ACM INT C INFOR
[10]  
Ballesteros L, 1996, LECT NOTES COMPUT SC, V1134, P791, DOI 10.1007/BFb0034731