Intent Term Weighting in E-Commerce Queries

被引:7
作者
Manchanda, Saurav [1 ,2 ]
Sharma, Mohit [2 ]
Karypis, George [1 ]
机构
[1] Univ Minnesota, Minneapolis, MN 55455 USA
[2] WalmartLabs, Sunnyvale, CA USA
来源
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19) | 2019年
基金
美国国家科学基金会;
关键词
Term weighting; query intent; query refinement; query reformulation;
D O I
10.1145/3357384.3358151
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
E-commerce search engines can fail to retrieve results that satisfy a query's product intent because: (i) conventional retrieval approaches, such as BM25, may ignore the important terms in queries owing to their low inverse document frequency (IDF), and (ii) for long queries, as is usually the case in rare queries (i.e., tail queries), they may fail to determine the relevant terms that are representative of the query's product intent. In this paper, we lever-age the historical query reformulation logs of a large e-retailer (walmart.com) to develop a distant-supervision-based approach to identify the relevant terms that characterize the query's product intent. The key idea underpinning our approach is that the terms retained in the reformulation of a query are more important in describing the query's product intent than the discarded terms. Additionally, we also use the fact that the significance of a term depends on its context (other terms in the neighborhood) in the query to determine the term's importance towards the query's product intent. We show that identifying and emphasizing the terms that define the query's product intent leads to a 3% improvement in ranking and outperforms the context-unaware baselines.
引用
收藏
页码:2345 / 2348
页数:4
相关论文
共 50 条
[41]   Query Term Ranking based on Dependency Parsing of Verbose Queries [J].
Park, Jae-Hyun ;
Croft, W. Bruce .
SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, :829-830
[42]   Information-theoretic Term Weighting Schemes for Document Clustering [J].
Ke, Weimao .
JCDL'13: PROCEEDINGS OF THE 13TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES, 2013, :143-152
[43]   Hybridized term-weighting method for Dark Web classification [J].
Sabbah, Thabit ;
Selamat, Ali ;
Selamat, Md. Hafiz ;
Ibrahim, Roliana ;
Fujita, Hamido .
NEUROCOMPUTING, 2016, 173 :1908-1926
[44]   The importance of Term Weighting in semantic understanding of text: A review of techniques [J].
Rathi, R. N. ;
Mustafi, A. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (07) :9761-9783
[45]   Supervised and Traditional Term Weighting Methods for Automatic Text Categorization [J].
Lan, Man ;
Tan, Chew Lim ;
Su, Jian ;
Lu, Yue .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (04) :721-735
[46]   A Note on the Effect of Term Weighting on Selecting Intrinsic Dimensionality of Data [J].
Kumar, Ch. Aswani ;
Srinivas, S. .
CYBERNETICS AND INFORMATION TECHNOLOGIES, 2009, 9 (01) :5-12
[47]   On entropy-based term weighting schemes for text categorization [J].
Tao Wang ;
Yi Cai ;
Ho-fung Leung ;
Raymond Y. K. Lau ;
Haoran Xie ;
Qing Li .
Knowledge and Information Systems, 2021, 63 :2313-2346
[48]   Addressing Diverse Corpora With Cluster-Based Term Weighting [J].
Organisciak, Peter .
JCDL'13: PROCEEDINGS OF THE 13TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES, 2013, :163-166
[49]   The importance of Term Weighting in semantic understanding of text: A review of techniques [J].
R. N. Rathi ;
A. Mustafi .
Multimedia Tools and Applications, 2023, 82 :9761-9783
[50]   On entropy-based term weighting schemes for text categorization [J].
Wang, Tao ;
Cai, Yi ;
Leung, Ho-fung ;
Lau, Raymond Y. K. ;
Xie, Haoran ;
Li, Qing .
KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (09) :2313-2346