Intent Term Weighting in E-Commerce Queries

被引:7
作者
Manchanda, Saurav [1 ,2 ]
Sharma, Mohit [2 ]
Karypis, George [1 ]
机构
[1] Univ Minnesota, Minneapolis, MN 55455 USA
[2] WalmartLabs, Sunnyvale, CA USA
来源
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19) | 2019年
基金
美国国家科学基金会;
关键词
Term weighting; query intent; query refinement; query reformulation;
D O I
10.1145/3357384.3358151
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
E-commerce search engines can fail to retrieve results that satisfy a query's product intent because: (i) conventional retrieval approaches, such as BM25, may ignore the important terms in queries owing to their low inverse document frequency (IDF), and (ii) for long queries, as is usually the case in rare queries (i.e., tail queries), they may fail to determine the relevant terms that are representative of the query's product intent. In this paper, we lever-age the historical query reformulation logs of a large e-retailer (walmart.com) to develop a distant-supervision-based approach to identify the relevant terms that characterize the query's product intent. The key idea underpinning our approach is that the terms retained in the reformulation of a query are more important in describing the query's product intent than the discarded terms. Additionally, we also use the fact that the significance of a term depends on its context (other terms in the neighborhood) in the query to determine the term's importance towards the query's product intent. We show that identifying and emphasizing the terms that define the query's product intent leads to a 3% improvement in ranking and outperforms the context-unaware baselines.
引用
收藏
页码:2345 / 2348
页数:4
相关论文
共 50 条
  • [21] Constrained Clustering with Seeds and Term Weighting Scheme
    Buatoom, Uraiwan
    Kongprawechnon, Waree
    Theeramunkong, Thanaruk
    2018 THIRTEENTH INTERNATIONAL CONFERENCE ON KNOWLEDGE, INFORMATION AND CREATIVITY SUPPORT SYSTEMS (KICSS), 2018, : 99 - 104
  • [22] Comparison of term weighting schemes for document classification
    Jeong, Ho Young
    Shin, Sang Min
    Choi, Yong-Seok
    KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (02) : 265 - 276
  • [23] On Term Frequency Factor in Supervised Term Weighting Schemes for Text Classification
    Dogan, Turgut
    Uysal, Alper Kursat
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) : 9545 - 9560
  • [24] Term weighting for information retrieval based on term's discrimination power
    Li, Qing
    Lee, Seungwoo
    Jung, Hanmin
    Lee, Yeong Su
    Cho, Jae-Hyun
    Song, Sa-kwang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 71 (02) : 769 - 781
  • [25] On Term Frequency Factor in Supervised Term Weighting Schemes for Text Classification
    Turgut Dogan
    Alper Kursat Uysal
    Arabian Journal for Science and Engineering, 2019, 44 : 9545 - 9560
  • [26] Term weighting for information retrieval based on term’s discrimination power
    Qing Li
    Seungwoo Lee
    Hanmin Jung
    Yeong Su Lee
    Jae-Hyun Cho
    Sa-kwang Song
    Multimedia Tools and Applications, 2014, 71 : 769 - 781
  • [27] Using modified term frequency to improve term weighting for text classification
    Chen, Long
    Jiang, Liangxiao
    Li, Chaoqun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 101
  • [28] Developing term weighting scheme based on term occurrence ratio for sentiment analysis
    Chirawichitchai, Nivet
    Lecture Notes in Electrical Engineering, 2015, 339 : 737 - 744
  • [29] A NOVEL TERM WEIGHTING SCHEME MIDF FOR TEXT CATEGORIZATION
    Deisy, C.
    Gowri, M.
    Baskar, S.
    Kalaiarasi, S. M. A.
    Ramraj, N.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2010, 5 (01) : 94 - 107
  • [30] RANDOM WALK TERM WEIGHTING FOR IMPROVED TEXT CLASSIFICATION
    Hassan, Samer
    Mihalcea, Rada
    Banea, Carmen
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2007, 1 (04) : 421 - 439