Effective Lightweight Learning-to-Rank Method Using Unified Term Impacts

Cited by: 2
Authors
Silva, Sheila de N. [1]
de Moura, Edleno S. [1]
Calado, Pavel P. [2]
da Silva, Altigran S. [1]
Affiliations
[1] Univ Fed Amazonas, Inst Comp, BR-69067005 Manaus, Amazonas, Brazil
[2] Inst Super Tecn, INESC ID, P-1049001 Lisbon, Portugal
Keywords
Query processing; Indexing; Computational modeling; Search engines; Computational efficiency; Boosting; Gradient boosting; LambdaMART; learning-to-rank
DOI
10.1109/ACCESS.2020.2986943
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this study, we propose and evaluate a novel learning-to-rank (L2R) approach that produces results on par with those of the state-of-the-art L2R methods while being computationally effective. We start by presenting a modified gradient boosted regression tree algorithm to generate unified term impact (UTI) values at indexing time. Each unified term impact replaces several features with a single value in the document index, thereby reducing the effort to compute the document scores at query processing time because the system fetches and processes fewer values. The adoption of UTI values produces competitive ranking results. However, the lack of features available only at query time might lead to accuracy loss. To solve this problem, we propose a hybrid model that uses UTI values with query-dependent features. We demonstrate that our hybrid methods can deliver high-quality results on par with those of the existing state-of-the-art neural ranking models. Our methods can also reduce the computational costs for processing queries, serving as an interesting alternative for L2R practical applications. Our best hybrid, HLambdaMART, achieves an NDCG@10 value of 0.495 using only 36 features at query processing time when applied to the MQ2007 collection, while the best baseline achieves 0.490 using a larger set of features at query processing time. The use of our hybrid framework reduces the time to run LambdaMART to about 35% of the time to run it without using our proposals. In summary, we present a competitive and lightweight alternative L2R approach to be adopted in search systems.
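The core idea in the abstract, collapsing several per-term features into one precomputed value and adding query-dependent signals only at query time, can be illustrated with a toy sketch. This is not the authors' algorithm: the paper learns impacts with a modified gradient boosted regression tree model and its hybrids feed the impacts into a learned ranker, whereas the sketch below assumes a hand-written linear stand-in (`toy_impact_model`) and a simple additive combination. All names, feature values, and weights are hypothetical.

```python
from collections import defaultdict

# Hypothetical per-(term, document) features that a conventional L2R system
# would fetch and combine at query time. Values are illustrative only.
POSTING_FEATURES = {
    ("rank", "d1"): {"tf": 3.0, "bm25": 1.2, "title_hit": 1.0},
    ("rank", "d2"): {"tf": 1.0, "bm25": 0.4, "title_hit": 0.0},
    ("learn", "d1"): {"tf": 1.0, "bm25": 0.5, "title_hit": 0.0},
    ("learn", "d2"): {"tf": 2.0, "bm25": 0.9, "title_hit": 1.0},
}


def toy_impact_model(features):
    """Assumed stand-in for a learned model; the weights are made up."""
    return 0.1 * features["tf"] + 1.0 * features["bm25"] + 0.5 * features["title_hit"]


def build_impact_index(posting_features, model):
    """Indexing time: replace each feature vector with one impact value."""
    index = defaultdict(dict)
    for (term, doc), feats in posting_features.items():
        index[term][doc] = model(feats)
    return index


def score_query(index, query_terms, query_features=None, query_weight=1.0):
    """Query time: sum one stored value per posting, then optionally add
    query-dependent signals (a simplified, additive take on a hybrid)."""
    scores = defaultdict(float)
    for term in query_terms:
        for doc, impact in index.get(term, {}).items():
            scores[doc] += impact
    for doc, value in (query_features or {}).items():
        scores[doc] += query_weight * value
    # Highest score first.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    index = build_impact_index(POSTING_FEATURES, toy_impact_model)
    print(score_query(index, ["rank", "learn"]))
    # Hypothetical query-dependent feature that favors d2.
    print(score_query(index, ["rank", "learn"], {"d2": 1.0}))
```

In this sketch the query processor reads one number per posting instead of three, which mirrors the cost reduction the abstract describes; the optional `query_features` argument stands in for the features that exist only at query time.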
Pages: 70420-70437
Number of pages: 18
Related papers
  • [1] Document Selection Methodologies for Efficient and Effective Learning-to-Rank
    Aslam, Javed A.
    Kanoulas, Evangelos
    Pavlu, Virgil
    Savev, Stefan
    Yilmaz, Emine
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 468 - 475
  • [2] A learning-to-rank method for information updating task
    Minh Quang Nhat Pham
    Minh Le Nguyen
    Bach Xuan Ngo
    Akira Shimazu
    Applied Intelligence, 2012, 37 : 499 - 510
  • [3] A learning-to-rank method for information updating task
    Minh Quang Nhat Pham
    Minh Le Nguyen
    Bach Xuan Ngo
    Shimazu, Akira
    APPLIED INTELLIGENCE, 2012, 37 (04) : 499 - 510
  • [4] Is Interpretable Machine Learning Effective at Feature Selection for Neural Learning-to-Rank?
    Lyu, Lijun
    Roy, Nirmal
    Oosterhuis, Harrie
    Anand, Avishek
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT IV, 2024, 14611 : 384 - 402
  • [5] Horse race rank prediction using learning-to-rank approaches
    Chung, Junhyoung
    Shin, Donguk
    Hwang, Seyong
    Park, Gunwoong
    KOREAN JOURNAL OF APPLIED STATISTICS, 2024, 37 (02)
  • [6] Rax: Composable Learning-to-Rank Using JAX
    Jagerman, Rolf
    Wang, Xuanhui
    Zhuang, Honglei
    Qin, Zhen
    Bendersky, Michael
    Najork, Marc
    Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2022, : 3051 - 3060
  • [7] Rax: Composable Learning-to-Rank using JAX
    Jagerman, Rolf
    Wang, Xuanhui
    Zhuang, Honglei
    Qin, Zhen
    Bendersky, Michael
    Najork, Marc
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3051 - 3060
  • [8] ListBM: A Learning-to-Rank Method for XML Keyword Search
    Gao, Nmg
    Deng, Zhi-Hong
    Xiang, Yong-Qing
    Hang, Yu
    FOCUSED RETRIEVAL AND EVALUATION, 2010, 6203 : 81 - 87
  • [9] Feature Selection for Learning-to-Rank using Simulated Annealing
    Allvi, Mustafa Wasif
    Hasan, Mahamudul
    Rayon, Lazim
    Shahabuddin, Mohammad
    Khan, Md Mosaddek
    Ibrahim, Muhammad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 699 - 705
  • [10] RankFormer: Listwise Learning-to-Rank Using Listwide Labels
    Buyl, Maarten
    Missault, Paul
    Sondag, Pierre-Antoine
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3762 - 3773