Effective Lightweight Learning-to-Rank Method Using Unified Term Impacts

被引：2

作者：

Silva, Sheila de N. ^{[1
]}

de Moura, Edleno S. ^{[1
]}

Calado, Pavel P. ^{[2
]}

da Silva, Altigran S. ^{[1
]}

机构：

[1] Univ Fed Amazonas, Inst Comp, BR-69067005 Manaus, Amazonas, Brazil

[2] Inst Super Tecn, INESC ID, P-1049001 Lisbon, Portugal

来源：

IEEE ACCESS | 2020年 / 8卷

关键词：

Query processing; Indexing; Computational modeling; Search engines; Computational efficiency; Boosting; Gradient boosting; indexing; LambdaMART; learning-to-rank; search engines;

D O I：

10.1109/ACCESS.2020.2986943

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this study, we propose and evaluate a novel learning-to-rank (L2R) approach that produces results on par with those of the state-of-the-art L2R methods while being computationally effective. We start by presenting a modified gradient boosted regression tree algorithm to generate unified term impact (UTI) values at indexing time. Each unified term impact replaces several features with a single value in the document index, thereby reducing the effort to compute the document scores at query processing time because the system fetches and processes fewer values. The adoption of UTI values produces competitive ranking results. However, the lack of features available only at query time might lead to accuracy loss. To solve this problem, we propose a hybrid model that uses UTI values with query-dependent features. We demonstrate that our hybrid methods can deliver high-quality results on par with those of the existing state-of-the-art neural ranking models. Our methods can also reduce the computational costs for processing queries, serving as an interesting alternative for L2R practical applications. Our best hybrid, HLambdaMART, achieves an NDCG@10 value of 0.495 using only 36 features at query processing time when applied to the MQ2007 collection, while the best baseline achieves 0.490 using a larger set of features at query processing time. The use of our hybrid framework reduces the time to run LambdaMART to about 35% of the time to run it without using our proposals. In summary, we present a competitive and lightweight alternative L2R approach to be adopted in search systems.

引用

页码：70420 / 70437

页数：18

共 50 条

[1] Document Selection Methodologies for Efficient and Effective Learning-to-Rank
Aslam, Javed A.
Kanoulas, Evangelos
Pavlu, Virgil
Savev, Stefan
Yilmaz, Emine
PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 468 - 475
[2] A learning-to-rank method for information updating task
Minh Quang Nhat Pham
Minh Le Nguyen
Bach Xuan Ngo
Akira Shimazu
Applied Intelligence, 2012, 37 : 499 - 510
[3] A learning-to-rank method for information updating task
Minh Quang Nhat Pham
Minh Le Nguyen
Bach Xuan Ngo
Shimazu, Akira
APPLIED INTELLIGENCE, 2012, 37 (04) : 499 - 510
[4] Is Interpretable Machine Learning Effective at Feature Selection for Neural Learning-to-Rank?
Lyu, Lijun
Roy, Nirmal
Oosterhuis, Harrie
Anand, Avishek
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT IV, 2024, 14611 : 384 - 402
[5] Horse race rank prediction using learning-to-rank approaches
Chung, Junhyoung
Shin, Donguk
Hwang, Seyong
Park, Gunwoong
KOREAN JOURNAL OF APPLIED STATISTICS, 2024, 37 (02)
[6] Rax: Composable Learning-to-Rank Using JAX
Jagerman, Rolf
Wang, Xuanhui
Zhuang, Honglei
Qin, Zhen
Bendersky, Michael
Najork, Marc
Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2022, : 3051 - 3060
[7] Rax: Composable Learning-to-Rank using JAX
Jagerman, Rolf
Wang, Xuanhui
Zhuang, Honglei
Qin, Zhen
Bendersky, Michael
Najork, Marc
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3051 - 3060
[8] ListBM: A Learning-to-Rank Method for XML Keyword Search
Gao, Nmg
Deng, Zhi-Hong
Xiang, Yong-Qing
Hang, Yu
FOCUSED RETRIEVAL AND EVALUATION, 2010, 6203 : 81 - 87
[9] Feature Selection for Learning-to-Rank using Simulated Annealing
Allvi, Mustafa Wasif
Hasan, Mahamudul
Rayon, Lazim
Shahabuddin, Mohammad
Khan, Md Mosaddek
Ibrahim, Muhammad
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 699 - 705
[10] RankFormer: Listwise Learning-to-Rank Using Listwide Labels
Buyl, Maarten
Missault, Paul
Sondag, Pierre-Antoine
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3762 - 3773

← 1 2 3 4 5 →