DESIRE-ME: Domain-Enhanced Supervised Information Retrieval Using Mixture-of-Experts

被引:0
作者
Kasela, Pranav [1 ,2 ]
Pasi, Gabriella [1 ]
Perego, Raffaele [2 ]
Tonellotto, Nicola [2 ,3 ]
机构
[1] Univ Milano Biocca, Milan, Italy
[2] ISTI CNR, Pisa, Italy
[3] Univ Pisa, Pisa, Italy
来源
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT II | 2024年 / 14609卷
关键词
Open-domain Q&A; Mixture-of-Experts; Domain Specialization;
D O I
10.1007/978-3-031-56060-6_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Open-domain question answering requires retrieval systems able to cope with the diverse and varied nature of questions, providing accurate answers across a broad spectrum of query types and topics. To deal with such topic heterogeneity through a unique model, we propose DESIRE-ME, a neural information retrieval model that leverages the Mixture-of-Experts framework to combine multiple specialized neural models. We rely on Wikipedia data to train an effective neural gating mechanism that classifies the incoming query and that weighs the predictions of the different domain-specific experts correspondingly. This allows DESIRE-ME to specialize adaptively in multiple domains. Through extensive experiments on publicly available datasets, we show that our proposal can effectively generalize domain-enhanced neural models. DESIRE-ME excels in handling open-domain questions adaptively, boosting by up to 12% in NDCG@10 and 22% in P@1, the underlying state-of-the-art dense retrieval model.
引用
收藏
页码:111 / 125
页数:15
相关论文
共 30 条
[1]   ranxhub: An Online Repository for Information Retrieval Runs [J].
Bassani, Elias .
PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, :3210-3214
[2]   ranx: A Blazing-Fast Python']Python Library for Ranking Evaluation and Comparison [J].
Bassani, Elias .
ADVANCES IN INFORMATION RETRIEVAL, PT II, 2022, 13186 :259-264
[3]  
Collobert R., 2001, Advances in Neural Information Processing Systems, V14
[4]  
Dai Damai, 2022, arXiv
[5]  
Dauphin YN, 2017, PR MACH LEARN RES, V70
[6]  
Diggelmann T, 2021, Climate-fever: A dataset for verification of real-world climate claims
[7]  
Eigen David, 2013, INT C LEARN REPR ICL
[8]  
Fedus W, 2022, J MACH LEARN RES, V23
[9]   MIXTURE OF INFORMED EXPERTS FOR MULTILINGUAL SPEECH RECOGNITION [J].
Gaur, Neeraj ;
Farris, Brian ;
Haghani, Parisa ;
Leal, Isabel ;
Moreno, Pedro J. ;
Prasad, Manasa ;
Ramabhadran, Bhuvana ;
Zhu, Yun .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :6234-6238
[10]  
Gururangan Suchin, 2021, Demix layers: Disentangling domains for modular language modeling