Advancing continual lifelong learning in neural information retrieval: Definition, dataset, framework, and empirical evaluation

被引:1
作者
Hou, Jingrui [1 ]
Cosma, Georgina [1 ]
Finke, Axel [2 ]
机构
[1] Loughborough Univ, Sch Sci, Dept Comp Sci, Epinal Way, Loughborough LE11 3TU, Leics, England
[2] Loughborough Univ, Dept Math Sci, Epinal Way, Loughborough LE11 3TU, Leics, England
关键词
Neural information retrieval; Continual learning; Catastrophic forgetting; Topic shift; Data augmentation;
D O I
10.1016/j.ins.2024.121368
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Continual learning refers to the capability of a machine learning model to learn and adapt to new information, without compromising its performance on previously learned tasks. Although several studies have investigated continual learning methods for neural information retrieval (NIR) tasks, a well-defined task definition is still lacking, and it is unclear how typical learning strategies perform in this context. To address this challenge, a systematic task definition of continual NIR is presented, along with a multiple-topic dataset that simulates continuous information retrieval. A comprehensive continual neural information retrieval framework consisting of typical retrieval models and continual learning strategies is then proposed. Empirical evaluations illustrate that the proposed framework can successfully prevent catastrophic forgetting in neural information retrieval and enhance performance on previously learned tasks. The results also indicate that embedding-based retrieval models experience a decline in their continual learning performance as the topic shift distance and dataset volume of new tasks increase. In contrast, pretraining-based models do not show any such correlation. Adopting suitable learning strategies can mitigate the effects of topic shift and data augmentation in continual neural information retrieval.
引用
收藏
页数:17
相关论文
共 46 条
[21]   Overcoming Catastrophic Forgetting in Continual Learning by Exploring Eigenvalues of Hessian Matrix [J].
Kong, Yajing ;
Liu, Liu ;
Chen, Huanhuan ;
Kacprzyk, Janusz ;
Tao, Dacheng .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) :16196-16210
[22]  
Lee S, 2021, P MACHINE LEARNING R, V139
[23]   AdaER: An adaptive experience replay approach for continual lifelong learning [J].
Li, Xingyu ;
Tang, Bo ;
Li, Haifeng .
NEUROCOMPUTING, 2024, 572
[24]   Lifelong machine learning: a paradigm for continuous learning [J].
Liu, Bing .
FRONTIERS OF COMPUTER SCIENCE, 2017, 11 (03) :359-361
[25]  
Liu XL, 2018, INT C PATT RECOG, P2262, DOI 10.1109/ICPR.2018.8545895
[26]  
Lopez-Paz D, 2017, ADV NEUR IN, V30
[27]   Studying Catastrophic Forgetting in Neural Ranking Models [J].
Lovon-Melgarejo, Jesus ;
Soulier, Laure ;
Pinel-Sauvagnat, Karen ;
Tamine, Lynda .
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2021, PT I, 2021, 12656 :375-390
[28]   Target layer regularization for continual learning using Cramer-Wold distance [J].
Mazur, Marcin ;
Pustelnik, Lukasz ;
Knop, Szymon ;
Pagacz, Patryk ;
Spurek, Przemyslaw .
INFORMATION SCIENCES, 2022, 609 :1369-1380
[29]   Learning to Match using Local and Distributed Representations of Text for Web Search [J].
Mitra, Bhaskar ;
Diaz, Fernando ;
Craswell, Nick .
PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17), 2017, :1291-1299
[30]  
Nguyen T, 2016, CEUR WORKSHOP P, V1773