Advancing continual lifelong learning in neural information retrieval: Definition, dataset, framework, and empirical evaluation

被引:1
作者
Hou, Jingrui [1 ]
Cosma, Georgina [1 ]
Finke, Axel [2 ]
机构
[1] Loughborough Univ, Sch Sci, Dept Comp Sci, Epinal Way, Loughborough LE11 3TU, Leics, England
[2] Loughborough Univ, Dept Math Sci, Epinal Way, Loughborough LE11 3TU, Leics, England
关键词
Neural information retrieval; Continual learning; Catastrophic forgetting; Topic shift; Data augmentation;
D O I
10.1016/j.ins.2024.121368
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Continual learning refers to the capability of a machine learning model to learn and adapt to new information, without compromising its performance on previously learned tasks. Although several studies have investigated continual learning methods for neural information retrieval (NIR) tasks, a well-defined task definition is still lacking, and it is unclear how typical learning strategies perform in this context. To address this challenge, a systematic task definition of continual NIR is presented, along with a multiple-topic dataset that simulates continuous information retrieval. A comprehensive continual neural information retrieval framework consisting of typical retrieval models and continual learning strategies is then proposed. Empirical evaluations illustrate that the proposed framework can successfully prevent catastrophic forgetting in neural information retrieval and enhance performance on previously learned tasks. The results also indicate that embedding-based retrieval models experience a decline in their continual learning performance as the topic shift distance and dataset volume of new tasks increase. In contrast, pretraining-based models do not show any such correlation. Adopting suitable learning strategies can mitigate the effects of topic shift and data augmentation in continual neural information retrieval.
引用
收藏
页数:17
相关论文
共 46 条
[31]  
Nogueira R, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, P708
[32]  
Pennington Jeffrey., 2014, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, P1532, DOI [10.3115/v1/D14-1162, DOI 10.3115/V1/D14-1162, 10.3115/v1/d14-1162, 10.3115/v1]
[33]   Deep stacked stochastic configuration networks for lifelong learning of non-stationary data streams [J].
Pratama, Mahardhika ;
Wang, Dianhui .
INFORMATION SCIENCES, 2019, 495 :150-174
[34]   iCaRL: Incremental Classifier and Representation Learning [J].
Rebuffi, Sylvestre-Alvise ;
Kolesnikov, Alexander ;
Sperl, Georg ;
Lampert, Christoph H. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5533-5542
[35]  
Rolnick David, Experience Replay for Continual Learning
[36]   Deep continual hashing with gradient-aware memory for cross-modal retrieval [J].
Song, Ge ;
Tan, Xiaoyang ;
Yang, Ming .
PATTERN RECOGNITION, 2023, 137
[37]   Continual learning in cross-modal retrieval [J].
Wang, Kai ;
Herranz, Luis ;
van de Weijer, Joost .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :3623-3633
[38]   A Comprehensive Survey of Continual Learning: Theory, Method and Application [J].
Wang, Liyuan ;
Zhang, Xingxing ;
Su, Hang ;
Zhu, Jun .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) :5362-5383
[39]  
Wolf T, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING: SYSTEM DEMONSTRATIONS, P38
[40]  
Wu J., 2023, 2023 IEEE INT C BIG, P2247, DOI [10.1109/BigData55660.2023.00112〉, DOI 10.1109/BIGDATA55660.2023.00112, DOI 10.1109/BIGDATA59044.2023.10386743]