Studying Catastrophic Forgetting in Neural Ranking Models

被引：5

作者：

Lovon-Melgarejo, Jesus ^{[1
]}

Soulier, Laure ^{[2
]}

Pinel-Sauvagnat, Karen ^{[1
]}

Tamine, Lynda ^{[1
]}

机构：

[1] Univ Paul Sabatier, IRIT, Toulouse, France

[2] Sorbonne Univ, CNRS, LIP6, F-75005 Paris, France

来源：

ADVANCES IN INFORMATION RETRIEVAL, ECIR 2021, PT I | 2021年 / 12656卷

关键词：

Neural ranking; Catastrophic forgetting; Lifelong learning;

D O I：

10.1007/978-3-030-72113-8_25

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Several deep neural ranking models have been proposed in the recent IR literature. While their transferability to one target domain held by a dataset has been widely addressed using traditional domain adaptation strategies, the question of their cross-domain transferability is still under-studied. We study here in what extent neural ranking models catastrophically forget old knowledge acquired from previously observed domains after acquiring new knowledge, leading to performance decrease on those domains. Our experiments show that the effectiveness of neural IR ranking models is achieved at the cost of catastrophic forgetting and that a lifelong learning strategy using a cross-domain regularizer successfully mitigates the problem. Using an explanatory approach built on a regression model, we also show the effect of domain characteristics on the rise of catastrophic forgetting. We believe that the obtained results can be useful for both theoretical and practical future work in neural IR.

引用

页码：375 / 390

页数：16

共 53 条

[1]

Rusu AA, 2016, Arxiv, DOI [arXiv:1606.04671, DOI 10.43550/ARXIV:1606.04671, DOI 10.48550/ARXIV.1606.04671]

[2] Impact of Data Characteristics on Recommender Systems Performance [J].

Adomavicius, Gediminas ;

Zhang, Jingjing .

ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2012, 3 (01)

[3]

Asghar N, 2020, Arxiv, DOI arXiv:1811.00239

[4]

Bajaj P, 2018, Arxiv, DOI [arXiv:1611.09268, DOI 10.48550/ARXIV.1611.09268]

[5]

Bengio Y, 2011, P MACH LEARN RES JUN, V27, P17

[6]

Cai HY, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P1793

[7]

Chen Z., 2018, Synthesis Lectures on Artificial Intelligence and Machine Learning, V12, DOI [DOI 10.2200/S00737ED1V01Y201610AIM033, 10.2200/S00832ED1V01Y201802AIM037, DOI 10.2200/S00832ED1V01Y201802AIM037]

[8] Cross Domain Regularization for Neural Ranking Models using Adversarial Learning [J].

Cohen, Daniel ;

Mitra, Bhaskar ;

Hofmann, Katja ;

Croft, W. Bruce .

ACM/SIGIR PROCEEDINGS 2018, 2018, :1025-1028

[9]

d'Autume CD, 2019, Arxiv, DOI arXiv:1906.01076

[10] Neural Ranking Models with Weak Supervision [J].

Dehghani, Mostafa ;

Zamani, Hamed ;

Severyn, Aliaksei ;

Kamps, Jaap ;

Croft, W. Bruce .

SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, :65-74

← 1 2 3 4 5 6 →