Class incremental named entity recognition without forgetting

被引:0
作者
Liu, Ye [1 ]
Huang, Shaobin [1 ]
Wei, Chi [1 ]
Tian, Sicheng [1 ]
Li, Rongsheng [1 ]
Yan, Naiyu [1 ]
Du, Zhijuan [2 ,3 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Inner Mongolia Univ, Hohhot, Peoples R China
[3] Minist Educ, Engn Res Ctr Ecol Big Data, Beijing, Peoples R China
关键词
Class incremental learning; Named entity recognition; Multi-model framework; Continual learning;
D O I
10.1007/s10115-024-02220-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class Incremental Named Entity Recognition (CINER) needs to learn new entity classes without forgetting old entity classes under the setting where the data only contain annotations for new entity classes. As is well known, the forgetting problem is the biggest challenge in Class Incremental Learning (CIL). In the CINER scenario, the unlabeled old class entities will further aggravate the forgetting problem. The current CINER method based on a single model cannot completely avoid the forgetting problem and is sensitive to the learning order of entity classes. To this end, we propose a Multi-Model (MM) framework that trains a new model for each incremental step and uses all the models for inference. In MM, each model only needs to learn the entity classes included in corresponding step, so MM has no forgetting problem and is robust to the different entity class learning orders. Furthermore, we design an error-correction training strategy and conflict-handling rules for MM to further improve performance. We evaluate MM on CoNLL-03 and OntoNotes-V5, and the experimental results show that our framework outperforms the current state-of-the-art (SOTA) methods by a large margin.
引用
收藏
页码:301 / 324
页数:24
相关论文
共 50 条
[31]   Multilingual Transformers for Named Entity Recognition [J].
Viksna, Rinalds ;
Skadin, Inguna .
BALTIC JOURNAL OF MODERN COMPUTING, 2022, 10 (03) :457-469
[32]   Named Entity Recognition and transliteration in Bengali [J].
Ekbal, Asif ;
Naskar, Sudip Kumar ;
Bandyopadhyay, Sivaji .
LINGUISTICAE INVESTIGATIONES, 2007, 30 (01) :95-114
[33]   Named Entity Recognition for Defense Industry [J].
Tanrisever, Ozer ;
Ayan, Emre Tolga ;
Zengin, Muhammed Said ;
Duru, Haci Ali ;
Bardak, Batuhan .
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
[34]   Nested Named Entity Recognition: A Survey [J].
Wang, Yu ;
Tong, Hanghang ;
Zhu, Ziye ;
Li, Yun .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (06)
[35]   A Named Entity Recognition Dataset for Turkish [J].
Kucuk, Dilek ;
Kucuk, Dogan ;
Arici, Nursal .
2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, :329-332
[36]   Towards Bangla Named Entity Recognition [J].
Chowdhury, Shammur Absar ;
Alam, Firoj ;
Khan, Naira .
2018 21ST INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2018,
[37]   Named Entity Recognition as Graph Classification [J].
Harrando, Ismail ;
Troncy, Raphael .
SEMANTIC WEB: ESWC 2021 SATELLITE EVENTS, 2021, 12739 :103-108
[38]   A review of Chinese named entity recognition [J].
Cheng, Jieren ;
Liu, Jingxin ;
Xu, Xinbin ;
Xia, Dongwan ;
Liu, Le ;
Sheng, Victor S. .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (06) :2012-2030
[39]   On the Use of Parsing for Named Entity Recognition [J].
Alonso, Miguel A. ;
Gomez-Rodriguez, Carlos ;
Vilares, Jesus .
APPLIED SCIENCES-BASEL, 2021, 11 (03) :1-24
[40]   Latent semantics in Named Entity Recognition [J].
Konkol, Michal ;
Brychcin, Tomas ;
Konopik, Miloslav .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (07) :3470-3479