Adversarial Adaptation for French Named Entity Recognition

被引:0
作者
Choudhry, Arjun [1 ,2 ]
Khatri, Inder [1 ]
Gupta, Pankaj [1 ]
Gupta, Aaryan [1 ]
Nicol, Maxime
Meurs, Marie-Jean [2 ]
Vishwakarma, Dinesh Kumar [1 ]
机构
[1] Delhi Technol Univ, Biometr Res Lab, New Delhi, India
[2] Univ Quebec Montreal, IKB Lab, Montreal, PQ, Canada
来源
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT II | 2023年 / 13981卷
关键词
Named entity recognition; Adversarial adaptation; Transformer; Limited resource languages; Large-scale corpora;
D O I
10.1007/978-3-031-28238-6_28
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) is the task of identifying and classifying named entities in large-scale texts into predefined classes. NER in French and other relatively limited-resource languages cannot always benefit from approaches proposed for languages like English due to a dearth of large, robust datasets. In this paper, we present our work that aims to mitigate the effects of this dearth of large, labeled datasets. We propose a Transformer-based NER approach for French, using adversarial adaptation to similar domain or general corpora to improve feature extraction and enable better generalization. Our approach allows learning better features using large-scale unlabeled corpora from the same domain or mixed domains to introduce more variations during training and reduce overfitting. Experimental results on three labeled datasets show that our adaptation framework outperforms the corresponding non-adaptive models for various combinations of Transformer models, source datasets, and target corpora. We also show that adversarial adaptation to large-scale unlabeled corpora can help mitigate the performance dip incurred on using Transformer models pre-trained on smaller corpora.
引用
收藏
页码:386 / 395
页数:10
相关论文
共 27 条
[1]  
[Anonymous], 2016, P C N AM CHAPT ASS C
[2]  
Choudhry A, 2022, Arxiv, DOI [arXiv:2212.03692, 10.48550/ARXIV.2212.03692, DOI 10.48550/ARXIV.2212.03692]
[3]  
Copara J., 2020, ACT 6 C CONJ JOURN E, P36
[4]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5]  
Ganin Y, 2015, Arxiv, DOI [arXiv:1409.7495, DOI 10.48550/ARXIV.1409.7495, 10.48550/arXiv.1409.7495]
[6]  
Ganin Y, 2016, J MACH LEARN RES, V17
[7]  
Gong C, 2019, DEStech Transactions on Computer Science and Engineering cisnrc.
[8]   Arabic Named Entity Recognition: A Bidirectional GRU-CRF Approach [J].
Gridach, Mourad ;
Haddad, Hatem .
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2017), PT I, 2018, 10761 :264-275
[9]   Deep learning with word embeddings improves biomedical named entity recognition [J].
Habibi, Maryam ;
Weber, Leon ;
Neves, Mariana ;
Wiegandt, David Luis ;
Leser, Ulf .
BIOINFORMATICS, 2017, 33 (14) :I37-I48
[10]  
Le H, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P2479