Domain-Invariant Feature Progressive Distillation with Adversarial Adaptive Augmentation for Low-Resource Cross-Domain NER

Cited by: 2

Authors:

Zhang, Tao [1]

Xia, Congying [1]

Liu, Zhiwei [1]

Zhao, Shu [2]

Peng, Hao [3]

Yu, Philip [1]

Affiliations:

[1] Univ Illinois, Dept Comp Sci, 851 South Morgan St, Chicago, IL 60607 USA

[2] Anhui Univ, Sch Comp Sci & Technol, 111 Jiulong Rd, Hefei 230601, Anhui, Peoples R China

[3] Beihang Univ, Beijing Adv Innovat Ctr Big Data & Brain Comp, 37 Xue Yuan Rd, Beijing 100191, Peoples R China

Source:

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING | 2023, Vol. 22, Issue 03

Funding:

National Key Research and Development Program of China; Beijing Natural Science Foundation;

Keywords:

NER; adversarial augmentation; cross-domain; domain adaptation; low-resource; knowledge distillation;

DOI:

10.1145/3570502

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Because annotation for Named Entity Recognition (NER) is expensive, cross-domain NER transfers knowledge from high-resource domains to enable NER in low-resource target domains that have little or no labeled data. However, the discrepancy between domains causes a domain shift problem that hampers cross-domain NER performance in low-resource scenarios. In this article, we first propose an adversarial adaptive augmentation, in which we integrate an adversarial strategy into a multi-task learner to augment and qualify domain-adaptive data. We then extract domain-invariant features of the adaptive data to bridge the cross-domain gap and alleviate the label-sparsity problem simultaneously. The second key component of this article is therefore a progressive domain-invariant feature distillation framework. Within the framework, a multi-grained MMD (Maximum Mean Discrepancy) approach extracts multi-level domain-invariant features and enables knowledge transfer across domains through the adversarial adaptive data. An advanced Knowledge Distillation (KD) schema then performs domain adaptation progressively, drawing on powerful pre-trained language models and the multi-level domain-invariant features. Extensive comparative experiments on four English and two Chinese benchmarks show the importance of adversarial augmentation and of effective adaptation from high-resource domains to low-resource target domains. Comparison with two vanilla and four recent baselines indicates state-of-the-art performance in both zero-resource and minimal-resource scenarios.


Pages: 21

