Domain-Invariant Feature Progressive Distillation with Adversarial Adaptive Augmentation for Low-Resource Cross-Domain NER

被引:2
|
作者
Zhang, Tao [1 ]
Xia, Congying [1 ]
Liu, Zhiwei [1 ]
Zhao, Shu [2 ]
Peng, Hao [3 ]
Yu, Philip [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, 851 South Morgan St, Chicago, IL 60607 USA
[2] Anhui Univ, Sch Comp Sci & Technol, 111 Jiulong Rd, Hefei 230601, Anhui, Peoples R China
[3] Beihang Univ, Beijing Adv Innovat Ctr Big Data & Brain Comp, 37 Xue Yuan Rd, Beijing 100191, Peoples R China
基金
北京市自然科学基金; 国家重点研发计划;
关键词
NER; adversarial augmentation; cross-domain; domain adaptation; low-resource; knowledge distillation;
D O I
10.1145/3570502
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considering the expensive annotation in Named Entity Recognition (NER), Cross-domain NER enables NER in low-resource target domains with few or without labeled data, by transferring the knowledge of high-resource domains. However, the discrepancy between different domains causes the domain shift problem and hampers the performance of cross-domain NER in low-resource scenarios. In this article, we first propose an adversarial adaptive augmentation, where we integrate the adversarial strategy into a multi-task learner to augment and qualify domain adaptive data. We extract domain-invariant features of the adaptive data to bridge the cross-domain gap and alleviate the label-sparsity problem simultaneously. Therefore, another important component in this article is the progressive domain-invariant feature distillation framework. A multi-grained MMD (Maximum Mean Discrepancy) approach in the framework to extract the multi-level domain invariant features and enable knowledge transfer across domains through the adversarial adaptive data. Advanced Knowledge Distillation (KD) schema processes progressively domain adaptation through the powerful pre-trained language models and multi-level domain invariant features. Extensive comparative experiments over four English and two Chinese benchmarks show the importance of adversarial augmentation and effective adaptation from high-resource domains to low-resource target domains. Comparison with two vanilla and four latest baselines indicates the state-of-the-art performance and superiority confronted with both zero-resource and minimal-resource scenarios.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Domain-Invariant Feature Distillation for Cross-Domain Sentiment Classification
    Hu, Mengting
    Wu, Yike
    Zhao, Shiwan
    Guo, Honglei
    Cheng, Renhong
    Su, Zhong
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5559 - 5568
  • [2] Adaptive Domain-Invariant Feature Extraction for Cross-Domain Linguistic Steganalysis
    Xue, Yiming
    Wu, Jiaxuan
    Ji, Ronghua
    Zhong, Ping
    Wen, Juan
    Peng, Wanli
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 920 - 933
  • [3] Domain-invariant feature learning with label information integration for cross-domain classification
    Jiang L.
    Wu J.
    Zhao S.
    Li J.
    Neural Computing and Applications, 2024, 36 (21) : 13107 - 13126
  • [4] Domain-Invariant Task Optimization for Cross-domain Recommendation
    Liu, Dou
    Hao, Qingbo
    Xiao, Yingyuan
    Zheng, Wenguang
    Wang, Jinsong
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT III, 2024, 14449 : 488 - 499
  • [5] Domain-invariant feature extraction and fusion for cross-domain person re-identification
    Jia, Zhaoqian
    Li, Ye
    Tan, Zhuofu
    Wang, Wenchao
    Wang, Zhiguo
    Yin, Guangqiang
    VISUAL COMPUTER, 2023, 39 (03): : 1205 - 1216
  • [6] Domain-invariant feature extraction and fusion for cross-domain person re-identification
    Zhaoqian Jia
    Ye Li
    Zhuofu Tan
    Wenchao Wang
    Zhiguo Wang
    Guangqiang Yin
    The Visual Computer, 2023, 39 : 1205 - 1216
  • [7] DOMAIN-INVARIANT REGION PROPOSAL NETWORK FOR CROSS-DOMAIN DETECTION
    Yang, Xuebin
    Wan, Shouhong
    Jin, Peiquan
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [8] Cross-Domain Feature Augmentation for Domain Generalization
    Liu, Yingnan
    Zou, Yingtian
    Qiao, Rui
    Liu, Fusheng
    Lee, Mong Li
    Hsu, Wynne
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 1146 - 1154
  • [9] Adversarial Feature Augmentation for Cross-domain Few-Shot Classification
    Hu, Yanxu
    Ma, Andy J.
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 20 - 37
  • [10] Progressive cross-domain knowledge distillation for efficient unsupervised domain adaptive object detection
    Li, Wei
    Li, Lingqiao
    Yang, Huihua
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119