Margin Discrepancy-Based Adversarial Training for Multi-Domain Text Classification

被引:0
作者
Wu, Yuan [1 ]
机构
[1] Jilin Univ, Sch Artificial Intelligence, Changchun, Peoples R China
来源
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024 | 2025年 / 15361卷
关键词
Multi-domain text classification; Adversarial training; Margin discrepancy;
D O I
10.1007/978-981-97-9437-9_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-domain text classification (MDTC) endeavors to harness available resources from correlated domains to enhance the classification accuracy of the target domain. Presently, most MDTC approaches that embrace adversarial training and the shared-private paradigm exhibit cutting-edge performance. Unfortunately, these methods face a non-negligible challenge: the absence of theoretical guarantees in the design of MDTC algorithms. The dearth of theoretical underpinning poses a substantial impediment to the advancement of MDTC algorithms. To tackle this problem, we first provide a theoretical analysis of MDTC by decomposing the MDTC task into multiple domain adaptation tasks. We incorporate the margin discrepancy as the measure of domain divergence and establish a new generalization bound based on Rademacher complexity. Subsequently, we propose a margin discrepancy-based adversarial training (MDAT) approach for MDTC, in accordance with our theoretical analysis. To validate the efficacy of the proposed MDAT method, we conduct empirical studies on two MDTC benchmarks. The experimental results demonstrate that our MDAT approach surpasses state-of-the-art baselines on both datasets.
引用
收藏
页码:170 / 182
页数:13
相关论文
共 25 条
  • [1] [Anonymous], 2007, P 45 ANN M ASS COMPU
  • [2] [Anonymous], 2008, Proceedings of ACL-08: HLT
  • [3] Domain Adaptation on the Statistical Manifold
    Baktashmotlagh, Mahsa
    Harandi, Mehrtash T.
    Lovell, Brian C.
    Salzmann, Mathieu
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2481 - 2488
  • [4] A theory of learning from different domains
    Ben-David, Shai
    Blitzer, John
    Crammer, Koby
    Kulesza, Alex
    Pereira, Fernando
    Vaughan, Jennifer Wortman
    [J]. MACHINE LEARNING, 2010, 79 (1-2) : 151 - 175
  • [5] Ben-David Shai, 2007, Advances in neural information processing systems, P137, DOI DOI 10.7551/MITPRESS/7503.003.0022
  • [6] Bousmalis K., 2016, Advances in neural information processing systems, DOI DOI 10.48550/ARXIV.1608.06019
  • [7] Chen X., 2018, P 2018 C N AM CHAPT, V1, DOI [10.18653/v1/N18-1111, DOI 10.18653/V1/N18-1111]
  • [8] Collobert R., 2008, P 25 INT C MACH LEAR, P160, DOI DOI 10.1145/1390156.1390177
  • [9] Farahani A., 2021, ADV DAT SCI INF ENG, P877, DOI DOI 10.1007/978-3-030-71704-9_65
  • [10] Ganin Y, 2016, J MACH LEARN RES, V17