Margin Discrepancy-Based Adversarial Training for Multi-Domain Text Classification

Cited by: 0

Authors:

Wu, Yuan [1]

Affiliations:

[1] Jilin Univ, Sch Artificial Intelligence, Changchun, Peoples R China

Source:

NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024 | 2025 / Vol. 15361

Keywords:

Multi-domain text classification; Adversarial training; Margin discrepancy;

DOI:

10.1007/978-981-97-9437-9_14

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multi-domain text classification (MDTC) endeavors to harness available resources from correlated domains to enhance the classification accuracy of the target domain. Presently, most MDTC approaches that embrace adversarial training and the shared-private paradigm exhibit cutting-edge performance. Unfortunately, these methods face a non-negligible challenge: the absence of theoretical guarantees in the design of MDTC algorithms. The dearth of theoretical underpinning poses a substantial impediment to the advancement of MDTC algorithms. To tackle this problem, we first provide a theoretical analysis of MDTC by decomposing the MDTC task into multiple domain adaptation tasks. We incorporate the margin discrepancy as the measure of domain divergence and establish a new generalization bound based on Rademacher complexity. Subsequently, we propose a margin discrepancy-based adversarial training (MDAT) approach for MDTC, in accordance with our theoretical analysis. To validate the efficacy of the proposed MDAT method, we conduct empirical studies on two MDTC benchmarks. The experimental results demonstrate that our MDAT approach surpasses state-of-the-art baselines on both datasets.

Pages: 170-182

Page count: 13
