Reinforced Counterfactual Data Augmentation for Dual Sentiment Classification

被引:0
|
作者
Chen, Hao [1 ]
Xia, Rui [1 ]
Yu, Jianfei [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Jiangsu, Peoples R China
来源
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021) | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data augmentation and adversarial perturbation approaches have recently achieved promising results in solving the over-fitting problem in many natural language processing (NLP) tasks including sentiment classification. However, existing studies aimed to improve the generalization ability by augmenting the training data with synonymous examples or adding random noises to word embeddings, which cannot address the spurious association problem. In this work, we propose an end-toend reinforcement learning framework, which jointly performs counterfactual data generation and dual sentiment classification. Our approach has three characteristics: 1) the generator automatically generates massive and diverse antonymous sentences; 2) the discriminator contains a original-side sentiment predictor and an antonymous-side sentiment predictor, which jointly evaluate the quality of the generated sample and help the generator iteratively generate higher-quality antonymous samples; 3) the discriminator is directly used as the final sentiment classifier without the need to build an extra one. Extensive experiments show that our approach outperforms strong data augmentation baselines on several benchmark sentiment classification datasets. Further analysis confirms our approach's advantages in generating more diverse training samples and solving the spurious association problem in sentiment classification.
引用
收藏
页码:269 / 278
页数:10
相关论文
共 50 条
  • [1] SubCrime: Counterfactual Data Augmentation for Target Sentiment Analysis
    Chenl, Wei
    Wangl, Lulu
    Due, Jinglong
    Het, Zhongshi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 307 - 319
  • [2] A Novel Counterfactual Data Augmentation Method for Aspect-Based Sentiment Analysis
    Wu, Dongming
    Wen, Lulu
    Chen, Chao
    Shi, Zhaoshu
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [3] Data augmentation for sentiment classification with semantic preservation and diversity
    Chao, Guoqing
    Liu, Jingyao
    Wang, Mingyu
    Chu, Dianhui
    KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [4] Counterfactual Representation Augmentation for Cross-Domain Sentiment Analysis
    Wang, Ke
    Wan, Xiaojun
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 1979 - 1990
  • [5] Pseudo dense counterfactual augmentation for aspect-based sentiment analysis
    Ouyang, Jihong
    Feng, Shi
    Wang, Bing
    Yang, Zhiyao
    NEUROCOMPUTING, 2023, 561
  • [6] CHARCNN-SVM FOR CHINESE TEXT DATASETS SENTIMENT CLASSIFICATION WITH DATA AUGMENTATION
    Wang, Xingkai
    Sheng, Yiqiang
    Deng, Haojiang
    Zhao, Zhenyu
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2019, 15 (01): : 227 - 246
  • [7] Counterfactual Data Augmentation for Neural Machine Translation
    Liu, Qi
    Kusner, Matt J.
    Blunsom, Phil
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 187 - 197
  • [8] Lexical data augmentation for sentiment analysis
    Xiang, Rong
    Chersoni, Emmanuele
    Lu, Qin
    Huang, Chu-Ren
    Li, Wenjie
    Long, Yunfei
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2021, 72 (11) : 1432 - 1447
  • [9] Adversarial counterfactual augmentation: application in Alzheimer's disease classification
    Xia, Tian
    Sanchez, Pedro
    Qin, Chen
    Tsaftaris, Sotirios A.
    FRONTIERS IN RADIOLOGY, 2022, 2
  • [10] MDA: Multimodal Data Augmentation Framework for Boosting Performance on Sentiment/Emotion Classification Tasks
    Xu, Nan
    Mao, Wenji
    Wei, Penghui
    Zeng, Daniel
    IEEE INTELLIGENT SYSTEMS, 2021, 36 (06) : 3 - 11