Counterfactual Generator: A Weakly-Supervised Method for Named Entity Recognition

被引:0
作者
Zeng, Xiangji [1 ]
Li, Yunliang [1 ]
Zhai, Yuchen [1 ]
Zhang, Yin [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP) | 2020年
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Past progress on neural models has proven that named entity recognition is no longer a problem if we have enough labeled data. However, collecting enough data and annotating them are labor-intensive, time-consuming, and expensive. In this paper, we decompose the sentence into two parts: entity and context, and rethink the relationship between them and model performance from a causal perspective. Based on this, we propose the Counterfactual Generator, which generates counterfactual examples by the interventions on the existing observational examples to enhance the original dataset. Experiments across three datasets show that our method improves the generalization ability of models under limited observational examples. Besides, we provide a theoretical foundation by using a structural causal model to explore the spurious correlations between input features and output labels. We investigate the causal effects of entity or context on model performance under both conditions: the non-augmented and the augmented. Interestingly, we find that the non-spurious correlations are more located in entity representation rather than context representation. As a result, our method eliminates part of the spurious correlations between context representation and output labels. The code is available at https://github.com/xijiz/cfgen.
引用
收藏
页码:7270 / 7280
页数:11
相关论文
共 50 条
[21]   Weakly Supervised Attention Networks for Entity Recognition [J].
Patra, Barun ;
Moniz, Joel Ruben Antony .
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, :6268-6273
[22]   Noise Detection for Distant Supervised Named Entity Recognition [J].
Wang J. ;
Wang K. ;
Wang H. ;
Du W. ;
He Z. ;
Ruan T. ;
Liu J. .
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04) :916-928
[23]   Analysis of Different Supervised Techniques for Named Entity Recognition [J].
Goyal, Archana ;
Gupta, Vishal ;
Kumar, Manish .
ADVANCED INFORMATICS FOR COMPUTING RESEARCH, PT I, 2019, 1075 :184-195
[24]   Weakly Supervised Cross-Lingual Named Entity Recognition via Effective Annotation and Representation Projection [J].
Ni, Jian ;
Dinu, Georgiana ;
Florian, Radu .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1470-1480
[25]   External Knowledge-Based Weakly Supervised Learning Approach on Chinese Clinical Named Entity Recognition [J].
Duan, Yeheng ;
Ma, Long-Long ;
Han, Xianpei ;
Sun, Le ;
Dong, Bin ;
Jiang, Shanshan .
SEMANTIC TECHNOLOGY, JIST 2019: PROCEEDINGS, 2020, 12032 :336-352
[26]   A Method of Named Entity Recognition for Tigrinya [J].
Yohannes, Hailemariam Mehari ;
Amagasa, Toshiyuki .
APPLIED COMPUTING REVIEW, 2022, 22 (03) :56-68
[27]   Weakly-Supervised Facial Expression Recognition in the Wild With Noisy Data [J].
Zhang, Feifei ;
Xu, Mingliang ;
Xu, Changsheng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :1800-1814
[28]   Learning to select pseudo labels: a semi-supervised method for named entity recognition [J].
Zhen-zhen Li ;
Da-wei Feng ;
Dong-sheng Li ;
Xi-cheng Lu .
Frontiers of Information Technology & Electronic Engineering, 2020, 21 :903-916
[29]   Learning to select pseudo labels: a semi-supervised method for named entity recognition [J].
Li, Zhen-zhen ;
Feng, Da-wei ;
Li, Dong-sheng ;
Lu, Xi-cheng .
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (06) :903-916
[30]   Weakly-Supervised Symptom Recognition for Rare Diseases in Biomedical Text [J].
Holat, Pierre ;
Tomeh, Nadi ;
Charnois, Thierry ;
Battistelli, Delphine ;
Jaulent, Marie-Christine ;
Metivier, Jean-Philippe .
ADVANCES IN INTELLIGENT DATA ANALYSIS XV, 2016, 9897 :192-203