Counterfactual Generator: A Weakly-Supervised Method for Named Entity Recognition

被引:0
作者
Zeng, Xiangji [1 ]
Li, Yunliang [1 ]
Zhai, Yuchen [1 ]
Zhang, Yin [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP) | 2020年
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Past progress on neural models has proven that named entity recognition is no longer a problem if we have enough labeled data. However, collecting enough data and annotating them are labor-intensive, time-consuming, and expensive. In this paper, we decompose the sentence into two parts: entity and context, and rethink the relationship between them and model performance from a causal perspective. Based on this, we propose the Counterfactual Generator, which generates counterfactual examples by the interventions on the existing observational examples to enhance the original dataset. Experiments across three datasets show that our method improves the generalization ability of models under limited observational examples. Besides, we provide a theoretical foundation by using a structural causal model to explore the spurious correlations between input features and output labels. We investigate the causal effects of entity or context on model performance under both conditions: the non-augmented and the augmented. Interestingly, we find that the non-spurious correlations are more located in entity representation rather than context representation. As a result, our method eliminates part of the spurious correlations between context representation and output labels. The code is available at https://github.com/xijiz/cfgen.
引用
收藏
页码:7270 / 7280
页数:11
相关论文
共 50 条
[41]   Named entity recognition: a semi-supervised learning approach [J].
Sintayehu H. ;
Lehal G.S. .
International Journal of Information Technology, 2021, 13 (4) :1659-1665
[42]   Better Sampling of Negatives for Distantly Supervised Named Entity Recognition [J].
Xu, Lu ;
Bing, Lidong ;
Lu, Wei .
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, :4874-4882
[43]   Weakly-Supervised Crack Detection [J].
Inoue, Yuki ;
Nagayoshi, Hiroto .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) :12050-12061
[44]   Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora [J].
Klementiev, Alexandre ;
Roth, Dan .
COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, :817-824
[45]   Weakly-Supervised Cross-Domain Dictionary Learning for Visual Recognition [J].
Zhu, Fan ;
Shao, Ling .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 109 (1-2) :42-59
[46]   GEOGRAPHIC INFORMATION USE IN WEAKLY-SUPERVISED DEEP LEARNING FOR LANDMARK RECOGNITION [J].
Yin, Yifang ;
Liu, Zhenguang ;
Zimmermann, Roger .
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, :1015-1020
[47]   A Hybrid Method for Persian Named Entity Recognition [J].
Ahmadi, Farid ;
Moradi, Hamed .
2015 7th Conference on Information and Knowledge Technology (IKT), 2015,
[48]   A named entity recognition method of english product [J].
Gu, Chuan ;
Zheng, Xia ;
Yu, Jiangde .
Boletin Tecnico/Technical Bulletin, 2017, 55 (05) :56-61
[49]   Named entity recognition using acyclic weighted digraphs: A semi-supervised statistical method [J].
Kim, Kono ;
Yoon, Yeohoon ;
Kim, Harksoo ;
Seo, Jungyun .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, 4426 :571-+
[50]   A Fusion Tuning Method for Named Entity Recognition [J].
Wang, Jitian ;
Chen, Yanping ;
Zou, Anqi ;
Qin, Yongbin ;
Huang, Ruizhang .
BIG DATA, BIGDATA 2024, 2025, 2301 :260-274