Improving long-tail relation extraction via adaptive adjustment and causal inference

被引:4
作者
Tang, Jingyao [1 ]
Li, Lishuang [1 ]
Lu, Hongbin [1 ]
Zhang, Beibei [1 ]
Wu, Haiming [2 ]
机构
[1] Dalian Univ Technol, Sch Comp Sci & Technol, 2 Linggong Rd, Dalian 116024, Liaoning, Peoples R China
[2] Beijing Inst Technol, Sch Comp Sci & Technol, 5 South St, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Long tail; Relation Extraction; Adaptive adjustment; Causal inference;
D O I
10.1016/j.neucom.2023.126563
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting long-tail relations poses a significant challenge. Traditional models struggle with weak generalization on tail classes due to the limited sample size. To overcome the limitation, we propose a novel long-tail relation extraction model based on Adaptive Adjustment and Causal Inference (AACI). Specifically, AACI leverages class -adaptive adjustment terms to increase the relative margins between head and tail classes, improving the dis-criminability of tail classes and further enhancing their generalization. Moreover, the learning of our model may encounter multiple spurious correlations due to confounding variables. Therefore, we construct a Structural Causal Model (SCM) for AACI to formalize all spurious correlations and apply causal inference methods to eliminate negative effects of these correlations, thus improving the robustness of AACI. We evaluate our model on the NYT24 and NYT datasets. Our experiments demonstrate that AACI effectively modulates the class margins, eliminates the spurious correlations, and outperforms existing state-of-the-art methods.
引用
收藏
页数:12
相关论文
共 27 条
[21]  
Riedel S, 2010, LECT NOTES ARTIF INT, V6323, P148, DOI 10.1007/978-3-642-15939-8_10
[22]   Unbiased Scene Graph Generation from Biased Training [J].
Tang, Kaihua ;
Niu, Yulei ;
Huang, Jianqiang ;
Shi, Jiaxin ;
Zhang, Hanwang .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3713-3722
[23]  
Wang J., 2023, arXiv
[24]  
Wang YW, 2022, NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, P3071
[25]  
Zhang NY, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P3016
[26]  
Zhang Y, 2023, Arxiv, DOI arXiv:2110.04596
[27]  
Zhang YH, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2205