PTEKC: pre-training with event knowledge of ConceptNet for cross-lingual event causality identification

被引:0
作者
Zhu, Enchang [1 ,2 ]
Yu, Zhengtao [1 ,2 ]
Huang, Yuxin [1 ,2 ]
Gao, Shengxiang [1 ,2 ]
Xian, Yantuan [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
[2] Yunnan Key Lab Artificial Intelligence, Kunming 650500, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Event causality identification; Event knowledge; Multilingual pre-trained language models; Parameter-sharing adapter; Pre-training; GRAPH;
D O I
10.1007/s13042-024-02367-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Event causality identification (ECI) aims to identify causal relations between events in texts. Although existing event causality identification works based on fine-tuning pre-trained language models (PLMs) have achieved promising results, they suffer from prohibitive computation costs, catastrophic forgetting of distributional knowledge, as well as poor interpretability. Particularly in low-resource and cross-linguistic scenarios, existing multi-lingual models are generally confronted with the so-called curse of multilinguality, language bias, and hence result in low accuracy and generalization ability. In this paper, we propose a paradigm, termed Pre-training with Event Knowledge of ConceptNet (PTEKC), to couple Multi-lingual Pre-trained Language Models (mPLMs) with event knowledge for cross-lingual event causality identification. Specifically, we have develop a parameter-sharing adapter plugin that facilitates the integration of event knowledge into the frozen PLMs. This approach significantly diminishes the number of trainable parameters and greatly reduces the risk of catastrophic forgetting. Our Adapter integrates multi-lingual alignment event knowledge into the mPLMs through two designed pre-training tasks, namely event masking and self-supervised link prediction. Extensive experiments on the benchmark dataset MECI show that PTEKC is parameter-efficient and can effectively incorporate multi-lingual alignment event knowledge for improving cross-lingual event causality identification.
引用
收藏
页码:1859 / 1872
页数:14
相关论文
共 46 条
  • [1] BAKER C F, 1998, Association for Computational Linguistics and Computational Linguistics, P86
  • [2] Chen MQ, 2023, PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, P10804
  • [3] Chen Meiqi, 2022, P 29 INT C COMP LING, P2118
  • [4] Chowdhury S. B. R., 2018, P 2018 C N AM CHAPT
  • [5] Conneau Alexis, 2020, P 58 ANN M ASS COMPU, DOI [10.18653/v1/2020.acl-main.747, DOI 10.18653/V1/2020.ACL-MAIN.747]
  • [6] Devlin J., 2018, arXiv
  • [7] Event causality identification via graph contrast-based knowledge augmented networks
    Ding, Ling
    Chen, Jianting
    Du, Peng
    Xiang, Yang
    [J]. INFORMATION SCIENCES, 2024, 656
  • [8] Feng Chao, 2023, arXiv
  • [9] Hashimoto C, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P2988
  • [10] Hashimoto C, 2014, PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P987