A survey on extraction of causal relations from natural language text

被引:40
作者
Yang, Jie [1 ]
Han, Soyeon Caren [1 ]
Poon, Josiah [1 ]
机构
[1] Univ Sydney, Sch Comp Sci, 1 Cleveland St, Sydney, NSW 2006, Australia
关键词
Causality extraction; Explicit intra-sentential causality; Implicit causality; Inter-sentential causality; Deep learning; CORPUS;
D O I
10.1007/s10115-022-01665-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an essential component of human cognition, cause-effect relations appear frequently in text, and curating cause-effect relations from text helps in building causal networks for predictive tasks. Existing causality extraction techniques include knowledge-based, statistical machine learning (ML)-based, and deep learning-based approaches. Each method has its advantages and weaknesses. For example, knowledge-based methods are understandable but require extensive manual domain knowledge and have poor cross-domain applicability. Statistical machine learning methods are more automated because of natural language processing (NLP) toolkits. However, feature engineering is labor-intensive, and toolkits may lead to error propagation. In the past few years, deep learning techniques attract substantial attention from NLP researchers because of its powerful representation learning ability and the rapid increase in computational resources. Their limitations include high computational costs and a lack of adequate annotated training data. In this paper, we conduct a comprehensive survey of causality extraction. We initially introduce primary forms existing in the causality extraction: explicit intra-sentential causality, implicit causality, and inter-sentential causality. Next, we list benchmark datasets and modeling assessment methods for causal relation extraction. Then, we present a structured overview of the three techniques with their representative systems. Lastly, we highlight existing open challenges with their potential directions.
引用
收藏
页码:1161 / 1186
页数:26
相关论文
共 100 条
  • [1] All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning
    Airola, Antti
    Pyysalo, Sampo
    Bjoerne, Jari
    Pahikkala, Tapio
    Ginter, Filip
    Salakoski, Tapio
    [J]. BMC BIOINFORMATICS, 2008, 9 (Suppl 11)
  • [2] Asghar, 2016, ARXIV PREPRINT ARXIV
  • [3] Balashankar A, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P2338
  • [4] Beamer B., 2008, AAAI, P824
  • [5] Bekoulis G, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2830
  • [6] Joint entity recognition and relation extraction as a multi-head selection problem
    Bekoulis, Giannis
    Deleu, Johannes
    Demeester, Thomas
    Develder, Chris
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 34 - 45
  • [7] Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615
  • [8] Bethard S., 2008, P ACL 08, P177
  • [9] Blanco E, 2008, SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, P310
  • [10] Brown P. F., 1992, Computational Linguistics, V18, P467