A survey on extraction of causal relations from natural language text

被引:40
作者
Yang, Jie [1 ]
Han, Soyeon Caren [1 ]
Poon, Josiah [1 ]
机构
[1] Univ Sydney, Sch Comp Sci, 1 Cleveland St, Sydney, NSW 2006, Australia
关键词
Causality extraction; Explicit intra-sentential causality; Implicit causality; Inter-sentential causality; Deep learning; CORPUS;
D O I
10.1007/s10115-022-01665-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As an essential component of human cognition, cause-effect relations appear frequently in text, and curating cause-effect relations from text helps in building causal networks for predictive tasks. Existing causality extraction techniques include knowledge-based, statistical machine learning (ML)-based, and deep learning-based approaches. Each method has its advantages and weaknesses. For example, knowledge-based methods are understandable but require extensive manual domain knowledge and have poor cross-domain applicability. Statistical machine learning methods are more automated because of natural language processing (NLP) toolkits. However, feature engineering is labor-intensive, and toolkits may lead to error propagation. In the past few years, deep learning techniques attract substantial attention from NLP researchers because of its powerful representation learning ability and the rapid increase in computational resources. Their limitations include high computational costs and a lack of adequate annotated training data. In this paper, we conduct a comprehensive survey of causality extraction. We initially introduce primary forms existing in the causality extraction: explicit intra-sentential causality, implicit causality, and inter-sentential causality. Next, we list benchmark datasets and modeling assessment methods for causal relation extraction. Then, we present a structured overview of the three techniques with their representative systems. Lastly, we highlight existing open challenges with their potential directions.
引用
收藏
页码:1161 / 1186
页数:26
相关论文
共 100 条
  • [11] Extracting causal relations on HIV drug resistance from literature
    Bui, Quoc-Chinh
    Nuallain, Breanndan O.
    Boucher, Charles A.
    Sloot, Peter M. A.
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [12] Incremental cue phrase learning and bootstrapping method for causality extraction using cue phrase and word pair probabilities
    Chang, DS
    Choi, KS
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (03) : 662 - 678
  • [13] Chen Daoyuan, 2020, P 58 ANN M ASS COMP, P5940, DOI DOI 10.18653/V1/2020.ACL-MAIN.527
  • [14] Chen JF, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1726
  • [15] The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation
    Chicco, Davide
    Jurman, Giuseppe
    [J]. BMC GENOMICS, 2020, 21 (01)
  • [16] Christopoulou F, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P4925
  • [17] Christopoulou F, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P81
  • [18] A lightweight tool for automatically extracting causal relationships from text
    Cole, Stephen V.
    Royal, Matthew D.
    Valtorta, Marco G.
    Huhns, Michael N.
    Bowles, John B.
    [J]. PROCEEDINGS OF THE IEEE SOUTHEASTCON 2006, 2006, : 125 - 129
  • [19] Information extraction
    Cowie, J
    Lehnert, W
    [J]. COMMUNICATIONS OF THE ACM, 1996, 39 (01) : 80 - 91
  • [20] Dasgupta T, 2018, 19TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2018), P306