A practical approach towards causality mining in clinical text using active transfer learning

Cited by: 4
Authors
Hussain, Musarrat [1]
Satti, Fahad Ahmed [1]
Hussain, Jamil [2]
Ali, Taqdir [1]
Ali, Syed Imran [1]
Bilal, Hafiz Syed Muhammad [1]
Park, Gwang Hoon [1]
Lee, Sungyoung [1]
Chung, TaeChoong [1]
Affiliations
[1] Kyung Hee Univ Seocheon Dong, Dept Comp Sci & Engn, Yongin 446701, Gyeonggi Do, South Korea
[2] Sejong Univ, Dept Data Sci, Seoul, South Korea
Keywords
Causality mining; Active transfer learning; Clinical text mining; Machine learning; LANGUAGE; INFORMATION; EXTRACTION; WORDNET;
DOI
10.1016/j.jbi.2021.103932
CLC number
TP39 [Computer applications];
Subject classification code
081203; 0835;
Abstract
Objective: Causality mining is an active research area that requires state-of-the-art natural language processing techniques. In the healthcare domain, medical experts write clinical text to overcome the limitations of well-defined, schema-driven information systems. The objective of this research is to create a framework that converts clinical text into causal knowledge. Methods: A practical approach based on term expansion, phrase generation, BERT-based phrase embedding and semantic matching, semantic enrichment, expert verification, and model evolution is used to construct a comprehensive causality mining framework. This active transfer learning based framework, along with its supplementary services, can extract and enrich causal relationships and their corresponding entities from clinical text. Results: The multi-model transfer learning technique, when applied over multiple iterations, gains substantial performance improvements. We also present a comparative analysis of the presented techniques with their common alternatives, which demonstrates the correctness of our approach and its ability to capture most causal relationships. Conclusion: The presented framework has provided cutting-edge results in the healthcare domain. However, the framework can also be tweaked to provide causality detection in other domains. Significance: Although the presented framework is generic enough to be used in any domain, healthcare services can gain massive benefits because of the voluminous and varied nature of their data. This causal knowledge extraction framework can be used to summarize clinical text, create personas, discover medical knowledge, and provide evidence for clinical decision making.
Pages: 15