Clinical information extraction for preterm birth risk prediction

被引:10
作者
Sterckx, Lucas [1 ]
Vandewiele, Gilles [1 ]
Dehaene, Isabelle [2 ]
Janssens, Olivier [1 ]
Ongenae, Femke [1 ]
De Backere, Femke [1 ]
De Turck, Filip [1 ]
Roelens, Kristien [2 ]
Decruyenaere, Johan [3 ]
Van Hoecke, Sofie [1 ]
Demeester, Thomas [1 ]
机构
[1] Univ Ghent, IDLab, IMEC, Technol Pk Zwijnaarde 126, Ghent, Belgium
[2] Ghent Univ Hosp, Dept Gynaecol & Obstet, Corneel Heymanslaan 10, Ghent, Belgium
[3] Ghent Univ Hosp, Dept Intens Care Med, Corneel Heymanslaan 10, Ghent, Belgium
关键词
Clinical information extraction; Clinical decision support models; Preterm birth; Text mining; SYSTEM; TEXT; HISTORY;
D O I
10.1016/j.jbi.2020.103544
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper contributes to the pursuit of leveraging unstructured medical notes to structured clinical decision making. In particular, we present a pipeline for clinical information extraction from medical notes related to preterm birth, and discuss the main challenges as well as its potential for clinical practice. A large collection of medical notes, created by staff during hospitalizations of patients who were at risk of delivering preterm, was gathered and analyzed. Based on an annotated collection of notes, we trained and evaluated information extraction components to discover clinical entities such as symptoms, events, anatomical sites and procedures, as well as attributes linked to these clinical entities. In a retrospective study, we show that these are highly informative for clinical decision support models that are trained to predict whether delivery is likely to occur within specific time windows, in combination with structured information from electronic health records.
引用
收藏
页数:16
相关论文
共 57 条
[1]  
[Anonymous], 2012, COLING
[2]  
Boag Willie, 2018, AMIA Jt Summits Transl Sci Proc, V2017, P26
[3]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[4]   Cervical length and obstetric history predict spontaneous preterm birth: development and validation of a model to provide individualized risk assessment [J].
Celik, E. ;
To, M. ;
Gajewska, K. ;
Smith, G. C. S. ;
Nicolaides, K. H. .
ULTRASOUND IN OBSTETRICS & GYNECOLOGY, 2008, 31 (05) :549-554
[5]   A simple algorithm for identifying negated findings and diseases in discharge summaries [J].
Chapman, WW ;
Bridewell, W ;
Hanbury, P ;
Cooper, GF ;
Buchanan, BG .
JOURNAL OF BIOMEDICAL INFORMATICS, 2001, 34 (05) :301-310
[6]   Global, regional, and national estimates of levels of preterm birth in 2014: a systematic review and modelling analysis [J].
Chawanpaiboon, Saifon ;
Vogel, Joshua P. ;
Moller, Ann-Beth ;
Lumbiganon, Pisake ;
Petzold, Max ;
Hogan, Daniel ;
Landoulsi, Sihem ;
Jampathong, Nampet ;
Kongwattanakul, Kiattisak ;
Laopaiboon, Malinee ;
Lewis, Cameron ;
Rattanakanokchai, Siwanon ;
Teng, Ditza N. ;
Thinkhamrop, Jadsada ;
Watananirun, Kanokwaroon ;
Zhang, Jun ;
Zhou, Wei ;
Gulmezoglu, A. Metin .
LANCET GLOBAL HEALTH, 2019, 7 (01) :E37-E46
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]  
Click C., 2017, H2O
[9]   A survey of current work in biomedical text mining [J].
Cohen, AM ;
Hersh, WR .
BRIEFINGS IN BIOINFORMATICS, 2005, 6 (01) :57-71
[10]  
CREASY RK, 1980, OBSTET GYNECOL, V55, P692