Identifying risk factors for heart disease over time: Overview of 2014 i2b2/UTHealth shared task Track 2

被引:72
作者
Stubbs, Amber [1 ]
Kotfila, Christopher [2 ]
Xu, Hua [3 ]
Uzuner, Ozlem [2 ]
机构
[1] Simmons Coll, Sch Lib & Informat Sci, 300 Fenway, Boston, MA 02115 USA
[2] SUNY Albany, Dept Informat Studies, Albany, NY 12222 USA
[3] Univ Texas Hlth Sci Ctr Houston, Ctr Computat Biomed, Houston, TX 77030 USA
关键词
Natural language processing; Clinical narratives; Diabetes; CAD;
D O I
10.1016/j.jbi.2015.07.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The second track of the 2014 i2b2/UTHealth natural language processing shared task focused on identifying medical risk factors related to Coronary Artery Disease (CAD) in the narratives of longitudinal medical records of diabetic patients. The risk factors included hypertension, hyperlipidemia, obesity, smoking status, and family history, as well as diabetes and CAD, and indicators that suggest the presence of those diseases. In addition to identifying the risk factors, this track of the 2014 i2b2/UTHealth shared task studied the presence and progression of the risk factors in longitudinal medical records. Twenty teams participated in this track, and submitted 49 system runs for evaluation. Six of the top 10 teams achieved F 1 scores over 0.90, and all 10 scored over 0.87. The most successful system used a combination of additional annotations, external lexicons, hand-written rules and Support Vector Machines. The results of this track indicate that identification of risk factors and their progression over time is well within the reach of automated systems. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:S67 / S77
页数:11
相关论文
共 31 条
[1]  
[Anonymous], 1992, P 4 C MESS UND MCLEA
[2]   A context-aware approach for progression tracking of medical concepts in electronic medical records [J].
Chang, Nai-Wen ;
Dai, Hong-Jie ;
Jonnagaddala, Jitendra ;
Chen, Chih-Wei ;
Tsai, Richard Tzong-Han ;
Hsu, Wen-Lian .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 :S150-S157
[3]   Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions [J].
Chapman, Wendy W. ;
Nadkarni, Prakash M. ;
Hirschman, Lynette ;
D'Avolio, Leonard W. ;
Savova, Guergana K. ;
Uzuner, Ozlem .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) :540-543
[4]   An automatic system to identify heart disease risk factors in clinical texts over time [J].
Chen, Qingcai ;
Li, Haodi ;
Tang, Buzhou ;
Wang, Xiaolong ;
Liu, Xin ;
Liu, Zengjian ;
Liu, Shu ;
Wang, Weida ;
Deng, Qiwen ;
Zhu, Suisong ;
Chen, Yangxin ;
Wang, Jingfeng .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 :S158-S163
[5]   Agile text mining for the 2014 i2b2/UTHealth Cardiac risk factors challenge [J].
Cormack, James ;
Nath, Chinmoy ;
Milward, David ;
Raja, Kalpana ;
Jonnalagadda, Siddhartha R. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 :S120-S127
[6]  
Dokken BB., 2008, Diabetes Spectr, V21, P160, DOI DOI 10.2337/DIASPECT.21.3.160
[7]   ConText: An algorithm for determining negation, experiencer, and temporal status from clinical reports [J].
Harkema, Henk ;
Dowling, John N. ;
Thornblade, Tyler ;
Chapman, Wendy W. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (05) :839-851
[8]   Using local lexicalized rules to identify heart disease risk factors in clinical notes [J].
Karystianis, George ;
Dehghan, Azad ;
Kovacevic, Aleksandar ;
Keane, John A. ;
Nenadic, Goran .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 :S183-S188
[9]   Adapting existing natural language processing resources for cardiovascular risk factors identification in clinical notes [J].
Khalifa, Abdulrahman ;
Meystre, Stephane .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 :S128-S132
[10]   Creation of a new longitudinal corpus of clinical narratives [J].
Kumar, Vishesh ;
Stubbs, Amber ;
Shaw, Stanley ;
Uzuner, Ozlem .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 :S6-S10