A semantic sequence similarity based approach for extracting medical entities from clinical conversations

被引:11
作者
Satti, Fahad Ahmed [1 ,2 ]
Hussain, Musarrat [1 ]
Ali, Syed Imran [1 ,2 ]
Saleem, Misha [3 ]
Ali, Husnain [3 ]
Chung, Tae Choong [1 ]
Lee, Sungyoung [1 ]
机构
[1] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 17104, South Korea
[2] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci SEECS, Islamabad 44000, Pakistan
[3] Care Med Ctr, Dept Neonatol, G-8, Islamabad 44080, Pakistan
关键词
Clinical data mining; Semantic similarity; Natural language processing;
D O I
10.1016/j.ipm.2022.103213
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clinical conversations between physicians and patients can provide a rich source of data, information, and knowledge. A plethora of tools and technologies have been developed to identify attributes of interest in unstructured text. However, identifying the name and correct value of an attribute, from real world data, in a timely manner is a nontrivial task. In this manuscript we present a novel pipeline using transfer learning, clinical concept dictionaries, and pattern matching to provide an end-to-end solution for identifying attributes and extracting their values from natural clinical text. On real-world data, with 1176 instances, we achieve an accuracy of 56.21%, which is 3% higher than the baseline methodology.
引用
收藏
页数:17
相关论文
共 45 条
[1]  
Abdullah MF, 2013, INT CONF RES INNOV, P151, DOI 10.1109/ICRIIS.2013.6716700
[2]   RETRACTED: Machine Translation System Using Deep Learning for English to Urdu (Retracted Article) [J].
Andrabi, Syed Abdul Basit ;
Wahid, Abdul .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[3]  
[Anonymous], 2009, P WORKSHOP BIONLP BI, DOI DOI 10.3115/1572364.1572390
[4]   Community Health Programs Delivered Through Information and Communications Technology in High-Income Countries: Scoping Review [J].
Beks, Hannah ;
King, Olivia ;
Clapham, Renee ;
Alston, Laura ;
Glenister, Kristen ;
McKinstry, Carol ;
Quilliam, Claire ;
Wellwood, Ian ;
Williams, Catherine ;
Wong Shee, Anna .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (03)
[5]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[6]  
Cer D., 2017, P 2017 SEMVAL INT WO, V2017, DOI [DOI 10.18653/V1/S17-2001, 10.18653/v1/s17-2001]
[7]   Digital technologies, healthcare and Covid-19: insights from developing and emerging nations [J].
Chandra, Mukesh ;
Kumar, Kunal ;
Thakur, Prabhat ;
Chattopadhyaya, Somnath ;
Alam, Firoz ;
Kumar, Satish .
HEALTH AND TECHNOLOGY, 2022, 12 (02) :547-568
[8]   Exploring the Online Doctor-Patient Interaction on Patient Satisfaction Based on Text Mining and Empirical Analysis [J].
Chen, Shuqing ;
Guo, Xitong ;
Wu, Tianshi ;
Ju, Xiaofeng .
INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (05)
[9]   Identifying Electronic Nicotine Delivery System Brands and Flavors on Instagram: Natural Language Processing Analysis [J].
Chew, Rob ;
Wenger, Michael ;
Guillory, Jamie ;
Nonnemaker, James ;
Kim, Annice .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (01)
[10]  
Chiticariu L, 2010, P C EMP METH NAT LAN