quEHRy: a question answering system to query electronic health records

被引:5
作者
Soni, Sarvesh [1 ]
Datta, Surabhi [1 ]
Roberts, Kirk [1 ,2 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, Houston, TX USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, 7000 Fannin St,Suite 600, Houston, TX 77030 USA
关键词
question answering; electronic health records; natural language processing; artificial intelligence; machine learning; FHIR; CLINICAL QUESTIONS; CARE;
D O I
10.1093/jamia/ocad050
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective We propose a system, quEHRy, to retrieve precise, interpretable answers to natural language questions from structured data in electronic health records (EHRs). Materials and Methods We develop/synthesize the main components of quEHRy: concept normalization (MetaMap), time frame classification (new), semantic parsing (existing), visualization with question understanding (new), and query module for FHIR mapping/processing (new). We evaluate quEHRy on 2 clinical question answering (QA) datasets. We evaluate each component separately as well as holistically to gain deeper insights. We also conduct a thorough error analysis for a crucial subcomponent, medical concept normalization. Results Using gold concepts, the precision of quEHRy is 98.33% and 90.91% for the 2 datasets, while the overall accuracy was 97.41% and 87.75%. Precision was 94.03% and 87.79% even after employing an automated medical concept extraction system (MetaMap). Most incorrectly predicted medical concepts were broader in nature than gold-annotated concepts (representative of the ones present in EHRs), eg, Diabetes versus Diabetes Mellitus, Non-Insulin-Dependent. Discussion The primary performance barrier to deployment of the system is due to errors in medical concept extraction (a component not studied in this article), which affects the downstream generation of correct logical structures. This indicates the need to build QA-specific clinical concept normalizers that understand EHR context to extract the "relevant" medical concepts from questions. Conclusion We present an end-to-end QA system that allows information access from EHRs using natural language and returns an exact, verifiable answer. Our proposed system is high-precision and interpretable, checking off the requirements for clinical use.
引用
收藏
页码:1091 / 1102
页数:12
相关论文
共 50 条
[31]   Semantic computation in a Chinese Question-Answering system [J].
Sujian Li ;
Jian Zhang ;
Xiong Huang ;
Shuo Bai ;
Qun Liu .
Journal of Computer Science and Technology, 2002, 17 :933-939
[32]   Question-Answering and Recommendation System on Cooking Recipes [J].
Manna, Riyanka ;
Das, Dipankar ;
Gelbukh, Alexander .
COMPUTACION Y SISTEMAS, 2021, 25 (01) :223-235
[33]   Early Detection of Pancreatic Cancer Applying Artificial Intelligence to Electronic Health Records [J].
Kenner, Barbara J. ;
Abrams, Natalie D. ;
Chari, Suresh T. ;
Field, Bruce F. ;
Goldberg, Ann E. ;
Hoos, William A. ;
Klimstra, David S. ;
Rothschild, Laura J. ;
Srivastava, Sudhir ;
Young, Matthew R. ;
Go, Vay Liang W. .
PANCREAS, 2021, 50 (07) :916-922
[34]   Semantic computation in a Chinese question-answering system [J].
Li, SJ ;
Zhang, J ;
Huang, X ;
Bai, S ;
Liu, Q .
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (06) :933-939
[35]   DAWQAS: A Dataset for Arabic Why Question Answering System [J].
Ismail, Walaa Saber ;
Homsi, Masun Nabhan .
ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 :123-131
[36]   Predicting answer acceptability for question-answering system [J].
Roy, Pradeep Kumar .
INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2024, 25 (04) :555-568
[37]   Question answering system with text mining and deep networks [J].
Ardac, Hueseyin Avni ;
Erdogmus, Pakize .
EVOLVING SYSTEMS, 2024, 15 (05) :1787-1799
[38]   AQUEOS: A System for Question Answering over Semantic Data [J].
Toti, Daniele .
2014 INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS), 2014, :716-719
[39]   Deep learning based question answering system in Bengali [J].
Mayeesha, Tasmiah Tahsin ;
Sarwar, Abdullah Md ;
Rahman, Rashedur M. .
JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2021, 5 (02) :145-178
[40]   Secure decentralized electronic health records sharing system based on blockchains [J].
Shuaib, Khaled ;
Abdella, Juhar ;
Sallabi, Farag ;
Serhani, Mohamed Adel .
JOURNAL OF KING SAUD UNIVERSITY COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) :5045-5058