quEHRy: a question answering system to query electronic health records

被引:3
作者
Soni, Sarvesh [1 ]
Datta, Surabhi [1 ]
Roberts, Kirk [1 ,2 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, Houston, TX USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, 7000 Fannin St,Suite 600, Houston, TX 77030 USA
关键词
question answering; electronic health records; natural language processing; artificial intelligence; machine learning; FHIR; CLINICAL QUESTIONS; CARE;
D O I
10.1093/jamia/ocad050
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective We propose a system, quEHRy, to retrieve precise, interpretable answers to natural language questions from structured data in electronic health records (EHRs). Materials and Methods We develop/synthesize the main components of quEHRy: concept normalization (MetaMap), time frame classification (new), semantic parsing (existing), visualization with question understanding (new), and query module for FHIR mapping/processing (new). We evaluate quEHRy on 2 clinical question answering (QA) datasets. We evaluate each component separately as well as holistically to gain deeper insights. We also conduct a thorough error analysis for a crucial subcomponent, medical concept normalization. Results Using gold concepts, the precision of quEHRy is 98.33% and 90.91% for the 2 datasets, while the overall accuracy was 97.41% and 87.75%. Precision was 94.03% and 87.79% even after employing an automated medical concept extraction system (MetaMap). Most incorrectly predicted medical concepts were broader in nature than gold-annotated concepts (representative of the ones present in EHRs), eg, Diabetes versus Diabetes Mellitus, Non-Insulin-Dependent. Discussion The primary performance barrier to deployment of the system is due to errors in medical concept extraction (a component not studied in this article), which affects the downstream generation of correct logical structures. This indicates the need to build QA-specific clinical concept normalizers that understand EHR context to extract the "relevant" medical concepts from questions. Conclusion We present an end-to-end QA system that allows information access from EHRs using natural language and returns an exact, verifiable answer. Our proposed system is high-precision and interpretable, checking off the requirements for clinical use.
引用
收藏
页码:1091 / 1102
页数:12
相关论文
共 50 条
  • [21] IQA: Interactive query construction in semantic question answering systems
    Zafar, Hamid
    Dubey, Mohnish
    Lehmann, Jens
    Demidova, Elena
    JOURNAL OF WEB SEMANTICS, 2020, 64 (64):
  • [22] Investigating Query Expansion and Coreference Resolution in Question Answering on BERT
    Bhattacharjee, Santanu
    Haque, Rejwanul
    Wenniger, Gideon Maillette De Buy
    Way, Andy
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2020), 2020, 12089 : 47 - 59
  • [23] SPEECH-DRIVEN QUERY RETRIEVAL FOR QUESTION-ANSWERING
    Mishra, Taniya
    Bangalore, Srinivas
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5318 - 5321
  • [24] A Vietnamese Question Answering System
    Dai Quoc Nguyen
    Dat Quoc Nguyen
    Son Bao Pham
    INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2009), 2009, : 26 - 32
  • [25] SBUQA Question Answering System
    Yarmohammadi, Mahsa A.
    Shamsfard, Mehrnoush
    Yarmohammadi, Mahshid A.
    Rouhizadeh, Masoud
    ADVANCES IN COMPUTER SCIENCE AND ENGINEERING, 2008, 6 : 316 - 323
  • [26] Savana: Re-using Electronic Health Records with Artificial Intelligence
    Hernandez Medrano, Ignacio
    Tello Guijarro, Jorge
    Belda, Cristobal
    Urena, Alberto
    Salcedo, Ignacio
    Espinosa-Anke, Luis
    Saggion, Horacio
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2018, 4 (07): : 8 - 12
  • [27] Query-Driven Knowledge Graph Construction using Question Answering and Multimodal Fusion
    Peng, Yang
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 1119 - 1126
  • [28] Knowledge Base Question Answering via Structured Query Generation using Question domain
    Li, Jiecheng
    Peng, Zizhen
    Zhu, Xiaoying
    Lu, Keda
    2022 IEEE 21ST INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS, IUCC/CIT/DSCI/SMARTCNS, 2022, : 394 - 400
  • [29] Automated methods for the summarization of electronic health records
    Pivovarov, Rimma
    Elhadad, Noemie
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2015, 22 (05) : 938 - 947
  • [30] A Question Answering System based on Conceptual Graph Formalism
    Salloum, Wael
    2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 3, 2009, : 383 - 386