A unified framework of medical information annotation and extraction for Chinese clinical text

Cited by: 7
Authors
Zhu, Enwei [1, 2]
Sheng, Qilin [1]
Yang, Huanwan [1]
Liu, Yiyang [1, 2]
Cai, Ting [1, 2]
Li, Jinpeng [1, 2]
Affiliations
[1] Ningbo 2 Hosp, Ningbo 315010, Zhejiang, Peoples R China
[2] Univ Chinese Acad Sci, Ningbo Inst Life & Hlth Ind, Ningbo 315016, Zhejiang, Peoples R China
Keywords
Information extraction; Annotation scheme; Electronic medical record; Chinese clinical text; NEURAL-NETWORKS; CORPUS
DOI
10.1016/j.artmed.2023.102573
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Medical information extraction consists of a group of natural language processing (NLP) tasks that together convert clinical text into pre-defined structured formats. This is a critical step in exploiting electronic medical records (EMRs). Given the recent rapid progress in NLP technologies, model implementation and performance no longer seem to be obstacles; the bottleneck instead lies in obtaining a high-quality annotated corpus and in the overall engineering workflow. This study presents an engineering framework consisting of three tasks: medical entity recognition, relation extraction, and attribute extraction. Within this framework, the whole workflow is demonstrated, from EMR data collection through model performance evaluation. Our annotation scheme is designed to be comprehensive and compatible across the multiple tasks. With EMRs from a general hospital in Ningbo, China, and manual annotation by experienced physicians, our corpus is of large scale and high quality. Built upon this Chinese clinical corpus, the medical information extraction system shows performance that approaches human annotation. The annotation scheme, (a subset of) the annotated corpus, and the code are all publicly released to facilitate further research.
Pages: 12
Related papers
50 in total
[41]   Text Summarization towards Scientific Information Extraction [J].
Keller, Abigail ;
Furst, Jacob ;
Raicu, Daniela ;
Hastings, Peter ;
Tchoua, Roselyne .
2022 IEEE 18TH INTERNATIONAL CONFERENCE ON E-SCIENCE (ESCIENCE 2022), 2022, :225-235
[42]   A pilot investigation of Information Extraction in the semantic annotation of archaeological reports [J].
Vlachidis, Andreas ;
Tudhope, Douglas .
International Journal of Metadata, Semantics and Ontologies, 2012, 7 (03) :222-235
[43]   Pictorial Visualization of EMR Summary Interface and Medical Information Extraction of Clinical Notes [J].
Ruan, Wei ;
Appasani, Naveenkumar ;
Kim, Katherine ;
Vincelli, Joseph ;
Kim, Hyun ;
Lee, Won-Sook .
2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND VIRTUAL ENVIRONMENTS FOR MEASUREMENT SYSTEMS AND APPLICATIONS (CIVEMSA), 2018,
[44]   A Hybrid Method to Extract Clinical Information From Chinese Electronic Medical Records [J].
Cheng, Ming ;
Li, Liming ;
Ren, Yafeng ;
Lou, Yinxia ;
Gao, Jianbo .
IEEE ACCESS, 2019, 7 :70624-70633
[45]   Information Extraction from Free Text in Clinical Trials with Knowledge-based Distant Supervision [J].
Sun, Yingcheng ;
Loparo, Kenneth .
2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2019, :954-955
[46]   An Unsupervised Method for Entity Mentions Extraction in Chinese Text [J].
Xu, Jing ;
Gan, Liang ;
Zhou, Bin ;
Wu, Quanyuan .
ADVANCES IN SERVICES COMPUTING, 2016, 10065 :320-328
[47]   Open Relation Extraction from Chinese Microblog Text [J].
Xu, Jing ;
Gan, Liang ;
Yan, Zhou ;
Wu, Quanyuan ;
Jia, Yan .
2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016), 2016, :673-677
[48]   QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval [J].
Tan, Hongming ;
Zhan, Shaoxiong ;
Lin, Hai ;
Zheng, Hai-Tao ;
Chan, Wai Kin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (06) :3669-3683
[49]   A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction [J].
Zefa Hu ;
Ziyi Ni ;
Jing Shi ;
Shuang Xu ;
Bo Xu .
Machine Intelligence Research, 2024, 21 :153-168
[50]   A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction [J].
Hu, Zefa ;
Ni, Ziyi ;
Shi, Jing ;
Xu, Shuang ;
Xu, Bo .
MACHINE INTELLIGENCE RESEARCH, 2024, 21 (01) :153-168