Extracting Clinical entities and their assertions from Chinese Electronic Medical Records Based on Machine Learning

被引:0
作者
Wang, Jianhong [1 ]
Peng, Yousong [1 ]
Liu, Bin [1 ]
Wu, Zhiqiang [1 ]
Deng, Lizong [2 ,3 ,4 ]
Jiang, Taijiao [1 ,2 ,3 ,4 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha, Hunan, Peoples R China
[2] Chinese Acad Med Sci, Inst Basic Med Sci, Ctr Syst Med, Beijing, Peoples R China
[3] Peking Union Med Coll, Beijing, Peoples R China
[4] Chinese Acad Med Sci, Ctr Syst Med, Suzhou, Peoples R China
来源
PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING, MANUFACTURING TECHNOLOGY AND CONTROL | 2016年 / 67卷
关键词
Chinese Electronic Medical Records; Information extraction; Named Entity Recognition; assertion classification; Machine Learning; RECOGNITION; INFORMATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid growth of electronic medical records (EMRs) in China, large amounts of clinical data have been accumulated. However, limited work for extracting information from EMRs in Chinese has been conducted. In this work, using manually annotated dataset of EMRs in Chinese, we investigated the clinical Named Entities Recognition (NER) based on Conditional Random Field (CRF) and further built a Support Vector Machine (SVM) classifier to determine their assertion status and evaluate the contributions of different features for assertion classification. For Chinese clinical NER, our CRF-based classifier achieved the best F-measure of 89.07%, while the SVM-based assertion classifier achieved a maximum F-measure of 94.10%. Our work suggests that machine learning methods are helpful in NER and assertion determination for Chinese medical clinical records.
引用
收藏
页码:1503 / 1508
页数:6
相关论文
共 14 条
  • [1] [Anonymous], P 2010 I2B2 VA WORKS
  • [2] MITRE system for clinical assertion status classification
    Clark, Cheryl
    Aberdeen, John
    Coarr, Matt
    Tresner-Kirsch, David
    Wellner, Ben
    Yeh, Alexander
    Hirschman, Lynette
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) : 563 - 567
  • [3] Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010
    de Bruijn, Berry
    Cherry, Colin
    Kiritchenko, Svetlana
    Martin, Joel
    Zhu, Xiaodan
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) : 557 - 562
  • [4] Recognition of medication information from discharge summaries using ensembles of classifiers
    Doan, Son
    Collier, Nigel
    Xu, Hua
    Pham Hoang Duy
    Tu Minh Phuong
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2012, 12
  • [5] Agreement, the F-measure, and reliability in information retrieval
    Hripcsak, G
    Rothschild, AS
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2005, 12 (03) : 296 - 298
  • [6] A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries
    Jiang, Min
    Chen, Yukun
    Liu, Mei
    Rosenbloom, S. Trent
    Mani, Subramani
    Denny, Joshua C.
    Xu, Hua
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) : 601 - 606
  • [7] A comprehensive study of named entity recognition in Chinese clinical text
    Lei, Jianbo
    Tang, Buzhou
    Lu, Xueqin
    Gao, Kaihua
    Jiang, Min
    Xu, Hua
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (05) : 808 - 814
  • [8] 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text
    Uzuner, Oezlem
    South, Brett R.
    Shen, Shuying
    DuVall, Scott L.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) : 552 - 556
  • [9] Extracting medication information from clinical text
    Uzuner, Oezlem
    Solti, Imre
    Cadag, Eithon
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (05) : 514 - 518
  • [10] Machine Learning and Rule-based Approaches to Assertion Classification
    Uzuner, Oezlem
    Zhang, Xiaoran
    Sibanda, Tawanda
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2009, 16 (01) : 109 - 115