Machine Learning for Automatic Encoding of French Electronic Medical Records: Is More Data Better ?

被引:1
|
作者
Gobeill, Julien [1 ,2 ]
Ruch, Patrick [1 ,2 ]
Meyer, Rodolphe [3 ]
机构
[1] Swiss Inst Bioinformat, SIB Text Min Grp, Geneva, Switzerland
[2] HES So HEG, Informat Sci, Geneva, Switzerland
[3] Univ Hospitals Geneva HUG, Informat Syst Dept, Geneva, Switzerland
来源
DIGITAL PERSONALIZED HEALTH AND MEDICINE | 2020年 / 270卷
关键词
Medical coding; machine learning; text mining;
D O I
10.3233/SHTI200173
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The encoding of Electronic Medical Records is a complex and time-consuming task. We report on a machine learning model for proposing diagnoses and procedures codes, from a large realistic dataset of 245 000 electronic medical records at the University Hospitals of Geneva. Our study particularly focuses on the impact of training data quantity on the model's performances. We show that the performances of the models do not increase while encoded instances from previous years are exploited for learning data. Furthermore, supervised models are shown to be highly perishable: we show a potential drop in performances of around -10% per year. Consequently, great and constant care must be exercised for designing and updating the content of such knowledge bases exploited by machine learning.
引用
收藏
页码:312 / 316
页数:5
相关论文
共 50 条
  • [1] Natural language processing and machine learning to enable automatic extraction and classification of patients' smoking status from electronic medical records
    Caccamisi, Andrea
    Jorgensen, Leif
    Dalianis, Hercules
    Rosenlund, Mats
    UPSALA JOURNAL OF MEDICAL SCIENCES, 2020, 125 (04) : 316 - 324
  • [2] Applications of Machine Learning Using Electronic Medical Records in Spine Surgery
    Schwartz, John T.
    Gao, Michael
    Geng, Eric A.
    Mody, Kush S.
    Mikhail, Christopher M.
    Cho, Samuel K.
    NEUROSPINE, 2019, 16 (04) : 643 - 653
  • [3] Using Electronic Health Records and Machine Learning to Make Medical-Related Predictions from Non-Medical Data
    Pitoglou, Stavros
    Koumpouros, Yiannis
    Anastasiou, Athanasios
    2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA ENGINEERING (ICMLDE 2018), 2018, : 56 - 60
  • [4] Automatic infection detection based on electronic medical records
    Huaixiao Tou
    Lu Yao
    Zhongyu Wei
    Xiahai Zhuang
    Bo Zhang
    BMC Bioinformatics, 19
  • [5] Automatic infection detection based on electronic medical records
    Tou, Huaixiao
    Yao, Lu
    Wei, Zhongyu
    Zhuang, Xiahai
    Zhang, Bo
    BMC BIOINFORMATICS, 2018, 19
  • [6] Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage
    Tang, Jianxiang
    Wang, Xiaoyu
    Wan, Hongli
    Lin, Chunying
    Shao, Zilun
    Chang, Yang
    Wang, Hexuan
    Wu, Yi
    Zhang, Tao
    Du, Yu
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [7] Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage
    Jianxiang Tang
    Xiaoyu Wang
    Hongli Wan
    Chunying Lin
    Zilun Shao
    Yang Chang
    Hexuan Wang
    Yi Wu
    Tao Zhang
    Yu Du
    BMC Medical Informatics and Decision Making, 22
  • [8] Predicting Antibiotic Resistance in Hospitalized Patients by Applying Machine Learning to Electronic Medical Records
    Lewin-Epstein, Ohad
    Baruch, Shoham
    Hadany, Lilach
    Stein, Gideon Y.
    Obolski, Uri
    CLINICAL INFECTIOUS DISEASES, 2021, 72 (11) : E848 - E855
  • [9] Cognitive performance classification of older patients using machine learning and electronic medical records
    Richter-Laskowska, Monika
    Sobotnicka, Ewelina
    Bednorz, Adam
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [10] Machine learning computational model to predict lung cancer using electronic medical records
    Levi, Matanel
    Lazebnik, Teddy
    Kushnir, Shiri
    Yosef, Noga
    Shlomi, Dekel
    CANCER EPIDEMIOLOGY, 2024, 92