Machine Learning for Automatic Encoding of French Electronic Medical Records: Is More Data Better ?

被引:1
|
作者
Gobeill, Julien [1 ,2 ]
Ruch, Patrick [1 ,2 ]
Meyer, Rodolphe [3 ]
机构
[1] Swiss Inst Bioinformat, SIB Text Min Grp, Geneva, Switzerland
[2] HES So HEG, Informat Sci, Geneva, Switzerland
[3] Univ Hospitals Geneva HUG, Informat Syst Dept, Geneva, Switzerland
来源
DIGITAL PERSONALIZED HEALTH AND MEDICINE | 2020年 / 270卷
关键词
Medical coding; machine learning; text mining;
D O I
10.3233/SHTI200173
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The encoding of Electronic Medical Records is a complex and time-consuming task. We report on a machine learning model for proposing diagnoses and procedures codes, from a large realistic dataset of 245 000 electronic medical records at the University Hospitals of Geneva. Our study particularly focuses on the impact of training data quantity on the model's performances. We show that the performances of the models do not increase while encoded instances from previous years are exploited for learning data. Furthermore, supervised models are shown to be highly perishable: we show a potential drop in performances of around -10% per year. Consequently, great and constant care must be exercised for designing and updating the content of such knowledge bases exploited by machine learning.
引用
收藏
页码:312 / 316
页数:5
相关论文
共 50 条
  • [21] Prediction and diagnosis of depression using machine learning with electronic health records data: a systematic review
    David Nickson
    Caroline Meyer
    Lukasz Walasek
    Carla Toro
    BMC Medical Informatics and Decision Making, 23
  • [22] An Application of Machine Learning to Etiological Diagnosis of Secondary Hypertension: Retrospective Study Using Electronic Medical Records
    Diao, Xiaolin
    Huo, Yanni
    Yan, Zhanzheng
    Wang, Haibin
    Yuan, Jing
    Wang, Yuxin
    Cai, Jun
    Zhao, Wei
    JMIR MEDICAL INFORMATICS, 2021, 9 (01)
  • [23] Implementation and Use of Disease Diagnosis Systems for Electronic Medical Records Based on Machine Learning: A Complete Review
    Latif, Jahanzaib
    Xiao, Chuangbai
    Tu, Shanshan
    Rehman, Sadaqat Ur
    Imran, Azhar
    Bilal, Anas
    IEEE ACCESS, 2020, 8 : 150489 - 150513
  • [24] Machine learning models for diabetes management in acute care using electronic medical records: A systematic review
    Rahimi, Amir Kamel
    Canfell, Oliver J.
    Chan, Wilkin
    Sly, Benjamin
    Pole, Jason D.
    Sullivan, Clair
    Shrapnel, Sally
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 162
  • [25] A machine learning model to predict the risk of 30-day readmissions in patients with heart failure: a retrospective analysis of electronic medical records data
    Golas, Sara Bersche
    Shibahara, Takuma
    Agboola, Stephen
    Otaki, Hiroko
    Sato, Jumpei
    Nakae, Tatsuya
    Hisamitsu, Toru
    Kojima, Go
    Felsted, Jennifer
    Kakarmath, Sujay
    Kvedar, Joseph
    Jethwani, Kamal
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
  • [26] A machine learning model to predict the risk of 30-day readmissions in patients with heart failure: a retrospective analysis of electronic medical records data
    Sara Bersche Golas
    Takuma Shibahara
    Stephen Agboola
    Hiroko Otaki
    Jumpei Sato
    Tatsuya Nakae
    Toru Hisamitsu
    Go Kojima
    Jennifer Felsted
    Sujay Kakarmath
    Joseph Kvedar
    Kamal Jethwani
    BMC Medical Informatics and Decision Making, 18
  • [27] On an Approach of the Solution of Machine Learning Problems Integrated with Data from the Open-Source System of Electronic Medical Records: Application for Fractures Prediction
    Martsenyuk, Vasyl
    Povoroznyuk, Vladyslav
    Semenets, Andriy
    Martynyuk, Larysa
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2019, PT II, 2019, 11509 : 228 - 239
  • [28] Machine Learning for Medical Data Integration
    Mueller, Armin
    Christmann, Lara-Sophie
    Kohler, Severin
    Eils, Roland
    Prasser, Fabian
    CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 691 - 695
  • [29] Subphenotyping depression using machine learning and electronic health records
    Xu, Zhenxing
    Wang, Fei
    Adekkanattu, Prakash
    Bose, Budhaditya
    Vekaria, Veer
    Brandt, Pascal
    Jiang, Guoqian
    Kiefer, Richard C.
    Luo, Yuan
    Pacheco, Jennifer A.
    Rasmussen, Luke V.
    Xu, Jie
    Alexopoulos, George
    Pathak, Jyotishman
    LEARNING HEALTH SYSTEMS, 2020, 4 (04):
  • [30] Automatic and Precise Data Validation for Machine Learning
    Shankar, Shreya
    Fawaz, Labib
    Gyllstrom, Karl
    Parameswaran, Aditya
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2198 - 2207