Machine Learning for Automatic Encoding of French Electronic Medical Records: Is More Data Better ?

被引:1
|
作者
Gobeill, Julien [1 ,2 ]
Ruch, Patrick [1 ,2 ]
Meyer, Rodolphe [3 ]
机构
[1] Swiss Inst Bioinformat, SIB Text Min Grp, Geneva, Switzerland
[2] HES So HEG, Informat Sci, Geneva, Switzerland
[3] Univ Hospitals Geneva HUG, Informat Syst Dept, Geneva, Switzerland
来源
DIGITAL PERSONALIZED HEALTH AND MEDICINE | 2020年 / 270卷
关键词
Medical coding; machine learning; text mining;
D O I
10.3233/SHTI200173
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The encoding of Electronic Medical Records is a complex and time-consuming task. We report on a machine learning model for proposing diagnoses and procedures codes, from a large realistic dataset of 245 000 electronic medical records at the University Hospitals of Geneva. Our study particularly focuses on the impact of training data quantity on the model's performances. We show that the performances of the models do not increase while encoded instances from previous years are exploited for learning data. Furthermore, supervised models are shown to be highly perishable: we show a potential drop in performances of around -10% per year. Consequently, great and constant care must be exercised for designing and updating the content of such knowledge bases exploited by machine learning.
引用
收藏
页码:312 / 316
页数:5
相关论文
共 50 条
  • [41] Predicting opioid dependence from electronic health records with machine learning
    Ellis, Randall J.
    Wang, Zichen
    Genes, Nicholas
    Ma'ayan, Avi
    BIODATA MINING, 2019, 12 (1)
  • [42] An Analysis of Integrating Machine Learning in Healthcare for Ensuring Confidentiality of the Electronic Records
    Seh, Adil Hussain
    Al-Amri, Jehad F.
    Subahi, Ahmad F.
    Agrawal, Alka
    Pathak, Nitish
    Kumar, Rajeev
    Khan, Raees Ahmad
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2022, 130 (03): : 1387 - 1422
  • [43] Using machine learning to detect sarcopenia from electronic health records
    Luo, Xiao
    Ding, Haoran
    Broyles, Andrea
    Warden, Stuart J.
    Moorthi, Ranjani N.
    Imel, Erik A.
    DIGITAL HEALTH, 2023, 9
  • [44] Machine learning approaches for electronic health records phenotyping: a methodical review
    Yang, Siyue
    Varghese, Paul
    Stephenson, Ellen
    Tu, Karen
    Gronsbell, Jessica
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (02) : 367 - 381
  • [45] Descriptive and Predictive Analytics on Electronic Health Records using Machine Learning
    Anandi, V
    Ramesh, M.
    2022 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL, COMPUTING, COMMUNICATION AND SUSTAINABLE TECHNOLOGIES (ICAECT), 2022,
  • [46] Predicting opioid dependence from electronic health records with machine learning
    Randall J. Ellis
    Zichen Wang
    Nicholas Genes
    Avi Ma’ayan
    BioData Mining, 12
  • [47] Machine Learning Analysis for Data Incompleteness (MADI): Analyzing the Data Completeness of Patient Records Using a Random Variable Approach to Predict the Incompleteness of Electronic Health Records
    Gurupur, Varadraj P.
    Shelleh, Muhammed
    IEEE ACCESS, 2021, 9 : 95994 - 96001
  • [48] Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records
    Zheyi Dong
    Qian Wang
    Yujing Ke
    Weiguang Zhang
    Quan Hong
    Chao Liu
    Xiaomin Liu
    Jian Yang
    Yue Xi
    Jinlong Shi
    Li Zhang
    Ying Zheng
    Qiang Lv
    Yong Wang
    Jie Wu
    Xuefeng Sun
    Guangyan Cai
    Shen Qiao
    Chengliang Yin
    Shibin Su
    Xiangmei Chen
    Journal of Translational Medicine, 20
  • [49] Prediction of the risk of cytopenia in hospitalized HIV/AIDS patients using machine learning methods based on electronic medical records
    Huang, Liling
    Xie, Bo
    Zhang, Kai
    Xu, Yuanlong
    Su, Lingsong
    Lv, Yu
    Lu, Yangjie
    Qin, Jianqiu
    Pang, Xianwu
    Qiu, Hong
    Li, Lanxiang
    Wei, Xihua
    Huang, Kui
    Meng, Zhihao
    Hu, Yanling
    Lv, Jiannan
    FRONTIERS IN PUBLIC HEALTH, 2023, 11
  • [50] Prediction of 3-year risk of diabetic kidney disease using machine learning based on electronic medical records
    Dong, Zheyi
    Wang, Qian
    Ke, Yujing
    Zhang, Weiguang
    Hong, Quan
    Liu, Chao
    Liu, Xiaomin
    Yang, Jian
    Xi, Yue
    Shi, Jinlong
    Zhang, Li
    Zheng, Ying
    Lv, Qiang
    Wang, Yong
    Wu, Jie
    Sun, Xuefeng
    Cai, Guangyan
    Qiao, Shen
    Yin, Chengliang
    Su, Shibin
    Chen, Xiangmei
    JOURNAL OF TRANSLATIONAL MEDICINE, 2022, 20 (01)