Refining electronic medical records representation in manifold subspace

被引:1
|
作者
Wang, Bolin [1 ]
Sun, Yuanyuan [1 ]
Chu, Yonghe [1 ]
Zhao, Di [1 ]
Yang, Zhihao [1 ]
Wang, Jian [1 ]
机构
[1] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian, Peoples R China
关键词
Electronic medical records; Distributed word representation; Geometric structure; Manifold; NONLINEAR DIMENSIONALITY REDUCTION;
D O I
10.1186/s12859-022-04653-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Electronic medical records (EMR) contain detailed information about patient health. Developing an effective representation model is of great significance for the downstream applications of EMR. However, processing data directly is difficult because EMR data has such characteristics as incompleteness, unstructure and redundancy. Therefore, preprocess of the original data is the key step of EMR data mining. The classic distributed word representations ignore the geometric feature of the word vectors for the representation of EMR data, which often underestimate the similarities between similar words and overestimate the similarities between distant words. This results in word similarity obtained from embedding models being inconsistent with human judgment and much valuable medical information being lost. Results In this study, we propose a biomedical word embedding framework based on manifold subspace. Our proposed model first obtains the word vector representations of the EMR data, and then re-embeds the word vector in the manifold subspace. We develop an efficient optimization algorithm with neighborhood preserving embedding based on manifold optimization. To verify the algorithm presented in this study, we perform experiments on intrinsic evaluation and external classification tasks, and the experimental results demonstrate its advantages over other baseline methods. Conclusions Manifold learning subspace embedding can enhance the representation of distributed word representations in electronic medical record texts. Reduce the difficulty for researchers to process unstructured electronic medical record text data, which has certain biomedical research value.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Documentation of Dual Sensory Impairment in Electronic Medical Records
    Dullard, Brittney
    Saunders, Gabrielle H.
    GERONTOLOGIST, 2016, 56 (02) : 313 - 317
  • [22] Automatic infection detection based on electronic medical records
    Tou, Huaixiao
    Yao, Lu
    Wei, Zhongyu
    Zhuang, Xiahai
    Zhang, Bo
    BMC BIOINFORMATICS, 2018, 19
  • [23] Automatic infection detection based on electronic medical records
    Huaixiao Tou
    Lu Yao
    Zhongyu Wei
    Xiahai Zhuang
    Bo Zhang
    BMC Bioinformatics, 19
  • [24] Comparison of Sequence Variants and the Application in Electronic Medical Records
    Li, Yuqing
    Le, Hieu Hanh
    Matsuo, Ryosuke
    Yamazaki, Tomoyoshi
    Araki, Kenji
    Yokota, Haruo
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2022, PT II, 2022, 13427 : 117 - 130
  • [25] Learning and recommending treatments using electronic medical records
    Hoang, Khanh Hung
    Ho, Tu Bao
    KNOWLEDGE-BASED SYSTEMS, 2019, 181
  • [26] A Predictive Method to Determine Incomplete Electronic Medical Records
    Talaei-Khoei, Amir
    Motiwalla, Luvai F.
    Kazemi, S. Farzan
    SIGMIS-CPR'18: PROCEEDINGS OF THE 2018 ACM SIGMIS CONFERENCE ON COMPUTERS AND PEOPLE RESEARCH, 2018, : 99 - 106
  • [27] Information Extraction for Intestinal Cancer Electronic Medical Records
    Wang, Sufen
    Pang, Minmin
    Pan, Changqing
    Yuan, Junyi
    Xu, Bo
    Du, Ming
    Zhang, Hong
    IEEE ACCESS, 2020, 8 : 125923 - 125934
  • [28] How are Electronic Medical Records Used by Nurse Practitioners?
    Borycki, Elizabeth M.
    Sangster-Gormley, Esther
    Schreiber, Rita
    Thompson, Joanne
    Griffith, Janessa
    Feddema, April
    Kuo, Alex
    E-HEALTH - FOR CONTINUITY OF CARE, 2014, 205 : 196 - 200
  • [30] Personal health records as portal to the electronic medical record
    Jennifer E. Cahill
    Mark R. Gilbert
    Terri S. Armstrong
    Journal of Neuro-Oncology, 2014, 117 : 1 - 6