Refining electronic medical records representation in manifold subspace

Cited by: 1
Authors
Wang, Bolin [1]
Sun, Yuanyuan [1]
Chu, Yonghe [1]
Zhao, Di [1]
Yang, Zhihao [1]
Wang, Jian [1]
Affiliations
[1] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian, Peoples R China
Keywords
Electronic medical records; Distributed word representation; Geometric structure; Manifold; Nonlinear dimensionality reduction
DOI
10.1186/s12859-022-04653-7
Chinese Library Classification (CLC)
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Electronic medical records (EMR) contain detailed information about patient health, and an effective representation model is of great significance for downstream EMR applications. However, processing the data directly is difficult because EMR data are incomplete, unstructured, and redundant, so preprocessing the original data is a key step in EMR data mining. Classic distributed word representations ignore the geometric structure of the word vectors when representing EMR data, and therefore often underestimate the similarity between similar words and overestimate the similarity between distant words. As a result, the word similarity obtained from embedding models is inconsistent with human judgment, and much valuable medical information is lost.
Results: In this study, we propose a biomedical word embedding framework based on a manifold subspace. The proposed model first obtains word vector representations of the EMR data and then re-embeds the word vectors in the manifold subspace. We develop an efficient optimization algorithm with neighborhood preserving embedding based on manifold optimization. To verify the algorithm, we perform experiments on intrinsic evaluation and external classification tasks; the experimental results demonstrate its advantages over the baseline methods.
Conclusions: Manifold learning subspace embedding can enhance distributed word representations in electronic medical record texts. It reduces the difficulty researchers face in processing unstructured electronic medical record text data, which has biomedical research value.
Pages: 17
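The abstract describes a two-stage pipeline: obtain word vectors first, then re-embed them in a manifold subspace using neighborhood preserving embedding (NPE). The sketch below illustrates only the second stage, using the textbook NPE formulation (nearest neighbours, sum-to-one reconstruction weights, then a generalized eigenproblem). It is not the authors' optimization algorithm, which this record does not specify. The function name `npe_reembed`, its parameters, and the random vectors standing in for real EMR word vectors are all assumptions made for illustration.

```python
import numpy as np


def npe_reembed(X, n_neighbors=5, n_components=2, reg=1e-3):
    """Hypothetical helper: re-embed rows of X (n_samples, n_features) with basic NPE."""
    n, d = X.shape
    X = X - X.mean(axis=0)  # center the vectors

    # Step 1: k nearest neighbours by squared Euclidean distance (self excluded).
    sq = np.sum(X ** 2, axis=1)
    dist = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    np.fill_diagonal(dist, np.inf)
    nbrs = np.argsort(dist, axis=1)[:, :n_neighbors]

    # Step 2: weights that best reconstruct each vector from its neighbours,
    # constrained to sum to one.
    W = np.zeros((n, n))
    for i in range(n):
        Z = X[nbrs[i]] - X[i]                      # neighbours relative to x_i
        G = Z @ Z.T                                # local Gram matrix (k x k)
        G += reg * (np.trace(G) + 1e-12) * np.eye(n_neighbors)
        w = np.linalg.solve(G, np.ones(n_neighbors))
        W[i, nbrs[i]] = w / w.sum()

    # Step 3: linear projection preserving those weights, i.e. the smallest
    # eigenvectors of  X^T M X a = lambda X^T X a  with M = (I - W)^T (I - W).
    I_W = np.eye(n) - W
    A_mat = X.T @ (I_W.T @ I_W) @ X
    B_mat = X.T @ X + reg * np.eye(d)              # regularized so Cholesky succeeds
    L = np.linalg.cholesky(B_mat)
    # Reduce to a standard symmetric problem: L^{-1} A L^{-T} u = lambda u.
    C = np.linalg.solve(L, np.linalg.solve(L, A_mat).T).T
    C = 0.5 * (C + C.T)                            # remove round-off asymmetry
    _, eigvecs = np.linalg.eigh(C)                 # eigenvalues in ascending order
    U = eigvecs[:, :n_components]
    P = np.linalg.solve(L.T, U)                    # map back: a = L^{-T} u
    return X @ P, P


# Toy stand-in for pretrained word vectors (assumption: 40 words, 10 dimensions).
rng = np.random.default_rng(0)
vectors = rng.normal(size=(40, 10))
Y, P = npe_reembed(vectors, n_neighbors=6, n_components=3)
```

Here `Y` holds the re-embedded vectors and `P` is the learned linear projection, so vectors for words not seen when fitting can be mapped with the same `P` after centering. In practice the input would be vectors from a trained embedding model rather than random data.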