EMR2vec: Bridging the gap between patient data and clinical trial

被引:14
作者
Dhayne, Houssein [1 ]
Kilany, Rima [1 ]
Haque, Rafiqul [2 ]
Taher, Yehia [3 ]
机构
[1] St Joseph Univ, Beirut, Lebanon
[2] Intelligencia, 66 Ave Champs Elysees, Paris, France
[3] David Lab, 45 Ave Etats Unis, Versailles, France
关键词
EMR; Clinical trial; Medical data integration; Neural network; Semantic web; MSC; 2010; 00-01; 99-00; ELECTRONIC HEALTH RECORDS; TEXT; CLASSIFICATION; IDENTIFICATION; INTEGRATION; SEARCH;
D O I
10.1016/j.cie.2021.107236
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Y The human suffering from diseases caused by life-threatening viruses such as SARS, Ebola, and COVID-19 motivated many of us to study and discover the best means to harness the potential of data integration to assist clinical researchers to curb these viruses. Integrating patients data with clinical trials data is enormously promising as it provides a comprehensive knowledge base that accelerates the clinical research response-ability to tackle emerging infectious disease outbreaks. This work introduces EMR2vec, a platform that customises advanced NLP, machine learning and semantic web techniques to link potential patients to suitable clinical trials. Linking these two different but complementary datasets allows clinicians and researchers to compare patients to clinical research opportunities or to automatically select patients for personalized clinical care. The platform derives a 'bag of medical terms' (BoMT) from eligibility criteria by normalizing extracted entities through SNOMED-CT ontology. With the usage of BoMT, an ontological reasoning method is proposed to represent EMR and clinical trials in a vector space model. The platform presents a matching process that reduces vector dimensionality using a neural network, then applies orthogonality projection to measure the similarity between vectors. Finally, the proposed EMR2vec platform is evaluated with an extendable prototype based on Big data tools.
引用
收藏
页数:15
相关论文
共 65 条
[51]  
Shirani-Mehr H., 2014, Applications of deep learning to sentiment analysis of movie reviews
[52]   An Introduction to Conditional Random Fields [J].
Sutton, Charles ;
McCallum, Andrew .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 4 (04) :267-373
[53]   A practical method for transforming free-text eligibility criteria into computable criteria [J].
Tu, Samson W. ;
Peleg, Mor ;
Carini, Simona ;
Bobak, Michael ;
Ross, Jessica ;
Rubin, Daniel ;
Sim, Ida .
JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (02) :239-250
[54]  
Wang F., 2011, P 2011 SIAM INT C DA, P59
[55]  
Wang F, 2012, INT C PATT RECOG, P1799
[56]   EliXR: an approach to eligibility criteria extraction and representation [J].
Weng, Chunhua ;
Wu, Xiaoying ;
Luo, Zhihui ;
Boland, Mary Regina ;
Theodoratos, Dimitri ;
Johnson, Stephen B. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 :I116-I124
[57]  
West R., 2000, EVIDENCE BASED MED O
[58]  
Wood R, 2013, LANCET, V382, P10
[59]   Sim2vec: Node similarity preserving network embedding [J].
Xie, Yu ;
Gong, Maoguo ;
Wang, Shanfeng ;
Liu, Wenfeng ;
Yu, Bin .
INFORMATION SCIENCES, 2019, 495 :37-51
[60]  
Xu M., 2020, PATTERN RECOGN