EMR-based medical knowledge representation and inference via Markov random fields and distributed representation learning

被引：25

作者：

Zhao, Chao ^{[1
]}

Jiang, Jingchi ^{[1
]}

Guan, Yi ^{[1
]}

Guo, Xitong ^{[2
]}

He, Bin ^{[1
]}

机构：

[1] Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China

[2] Harbin Inst Technol, Sch Management, Harbin 150001, Heilongjiang, Peoples R China

来源：

ARTIFICIAL INTELLIGENCE IN MEDICINE | 2018年 / 87卷

基金：

中国国家自然科学基金;

关键词：

Electronic medical record; Clinical decision support; Medical knowledge network; Markov random field; Distributed representation; INFORMATION EXTRACTION; BAYESIAN NETWORKS; DIAGNOSIS; MODELS;

D O I：

10.1016/j.artmed.2018.03.005

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Objective: Electronic medical records (EMRs) contain medical knowledge that can be used for clinical decision support (CDS). Our objective is to develop a general system that can extract and represent knowledge contained in EMRs to support three CDS tasks-test recommendation, initial diagnosis, and treatment plan recommendation-given the condition of a patient. Methods: We extracted four kinds of medical entities from records and constructed an EMR-based medical knowledge network (EMKN), in which nodes are entities and edges reflect their co-occurrence in a record. Three bipartite subgraphs (bigraphs) were extracted from the EMKN, one to support each task. One part of the bigraph was the given condition (e.g., symptoms), and the other was the condition to be inferred (e.g., diseases). Each bigraph was regarded as a Markov random field (MRF) to support the inference. We proposed three graph-based energy functions and three likelihood-based energy functions. Two of these functions are based on knowledge representation learning and can provide distributed representations of medical entities. Two EMR datasets and three metrics were utilized to evaluate the performance. Results: As a whole, the evaluation results indicate that the proposed system outperformed the baseline methods. The distributed representation of medical entities does reflect similarity relationships with respect to knowledge level. Conclusion: Combining EMKN and MRF is an effective approach for general medical knowledge representation and inference. Different tasks, however, require individually designed energy functions. (C) 2018 Elsevier B.V. All rights reserved.

引用

页码：49 / 59

页数：11

共 60 条

[1] Artificial neural networks in medical diagnosis [J].

Amato, Filippo ;

Lopez, Alberto ;

Pena-Mendez, Eladia Maria ;

Vanhara, Petr ;

Hampl, Ales ;

Havel, Josef .

JOURNAL OF APPLIED BIOMEDICINE, 2013, 11 (02) :47-58

[2]

[Anonymous], 2017, Briefings in bioinformatics

[3]

Bastian M., 2009, 3 INT AAAI C WEBLOGS, DOI [10.13140/2.1.1341.1520, DOI 10.1609/ICWSM.V3I1.13937]

[4]

Bengio Y, 2001, ADV NEUR IN, V13, P932

[5] Representation Learning: A Review and New Perspectives [J].

Bengio, Yoshua ;

Courville, Aaron ;

Vincent, Pascal .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828

[6]

Bordes A, 2011, C ART INT NUMB EPFL

[7]

Bordes A., 2013, ADV NEURAL INFORM PR, P2787

[8] Structure space of Bayesian networks is dramatically reduced by subdividing it in sub-networks [J].

Bouhamed, Heni ;

Masmoudi, Afif ;

Lecroq, Thierry ;

Rebai, Ahmed .

JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2015, 287 :48-62

[9]

Buckley C., 2000, Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P33, DOI DOI 10.1145/345508.345543

[10]

Cheng Y., 2016, P 2016 SIAM INT C DA, P432, DOI 10.1137/1.9781611974348.49

← 1 2 3 4 5 6 →