Artificial Intelligence-Based Traditional Chinese Medicine Assistive Diagnostic System: Validation Study

被引：58

作者：

Zhang, Hong ^{[1
]}

Ni, Wandong ^{[2
]}

Li, Jing ^{[1
]}

Zhang, Jiajun ^{[3
]}

机构：

[1] China Acad Chinese Med Sci, Guanganmen Hosp, Comp Ctr, 5 Beixiange, Beijing 100053, Peoples R China

[2] State Adm Tradit Chinese Med, Certificat Ctr Tradit Chinese Med, Phys Qualificat, Beijing, Peoples R China

[3] NCT Lab Corp, Dept Software Engn, Billerica, MA USA

来源：

JMIR MEDICAL INFORMATICS | 2020年 / 8卷 / 06期

关键词：

traditional Chinese medicine; TCM; disease diagnosis; syndrome prediction; syndrome differentiation; natural language processing; NLP; artificial intelligence; AI; assistive diagnostic system; convolutional neural network; CNN; machine learning; ML; BiLSTM-CRF;

D O I：

10.2196/17608

中图分类号：

R-058 [];

学科分类号：

摘要：

Background: Artificial intelligence-based assistive diagnostic systems imitate the deductive reasoning process of a human physician in biomedical disease diagnosis and treatment decision making. While impressive progress in this area has been reported, most of the reported successes are applications of artificial intelligence in Western medicine. The application of artificial intelligence in traditional Chinese medicine has lagged mainly because traditional Chinese medicine practitioners need to perform syndrome differentiation as well as biomedical disease diagnosis before a treatment decision can be made. Syndrome, a concept unique to traditional Chinese medicine, is an abstraction of a variety of signs and symptoms. The fact that the relationship between diseases and syndromes is not one-to-one but rather many-to-many makes it very challenging for a machine to perform syndrome predictions. So far, only a handful of artificial intelligence-based assistive traditional Chinese medicine diagnostic models have been reported, and they are limited in application to a single disease-type. Objective: The objective was to develop an artificial intelligence-based assistive diagnostic system capable of diagnosing multiple types of diseases that are common in traditional Chinese medicine, given a patient's electronic health record notes. The system was designed to simultaneously diagnose the disease and produce a list of corresponding syndromes. Methods: Unstructured freestyle electronic health record notes were processed by natural language processing techniques to extract clinical information such as signs and symptoms which were represented by named entities. Natural language processing used a recurrent neural network model called bidirectional long short-term memory network-conditional random forest. A convolutional neural network was then used to predict the disease-type out of 187 diseases in traditional Chinese medicine. A novel traditional Chinese medicine syndrome prediction method-an integrated learning model-was used to produce a corresponding list of probable syndromes. By following a majority-rule voting method, the integrated learning model for syndrome prediction can take advantage of four existing prediction methods (back propagation, random forest, extreme gradient boosting, and support vector classifier) while avoiding their respective weaknesses which resulted in a consistently high prediction accuracy. Results: A data set consisting of 22,984 electronic health records from Guanganmen Hospital of the China Academy of Chinese Medical Sciences that were collected between January 1, 2017 and September 7, 2018 was used. The data set contained a total of 187 diseases that are commonly diagnosed in traditional Chinese medicine. The diagnostic system was designed to be able to detect any one of the 187 disease-types. The data set was partitioned into a training set, a validation set, and a testing set in a ratio of 8:1:1. Test results suggested that the proposed system had a good diagnostic accuracy and a strong capability for generalization. The disease-type prediction accuracies of the top one, top three, and top five were 80.5%, 91.6%, and 94.2%, respectively. Conclusions: The main contributions of the artificial intelligence-based traditional Chinese medicine assistive diagnostic system proposed in this paper are that 187 commonly known traditional Chinese medicine diseases can be diagnosed and a novel prediction method called an integrated learning model is demonstrated. This new prediction method outperformed all four existing methods in our preliminary experimental results. With further improvement of the algorithms and the availability of additional electronic health record data, it is expected that a wider range of traditional Chinese medicine disease-types could be diagnosed and that better diagnostic accuracies could be achieved.

引用

页数：12

共 23 条

[11] Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning
Kermany, Daniel S.
Goldbaum, Michael
Cai, Wenjia
Valentim, Carolina C. S.
Liang, Huiying
Baxter, Sally L.
McKeown, Alex
Yang, Ge
Wu, Xiaokang
Yan, Fangbing
Dong, Justin
Prasadha, Made K.
Pei, Jacqueline
Ting, Magdalena
Zhu, Jie
Li, Christina
Hewett, Sierra
Dong, Jason
Ziyar, Ian
Shi, Alexander
Zhang, Runze
Zheng, Lianghong
Hou, Rui
Shi, William
Fu, Xin
Duan, Yaou
Huu, Viet A. N.
Wen, Cindy
Zhang, Edward D.
Zhang, Charlotte L.
Li, Oulan
Wang, Xiaobo
Singer, Michael A.
Sun, Xiaodong
Xu, Jie
Tafreshi, Ali
Lewis, M. Anthony
Xia, Huimin
Zhang, Kang
[J]. CELL, 2018, 172 (05) : 1122 - +
[12] Lample G, 2016, 2016 C N AM CHAPTER, DOI DOI 10.18653/V1/N16-1030
[13] Gradient-based learning applied to document recognition
Lecun, Y
Bottou, L
Bengio, Y
Haffner, P
[J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
[14] Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence
Liang, Huiying
Tsui, Brian Y.
Ni, Hao
Valentim, Carolina C. S.
Baxter, Sally L.
Liu, Guangjian
Cai, Wenjia
Kermany, Daniel S.
Sun, Xin
Chen, Jiancong
He, Liya
Zhu, Jie
Tian, Pin
Shao, Hua
Zheng, Lianghong
Hou, Rui
Hewett, Sierra
Li, Gen
Liang, Ping
Zang, Xuan
Zhang, Zhiqi
Pan, Liyan
Cai, Huimin
Ling, Rujuan
Li, Shuhua
Cui, Yongwang
Tang, Shusheng
Ye, Hong
Huang, Xiaoyan
He, Waner
Liang, Wenqing
Zhang, Qing
Jiang, Jianmin
Yu, Wei
Gao, Jianqun
Ou, Wanxing
Deng, Yingmin
Hou, Qiaozhen
Wang, Bei
Yao, Cuichan
Liang, Yan
Zhang, Shu
Duan, Yaou
Zhang, Runze
Gibson, Sarah
Zhang, Charlotte L.
Li, Oulan
Zhang, Edward D.
Karin, Gabriel
Nguyen, Nathan
[J]. NATURE MEDICINE, 2019, 25 (03) : 433 - +
[15] Liu G., 2014, Math. Probl. Eng, V2014, P1
[16] Detecting pioglitazone use and risk of cardiovascular events using electronic health record data in a large cohort of Chinese patients with type 2 diabetes
Miao, Shumei
Dong, Xiao
Zhang, Xin
Jing, Shenqi
Zhang, Xiaoliang
Xu, Tingyu
Wang, Li
Du, Xianglin
Xu, Hua
Liu, Yun
[J]. JOURNAL OF DIABETES, 2019, 11 (08) : 684 - 689
[17] Nagpal K., 2019, NATURE, P2, DOI [10.1038/s41746-019-0196-8, DOI 10.1038/S41746-019-0196-8]
[18] Poostchi H., 2018, 2018 P 11 INT C LANG
[19] Scalable and accurate deep learning with electronic health records
Rajkomar, Alvin
Oren, Eyal
Chen, Kai
Dai, Andrew M.
Hajaj, Nissan
Hardt, Michaela
Liu, Peter J.
Liu, Xiaobing
Marcus, Jake
Sun, Mimi
Sundberg, Patrik
Yee, Hector
Zhang, Kun
Zhang, Yi
Flores, Gerardo
Duggan, Gavin E.
Irvine, Jamie
Quoc Le
Litsch, Kurt
Mossin, Alexander
Tansuwan, Justin
Wang, De
Wexler, James
Wilson, Jimbo
Ludwig, Dana
Volchenboum, Samuel L.
Chou, Katherine
Pearson, Michael
Madabushi, Srinivasan
Shah, Nigam H.
Butte, Atul J.
Howell, Michael D.
Cui, Claire
Corrado, Greg S.
Dean, Jeffrey
[J]. NPJ DIGITAL MEDICINE, 2018, 1
[20] Sun G., 2017, INT J PERFORMABILITY, V13, P446, DOI [10.23940/ijpe.17.04.p12.446457, DOI 10.23940/IJPE.17.04.P12.446457]

← 1 2 3 →