Artificial Intelligence-Based Traditional Chinese Medicine Assistive Diagnostic System: Validation Study

被引:58
作者
Zhang, Hong [1 ]
Ni, Wandong [2 ]
Li, Jing [1 ]
Zhang, Jiajun [3 ]
机构
[1] China Acad Chinese Med Sci, Guanganmen Hosp, Comp Ctr, 5 Beixiange, Beijing 100053, Peoples R China
[2] State Adm Tradit Chinese Med, Certificat Ctr Tradit Chinese Med, Phys Qualificat, Beijing, Peoples R China
[3] NCT Lab Corp, Dept Software Engn, Billerica, MA USA
关键词
traditional Chinese medicine; TCM; disease diagnosis; syndrome prediction; syndrome differentiation; natural language processing; NLP; artificial intelligence; AI; assistive diagnostic system; convolutional neural network; CNN; machine learning; ML; BiLSTM-CRF;
D O I
10.2196/17608
中图分类号
R-058 [];
学科分类号
摘要
Background: Artificial intelligence-based assistive diagnostic systems imitate the deductive reasoning process of a human physician in biomedical disease diagnosis and treatment decision making. While impressive progress in this area has been reported, most of the reported successes are applications of artificial intelligence in Western medicine. The application of artificial intelligence in traditional Chinese medicine has lagged mainly because traditional Chinese medicine practitioners need to perform syndrome differentiation as well as biomedical disease diagnosis before a treatment decision can be made. Syndrome, a concept unique to traditional Chinese medicine, is an abstraction of a variety of signs and symptoms. The fact that the relationship between diseases and syndromes is not one-to-one but rather many-to-many makes it very challenging for a machine to perform syndrome predictions. So far, only a handful of artificial intelligence-based assistive traditional Chinese medicine diagnostic models have been reported, and they are limited in application to a single disease-type. Objective: The objective was to develop an artificial intelligence-based assistive diagnostic system capable of diagnosing multiple types of diseases that are common in traditional Chinese medicine, given a patient's electronic health record notes. The system was designed to simultaneously diagnose the disease and produce a list of corresponding syndromes. Methods: Unstructured freestyle electronic health record notes were processed by natural language processing techniques to extract clinical information such as signs and symptoms which were represented by named entities. Natural language processing used a recurrent neural network model called bidirectional long short-term memory network-conditional random forest. A convolutional neural network was then used to predict the disease-type out of 187 diseases in traditional Chinese medicine. A novel traditional Chinese medicine syndrome prediction method-an integrated learning model-was used to produce a corresponding list of probable syndromes. By following a majority-rule voting method, the integrated learning model for syndrome prediction can take advantage of four existing prediction methods (back propagation, random forest, extreme gradient boosting, and support vector classifier) while avoiding their respective weaknesses which resulted in a consistently high prediction accuracy. Results: A data set consisting of 22,984 electronic health records from Guanganmen Hospital of the China Academy of Chinese Medical Sciences that were collected between January 1, 2017 and September 7, 2018 was used. The data set contained a total of 187 diseases that are commonly diagnosed in traditional Chinese medicine. The diagnostic system was designed to be able to detect any one of the 187 disease-types. The data set was partitioned into a training set, a validation set, and a testing set in a ratio of 8:1:1. Test results suggested that the proposed system had a good diagnostic accuracy and a strong capability for generalization. The disease-type prediction accuracies of the top one, top three, and top five were 80.5%, 91.6%, and 94.2%, respectively. Conclusions: The main contributions of the artificial intelligence-based traditional Chinese medicine assistive diagnostic system proposed in this paper are that 187 commonly known traditional Chinese medicine diseases can be diagnosed and a novel prediction method called an integrated learning model is demonstrated. This new prediction method outperformed all four existing methods in our preliminary experimental results. With further improvement of the algorithms and the availability of additional electronic health record data, it is expected that a wider range of traditional Chinese medicine disease-types could be diagnosed and that better diagnostic accuracies could be achieved.
引用
收藏
页数:12
相关论文
共 23 条
  • [11] Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning
    Kermany, Daniel S.
    Goldbaum, Michael
    Cai, Wenjia
    Valentim, Carolina C. S.
    Liang, Huiying
    Baxter, Sally L.
    McKeown, Alex
    Yang, Ge
    Wu, Xiaokang
    Yan, Fangbing
    Dong, Justin
    Prasadha, Made K.
    Pei, Jacqueline
    Ting, Magdalena
    Zhu, Jie
    Li, Christina
    Hewett, Sierra
    Dong, Jason
    Ziyar, Ian
    Shi, Alexander
    Zhang, Runze
    Zheng, Lianghong
    Hou, Rui
    Shi, William
    Fu, Xin
    Duan, Yaou
    Huu, Viet A. N.
    Wen, Cindy
    Zhang, Edward D.
    Zhang, Charlotte L.
    Li, Oulan
    Wang, Xiaobo
    Singer, Michael A.
    Sun, Xiaodong
    Xu, Jie
    Tafreshi, Ali
    Lewis, M. Anthony
    Xia, Huimin
    Zhang, Kang
    [J]. CELL, 2018, 172 (05) : 1122 - +
  • [12] Lample G, 2016, 2016 C N AM CHAPTER, DOI DOI 10.18653/V1/N16-1030
  • [13] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
  • [14] Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence
    Liang, Huiying
    Tsui, Brian Y.
    Ni, Hao
    Valentim, Carolina C. S.
    Baxter, Sally L.
    Liu, Guangjian
    Cai, Wenjia
    Kermany, Daniel S.
    Sun, Xin
    Chen, Jiancong
    He, Liya
    Zhu, Jie
    Tian, Pin
    Shao, Hua
    Zheng, Lianghong
    Hou, Rui
    Hewett, Sierra
    Li, Gen
    Liang, Ping
    Zang, Xuan
    Zhang, Zhiqi
    Pan, Liyan
    Cai, Huimin
    Ling, Rujuan
    Li, Shuhua
    Cui, Yongwang
    Tang, Shusheng
    Ye, Hong
    Huang, Xiaoyan
    He, Waner
    Liang, Wenqing
    Zhang, Qing
    Jiang, Jianmin
    Yu, Wei
    Gao, Jianqun
    Ou, Wanxing
    Deng, Yingmin
    Hou, Qiaozhen
    Wang, Bei
    Yao, Cuichan
    Liang, Yan
    Zhang, Shu
    Duan, Yaou
    Zhang, Runze
    Gibson, Sarah
    Zhang, Charlotte L.
    Li, Oulan
    Zhang, Edward D.
    Karin, Gabriel
    Nguyen, Nathan
    [J]. NATURE MEDICINE, 2019, 25 (03) : 433 - +
  • [15] Liu G., 2014, Math. Probl. Eng, V2014, P1
  • [16] Detecting pioglitazone use and risk of cardiovascular events using electronic health record data in a large cohort of Chinese patients with type 2 diabetes
    Miao, Shumei
    Dong, Xiao
    Zhang, Xin
    Jing, Shenqi
    Zhang, Xiaoliang
    Xu, Tingyu
    Wang, Li
    Du, Xianglin
    Xu, Hua
    Liu, Yun
    [J]. JOURNAL OF DIABETES, 2019, 11 (08) : 684 - 689
  • [17] Nagpal K., 2019, NATURE, P2, DOI [10.1038/s41746-019-0196-8, DOI 10.1038/S41746-019-0196-8]
  • [18] Poostchi H., 2018, 2018 P 11 INT C LANG
  • [19] Scalable and accurate deep learning with electronic health records
    Rajkomar, Alvin
    Oren, Eyal
    Chen, Kai
    Dai, Andrew M.
    Hajaj, Nissan
    Hardt, Michaela
    Liu, Peter J.
    Liu, Xiaobing
    Marcus, Jake
    Sun, Mimi
    Sundberg, Patrik
    Yee, Hector
    Zhang, Kun
    Zhang, Yi
    Flores, Gerardo
    Duggan, Gavin E.
    Irvine, Jamie
    Quoc Le
    Litsch, Kurt
    Mossin, Alexander
    Tansuwan, Justin
    Wang, De
    Wexler, James
    Wilson, Jimbo
    Ludwig, Dana
    Volchenboum, Samuel L.
    Chou, Katherine
    Pearson, Michael
    Madabushi, Srinivasan
    Shah, Nigam H.
    Butte, Atul J.
    Howell, Michael D.
    Cui, Claire
    Corrado, Greg S.
    Dean, Jeffrey
    [J]. NPJ DIGITAL MEDICINE, 2018, 1
  • [20] Sun G., 2017, INT J PERFORMABILITY, V13, P446, DOI [10.23940/ijpe.17.04.p12.446457, DOI 10.23940/IJPE.17.04.P12.446457]