End-to-End syndrome differentiation of Yin deficiency and Yang deficiency in traditional Chinese medicine

被引:47
作者
Hu, Qinan [1 ,2 ]
Yu, Tong [3 ]
Li, Jinghua [3 ]
Yu, Qi [3 ]
Zhu, Ling [3 ]
Gu, Yueguo [1 ,2 ]
机构
[1] Chinese Acad Social Sci, Inst Linguist, Beijing 100732, Peoples R China
[2] China Multilingual & Multimodal Corpora & Big Dat, Beijing 100089, Peoples R China
[3] China Acad Chinese Med Sci, Inst Informat Tradit Chinese Med, Beijing 100700, Peoples R China
关键词
Traditional Chinese medicine; Syndrome differentiation; Yin deficiency; Yang deficiency; Text classification; End-to-end; Convolutional neural networks; FastText;
D O I
10.1016/j.cmpb.2018.10.011
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective. Yin and Yang, two concepts adapted from classical Chinese philosophy, play a diagnostic role in Traditional Chinese Medicine (TCM). The Yin and Yang in harmonious balance indicate health, whereas imbalances to either side indicate unhealthiness, which may result in diseases. Yin-yang disharmony is considered to be the cause of pathological changes. Syndrome differentiation of yin-yang is crucial to clinical diagnosis. It lays a foundation for subsequent medical judgments, including therapeutic methods, and formula, among many others. However, because of the complexities of the mechanisms and manifestations of disease, it is difficult to exactly point out which one, yin or yang, is disharmonious. There has been inadequate research conducted on syndrome differentiation of yin and yang from a computational perspective. In this study, we present a computational method, viz. an end-to-end syndrome differentiation of yin deficiency and yang deficiency. Methods. Unlike most previous studies on syndrome differentiation, which use structured datasets, this study takes unstructured texts in medical records as its inputs. It models syndrome differentiation as a task of text classification. This study experiments on two state-of-the-art end-to-end algorithms for text classification, i.e. a classic convolutional neural network (CNN) and fastText. These two systems take the n-grams of several types of tokens as their inputs, including characters, terms, and words. Results. When evaluated on a data set with 7326 modern medical records in TCM, it is observed that CNN and fastText generally give rise to comparable performances. The best accuracy rate of 92.55% comes from the system taking inputs as raw as n-grams of characters. It implies that one can build at least a moderate system for the differentiation of yin deficiency and yang deficiency even if he has no glossary or tokenizer at hand. Conclusions. This study has demonstrated the feasibility of using end-to-end text classification algorithms to differentiate yin deficiency and yang deficiency on unstructured medical records. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:9 / 15
页数:7
相关论文
共 27 条
  • [1] [Anonymous], ACTA CHI MED PHARM
  • [2] [Anonymous], EVIDENCE BASED COMPL
  • [3] [Anonymous], 2014, DATA ANAL TRADITIONA
  • [4] [Anonymous], INT J ROB RES
  • [5] [Anonymous], 2015, EVIDENCE BASED COMPL
  • [6] [Anonymous], EVIDENCE BASED COMPL
  • [7] The Unified Medical Language System (UMLS): integrating biomedical terminology
    Bodenreider, O
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D267 - D270
  • [8] Che W, 2010, P COL 2010 DEM BEIJ, P13, DOI [10.5555/1944284.1944288, DOI 10.5555/1944284.1944288]
  • [9] Hou J, 2017, 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), P725, DOI 10.1109/ICBDA.2017.8078731
  • [10] Syndrome differentiation in modern research of traditional Chinese medicine
    Jiang, Miao
    Lu, Cheng
    Zhang, Chi
    Yang, Jing
    Tan, Yong
    Lu, Aiping
    Chan, Kelvin
    [J]. JOURNAL OF ETHNOPHARMACOLOGY, 2012, 140 (03) : 634 - 642