Analysis of disease comorbidity patterns in a large-scale China population

被引:37
作者
Guo, Mengfei [1 ,2 ]
Yu, Yanan [1 ,2 ]
Wen, Tiancai [3 ,4 ]
Zhang, Xiaoping [5 ]
Liu, Baoyan [5 ]
Zhang, Jin [6 ]
Zhang, Runshun [7 ]
Zhang, Yanning [4 ]
Zhou, Xuezhong [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Traff Data Anal & Min, Beijing 100044, Peoples R China
[3] China Acad Chinese Med Sci, Inst Basic Res Clin Med, Beijing 100700, Peoples R China
[4] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Shanxi, Peoples R China
[5] China Acad Chinese Med Sci, Beijing 100070, Peoples R China
[6] China Acad Chinese Med Sci, Data Ctr Tradit Chinese Med, Beijing 100700, Peoples R China
[7] China Acad Chinese Med Sci, Guanganmen Hosp, Beijing 100053, Peoples R China
基金
中国国家自然科学基金;
关键词
Disease comorbidity; Complex network; Network medicine; RISK-FACTOR; CARDIOVASCULAR-DISEASE; KIDNEY-DISEASE; HYPERTENSION; PREVALENCE; OBESITY; ADULTS; HEART;
D O I
10.1186/s12920-019-0629-x
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Disease comorbidity is popular and has significant indications for disease progress and management. We aim to detect the general disease comorbidity patterns in Chinese populations using a large-scale clinical data set. Methods: We extracted the diseases from a large-scale anonymized data set derived from 8,572,137 inpatients in 453 hospitals across China. We built a Disease Comorbidity Network (DCN) using correlation analysis and detected the topological patterns of disease comorbidity using both complex network and data mining methods. The comorbidity patterns were further validated by shared molecular mechanisms using disease-gene associations and pathways. To predict the disease occurrence during the whole disease progressions, we applied four machine learning methods to model the disease trajectories of patients. Results: We obtained the DCN with 5702 nodes and 258,535 edges, which shows a power law distribution of the degree and weight. It further indicated that there exists high heterogeneity of comorbidities for different diseases and we found that the DCN is a hierarchical modular network with community structures, which have both homogeneous and heterogeneous disease categories. Furthermore, adhering to the previous work from US and Europe populations, we found that the disease comorbidities have their shared underlying molecular mechanisms. Furthermore, take hypertension and psychiatric disease as instance, we used four classification methods to predicte the disease occurrence using the comorbid disease trajectories and obtained acceptable performance, in which in particular, random forest obtained an overall best performance (with F1-score 0.6689 for hypertension and 0.6802 for psychiatric disease). Conclusions: Our study indicates that disease comorbidity is significant and valuable to understand the disease incidences and their interactions in real-world populations, which will provide important insights for detection of the patterns of disease classification, diagnosis and prognosis.
引用
收藏
页数:10
相关论文
共 44 条
[1]  
Bragina EY, 2016, BIOMED GENET GENOM, V1, pZ2
[2]  
Cao X, 2016, J CARDIOVASC SURG, V5, P24
[3]  
Capobianco E, 2013, TRENDS MOL MED, V19, P515, DOI 10.1016/j.molmed.2013.07.004
[4]   State of Diabetes, Hypertension, and Comorbidity in Kuwait: Showcasing the Trends as Seen in Native Versus Expatriate Populations [J].
Channanath, Arshad Mohamed ;
Farran, Bassam ;
Behbehani, Kazem ;
Thanaraj, Thangavel Alphonse .
DIABETES CARE, 2013, 36 (06) :E75-E75
[5]  
Chaturvedi P., 2012, Int J Comput Appl, V48, P32
[6]   Endothelial Transdifferentiation of Tumor Cells Triggered by the Twist1-Jagged1-KLF4 Axis: Relationship between Cancer Stemness and Angiogenesis [J].
Chen, Hsiao-Fan ;
Wu, Kou-Juey .
STEM CELLS INTERNATIONAL, 2016, 2016
[7]   A note on the almost sure central limit theorem for the product of some partial sums [J].
Chen, Yang ;
Tan, Zhongquan ;
Wang, Kaiyong .
JOURNAL OF INEQUALITIES AND APPLICATIONS, 2014, :1-11
[8]  
Conti CR, 2001, CLIN CARDIOL, V24, P1
[9]   The difficult clinical management of the combination of hypertension with aortic stenosis [J].
de Simone, Giovanni .
JOURNAL OF HYPERTENSION, 2010, 28 (02) :234-236
[10]   Predicting Hub Genes Associated with Cervical Cancer through Gene Co-Expression Networks [J].
Deng, Su-Ping ;
Zhu, Lin ;
Huang, De-Shuang .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (01) :27-35