Risk Prediction of Diabetes: Big data mining with fusion of multifarious physical examination indicators

被引:134
作者
Yang, Hui [1 ]
Luo, Yamei [2 ]
Ren, Xiaolei [3 ,4 ]
Wu, Ming [3 ]
He, Xiaolin [3 ]
Peng, Bowen [5 ]
Deng, Kejun [1 ]
Yan, Dan [6 ]
Tang, Hua [7 ,8 ]
Lin, Hao [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Ctr Informat Biol, Chengdu 610054, Peoples R China
[2] Southwest Med Univ, Sch Med Informat & Engn, Luzhou 646000, Peoples R China
[3] Heima Digital Technol Ltd, Luzhou 646000, Peoples R China
[4] Chuanjiang Sci & Technol Res Inst Ltd, Luzhou 646000, Peoples R China
[5] Hlth Commiss Sichuan Prov, Div Int Cooperat, Chengdu 610041, Peoples R China
[6] Captial Med Univ, Beijing Friendship Hosp, Beijing 100050, Peoples R China
[7] Southwest Med Univ, Sch Basic Med Sci, Luzhou 646000, Peoples R China
[8] Cent Nervous Syst Drug Key Lab Sichuan Prov, Luzhou 646000, Peoples R China
关键词
TYPE-2; WEIGHT; COMPLICATIONS; SCORE;
D O I
10.1016/j.inffus.2021.02.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diabetes is a global epidemic. Long-term exposure to hyperglycemia can cause chronic damage to various tissues. Thus, early diagnosis of diabetes is crucial. In this study, we designed a computational system to predict diabetes risk by fusing multifarious types of physical examination data. We collected 1,507,563 physical examination data of healthy people and diabetes patients, as well as 387,076 physical examination data from the follow-up records from 2011 to 2017 of diabetes patients in Luzhou City in China. Three types of physical examination indexes were statistically analyzed: demographics, vital signs, and laboratory values. To distinguish diabetes patients from healthy people, a model based on eXtreme Gradient Boosting (XGBoost) was developed, which could produce an area under the receiver operating characteristic curve (AUC) of 0.8768. Moreover, to improve the convenience and flexibility of the model in clinical and real-life scenarios, a diabetes risk scorecard was established based on logistic regression, which could evaluate human health. Lastly, we statistically analyzed the data from the follow-up records to identify the key factors influencing patient control of their conditions. To improve the diabetes cascade screening and personal lifestyle management, an online diabetes risk assessment system was established, which can be freely accessed at http://lin-group.cn/server/DRSC/index.html. This system is expected to provide guidance for human health management.
引用
收藏
页码:140 / 149
页数:10
相关论文
共 43 条
[1]   Diabetes and Complications: Cellular Signaling Pathways, Current Understanding and Targeted Therapies [J].
Adeshara, Krishna A. ;
Diwan, Arundhati G. ;
Tupe, Rashmi S. .
CURRENT DRUG TARGETS, 2016, 17 (11) :1309-1328
[2]  
Alam Uazman, 2014, Handb Clin Neurol, V126, P211, DOI 10.1016/B978-0-444-53480-4.00015-1
[3]   Classification and Diagnosis of Diabetes [J].
不详 .
DIABETES CARE, 2015, 38 :S8-S16
[4]  
[Anonymous], ery and Data Mining, DOI DOI 10.1145/2939672.2939785
[5]   Predicting Risk of Type 2 Diabetes Mellitus with Genetic Risk Models on the Basis of Established Genome-wide Association Markers: A Systematic Review [J].
Bao, Wei ;
Hu, Frank B. ;
Rong, Shuang ;
Rong, Ying ;
Bowers, Katherine ;
Schisterman, Enrique F. ;
Liu, Liegang ;
Zhang, Cuilin .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2013, 178 (08) :1197-1207
[6]   Machine intelligence in peptide therapeutics: A next-generation tool for rapid disease screening [J].
Basith, Shaherin ;
Manavalan, Balachandran ;
Shin, Tae Hwan ;
Lee, Gwang .
MEDICINAL RESEARCH REVIEWS, 2020, 40 (04) :1276-1314
[7]   iGHBP: Computational identification of growth hormone binding proteins from sequences using extremely randomised tree [J].
Basith, Shaherin ;
Manavalan, Balachandran ;
Shin, Tae Hwan ;
Lee, Gwang .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2018, 16 :412-420
[8]  
Bonacaro Antonio, 2019, Prof Inferm, V72, P143
[9]  
Cristelo C, 2020, DIABETES RES CLIN PR, V164, DOI 10.1016/j.diabres.2020.108228
[10]   Consumer credit, chronic disease and risk behaviours [J].
Dean, Lorraine T. ;
Knapp, Emily A. ;
Snguon, Sevly ;
Ransome, Yusuf ;
Qato, Dima M. ;
Visvanathan, Kala .
JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 2019, 73 (01) :73-78