Interpretable machine learning analysis to identify risk factors for diabetes using the anonymous living census data of Japan

被引:3
作者
Jiang, Pei [1 ,2 ]
Suzuki, Hiroyuki [3 ]
Obi, Takashi [4 ]
机构
[1] Tokyo Inst Technol, Dept Engineer, Course Informat & Commun, Yokohama, Kanagawa, Japan
[2] 4259 Nagatsutachou,Midori Ward, Yokohama, Kanagawa 2260026, Japan
[3] Gunma Univ, Ctr Math & Data Sci, Maebashi, Gunma, Japan
[4] Tokyo Inst Technol, Inst Innovat Res, Yokohama, Kanagawa, Japan
关键词
Interpretable machine learning; Non-objective-oriented census data; Diabetes; Risk factors; ALCOHOL-CONSUMPTION; CIGARETTE-SMOKING; MELLITUS; MEN; METAANALYSIS; ASSOCIATION; PREDICTION; MORTALITY; XGBOOST; STRESS;
D O I
10.1007/s12553-023-00730-w
中图分类号
R-058 [];
学科分类号
摘要
PurposeDiabetes mellitus causes various problems in our life. With the big data boom in our society, some risk factors for Diabetes must still exist. To identify new risk factors for diabetes in the big data society and explore further efficient use of big data, the non-objective-oriented census data about the Japanese Citizen's Survey of Living Conditions were analyzed using interpretable machine learning methods.MethodsSeven interpretable machine learning methods were used to analysis Japan citizens' census data. Firstly, logistic analysis was used to analyze the risk factors of diabetes from 19 selected initial elements. Then, the linear analysis, linear discriminate analysis, Hayashi's quantification analysis method 2, random forest, XGBoost, and SHAP methods were used to re-check and find the different factor contributions. Finally, the relationship among the factors was analyzed to understand the relationship among factors.ResultsFour new risk factors: the number of family members, insurance type, public pension type, and health awareness level, were found as risk factors for diabetes mellitus for the first time, while another 11 risk factors were reconfirmed in this analysis. Especially the insurance type factor and health awareness level factor make more contributions to diabetes than factors: hypertension, hyperlipidemia, and stress in some interpretable models. We also found that work years were identified as a risk factor for diabetes because it has a high coefficient with the risk factor of age.ConclusionsNew risk factors for diabetes mellitus were identified based on Japan's non-objective-oriented anonymous census data using interpretable machine learning models. The newly identified risk factors inspire new possible policies for preventing diabetes. Moreover, our analysis certifies that big data can help us find helpful knowledge in today's prosperous society. Our study also paves the way for identifying more risk factors and promoting the efficiency of using big data.
引用
收藏
页码:119 / 131
页数:13
相关论文
共 65 条
  • [1] The magnitude of association between overweight and obesity and the risk of diabetes: A meta-analysis of prospective cohort studies
    Abdullah, Asnawi
    Peeters, Anna
    de Courten, Maximilian
    Stoelwinder, Johannes
    [J]. DIABETES RESEARCH AND CLINICAL PRACTICE, 2010, 89 (03) : 309 - 319
  • [2] Smoking and the risk of type 2 diabetes in Japan: A systematic review and meta-analysis
    Akter, Shamima
    Goto, Atsushi
    Mizoue, Tetsuya
    [J]. JOURNAL OF EPIDEMIOLOGY, 2017, 27 (12) : 553 - 561
  • [3] Smoking, Smoking Cessation, and the Risk of Type 2 Diabetes among Japanese Adults: Japan Epidemiology Collaboration on Occupational Health Study
    Akter, Shamima
    Okazaki, Hiroko
    Kuwahara, Keisuke
    Miyamoto, Toshiaki
    Murakami, Taizo
    Shimizu, Chii
    Shimizu, Makiko
    Tomita, Kentaro
    Nagahama, Satsue
    Eguchi, Masafumi
    Kochi, Takeshi
    Imai, Teppei
    Nishihara, Akiko
    Sasaki, Naoko
    Nakagawa, Tohru
    Yamamoto, Shuichiro
    Honda, Toru
    Uehara, Akihiko
    Yamamoto, Makoto
    Hori, Ai
    Sakamoto, Nobuaki
    Nishiura, Chiro
    Totsuzaki, Takafumi
    Kato, Noritada
    Fukasawa, Kenji
    Ngoc M. Pham
    Kurotani, Kayo
    Nanri, Akiko
    Kabe, Isamu
    Mizoue, Tetsuya
    Sone, Tomofumi
    Dohi, Seitaro
    [J]. PLOS ONE, 2015, 10 (07):
  • [4] Al-Delaimy WK, SMOKING MORTALITY WO
  • [5] American Diabetes Association, RES ED ADV
  • [6] Increasing prevalence of diabetes mellitus in a developing country and its related factors
    Animaw, Worku
    Seyoum, Yeshaneh
    [J]. PLOS ONE, 2017, 12 (11):
  • [7] [Anonymous], ENGLISH JAPAN DIABET
  • [8] [Anonymous], JAPAN PREVENTIVE ASS
  • [9] [Anonymous], Certification Board for Music Therapists
  • [10] [Anonymous], DIABETES NET