Multi-modality risk prediction of cardiovascular diseases for breast cancer cohort in the All of Us Research Program

被引:1
|
作者
Yang, Han [1 ]
Zhou, Sicheng [1 ]
Rao, Zexi [2 ]
Zhao, Chen [2 ]
Cui, Erjia [2 ]
Shenoy, Chetan [3 ]
Blaes, Anne H. [4 ]
Paidimukkala, Nishitha [1 ]
Wang, Jinhua [5 ]
Hou, Jue [2 ]
Zhang, Rui [6 ]
机构
[1] Univ Minnesota, Inst Hlth Informat, Minneapolis, MN 55455 USA
[2] Univ Minnesota, Sch Publ Hlth, Div Biostat & Hlth Data Sci, 2221 Univ Ave SE,Suite 200, Minneapolis, MN 55414 USA
[3] Univ Minnesota, Med Ctr, Dept Med, Cardiovasc Div, Minneapolis, MN 55455 USA
[4] Univ Minnesota, Div Hematol Oncol & Transplantat, Minneapolis, MN 55455 USA
[5] Univ Minnesota, Masonic Canc Ctr, Minneapolis, MN 55455 USA
[6] Univ Minnesota, Dept Surg, Div Comp Hlth Sci, 308 Harvard St SE, Minneapolis, MN 55455 USA
基金
美国国家卫生研究院;
关键词
cardiovascular disease; breast cancer; predictive model; All of Us; SOCIAL DETERMINANTS; SURVIVAL; MODELS; TIME; ASSOCIATIONS; STATEMENT; SELECTION; IMPACT; INDEX; LASSO;
D O I
10.1093/jamia/ocae199
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective This study leverages the rich diversity of the All of Us Research Program (All of Us)'s dataset to devise a predictive model for cardiovascular disease (CVD) in breast cancer (BC) survivors. Central to this endeavor is the creation of a robust data integration pipeline that synthesizes electronic health records (EHRs), patient surveys, and genomic data, while upholding fairness across demographic variables.Materials and Methods We have developed a universal data wrangling pipeline to process and merge heterogeneous data sources of the All of Us dataset, address missingness and variance in data, and align disparate data modalities into a coherent framework for analysis. Utilizing a composite feature set including EHR, lifestyle, and social determinants of health (SDoH) data, we then employed Adaptive Lasso and Random Forest regression models to predict 6 CVD outcomes. The models were evaluated using the c-index and time-dependent Area Under the Receiver Operating Characteristic Curve over a 10-year period.Results The Adaptive Lasso model showed consistent performance across most CVD outcomes, while the Random Forest model excelled particularly in predicting outcomes like transient ischemic attack when incorporating the full multi-model feature set. Feature importance analysis revealed age and previous coronary events as dominant predictors across CVD outcomes, with SDoH clustering labels highlighting the nuanced impact of social factors.Discussion The development of both Cox-based predictive model and Random Forest Regression model represents the extensive application of the All of Us, in integrating EHR and patient surveys to enhance precision medicine. And the inclusion of SDoH clustering labels revealed the significant impact of sociobehavioral factors on patient outcomes, emphasizing the importance of comprehensive health determinants in predictive models. Despite these advancements, limitations include the exclusion of genetic data, broad categorization of CVD conditions, and the need for fairness analyses to ensure equitable model performance across diverse populations. Future work should refine clinical and social variable measurements, incorporate advanced imputation techniques, and explore additional predictive algorithms to enhance model precision and fairness.Conclusion This study demonstrates the liability of the All of Us's diverse dataset in developing a multi-modality predictive model for CVD in BC survivors risk stratification in oncological survivorship. The data integration pipeline and subsequent predictive models establish a methodological foundation for future research into personalized healthcare.
引用
收藏
页码:2800 / 2810
页数:11
相关论文
共 50 条
  • [41] Impact of cumulative body mass index and cardiometabolic diseases on survival among patients with colorectal and breast cancer: a multi-centre cohort study
    Kohls, Mirjam
    Freisling, Heinz
    Charvat, Hadrien
    Soerjomataram, Isabelle
    Viallon, Vivian
    Davila-Batista, Veronica
    Kaaks, Rudolf
    Turzanski-Fortner, Renee
    Aleksandrova, Krasimira
    Schulze, Matthias B.
    Dahm, Christina C.
    Tilma Vistisen, Helene
    Rostgaard-Hansen, Agnetha Linn
    Tjonneland, Anne
    Bonet, Catalina
    Sanchez, Maria-Jose
    Colorado-Yohar, Sandra
    Masala, Giovanna
    Palli, Domenico
    Krogh, Vittorio
    Ricceri, Fulvio
    Rolandsson, Olov
    Lu, Sai San Moon
    Tsilidis, Konstantinos K.
    Weiderpass, Elisabete
    Gunter, Marc J.
    Ferrari, Pietro
    Berger, Ursula
    Arnold, Melina
    BMC CANCER, 2022, 22 (01)
  • [42] Aromatase inhibitors use and risk for cardiovascular disease in breast cancer patients: A population-based cohort study
    Sund, Maria
    Garcia-Argibay, Miguel
    Garmo, Hans
    Ahlgren, Johan
    Wennstig, Anna-Karin
    Fredriksson, Irma
    Lindman, Henrik
    Valachis, Antonis
    BREAST, 2021, 59 : 157 - 164
  • [43] LOW CARBOHYDRATE AND HIGH PROTEIN DIETS AND ALL-CAUSE, CANCER AND CARDIOVASCULAR DISEASES MORTALITIES: A SYSTEMATIC REVIEW AND META-ANALYSIS FROM 7 COHORT STUDIES
    Zhou, J.
    Xu, H.
    ACTA ENDOCRINOLOGICA-BUCHAREST, 2014, 10 (02) : 259 - 266
  • [44] Pooled Cohort Equations and the competing risk of cardiovascular disease versus cancer: Multi-Ethnic study of atherosclerosis
    Whelton, Seamus P.
    Marshall, Catherine Handy
    Cainzos-Achirica, Miguel
    Dzaye, Omar
    Blumenthal, Roger S.
    Nasir, Khurram
    McClelland, Robyn L.
    Blaha, Michael J.
    AMERICAN JOURNAL OF PREVENTIVE CARDIOLOGY, 2021, 7
  • [45] The Risk of Cancer-Associated and Radiotherapy-Associated Cardiovascular Diseases among Patients with Breast Cancer
    Hsieh, Cheng-Tzu
    Lee, Wen-Chung
    Chiang, Chun-Ju
    Wang, Chia-Chun
    Hsu, Hsin-Yin
    Lin, Hung-Ju
    Yeh, Tzu-Lin
    Tsai, Ming-Chieh
    Jhuang, Jing-Rong
    Hsiao, Bo-Yu
    Chien, Kuo-Liong
    CLINICAL BREAST CANCER, 2024, 24 (02) : 131 - +
  • [46] The Role of Cancer in the Risk of Cardiovascular and All-Cause Mortality: A Nationwide Prospective Cohort Study
    Shen, Ruihuan
    Wang, Jia
    Wang, Rui
    Tian, Yuqing
    Guo, Peiyao
    Shen, Shuhui
    Liu, Donghao
    Zou, Tong
    INTERNATIONAL JOURNAL OF PUBLIC HEALTH, 2023, 68
  • [47] Potassium levels and the risk of all-cause and cardiovascular mortality among patients with cardiovascular diseases: a meta-analysis of cohort studies
    Yahui Fan
    Min Wu
    Xiaohui Li
    Jinping Zhao
    Jia Shi
    Lu Ding
    Hong Jiang
    Zhaofang Li
    Wei Zhang
    Tianyou Ma
    Duolao Wang
    Le Ma
    Nutrition Journal, 23
  • [48] Potassium levels and the risk of all-cause and cardiovascular mortality among patients with cardiovascular diseases: a meta-analysis of cohort studies
    Fan, Yahui
    Wu, Min
    Li, Xiaohui
    Zhao, Jinping
    Shi, Jia
    Ding, Lu
    Jiang, Hong
    Li, Zhaofang
    Zhang, Wei
    Ma, Tianyou
    Wang, Duolao
    Ma, Le
    NUTRITION JOURNAL, 2024, 23 (01)
  • [49] The Breast Cancer to Bone (B2B) Metastases Research Program: a multi-disciplinary investigation of bone metastases from breast cancer
    Brockton, Nigel T.
    Gill, Stephanie J.
    Laborge, Stephanie L.
    Paterson, Alexander H. G.
    Cook, Linda S.
    Vogel, Hans J.
    Shemanko, Carrie S.
    Hanley, David A.
    Magliocco, Anthony M.
    Friedenreich, Christine M.
    BMC CANCER, 2015, 15
  • [50] The Breast Cancer to Bone (B2B) Metastases Research Program: a multi-disciplinary investigation of bone metastases from breast cancer
    Nigel T. Brockton
    Stephanie J. Gill
    Stephanie L. Laborge
    Alexander H. G. Paterson
    Linda S. Cook
    Hans J. Vogel
    Carrie S. Shemanko
    David A. Hanley
    Anthony M. Magliocco
    Christine M. Friedenreich
    BMC Cancer, 15