共 50 条
Machine learning prediction of pathological complete response and overall survival of breast cancer patients in an underserved inner-city population
被引:0
|作者:
Dell'Aquila, Kevin
[1
]
Vadlamani, Abhinav
[1
]
Maldjian, Takouhie
[1
]
Fineberg, Susan
[2
]
Eligulashvili, Anna
[1
]
Chung, Julie
[3
]
Adam, Richard
[1
]
Hodges, Laura
[1
]
Hou, Wei
[1
]
Makower, Della
[3
]
Duong, Tim Q.
[1
,4
]
机构:
[1] Montefiore Hlth Syst & Albert Einstein Coll Med, Dept Radiol, 111 210th St, Bronx, NY 10467 USA
[2] Montefiore Hlth Syst & Albert Einstein Coll Med, Dept Pathol, Bronx, NY USA
[3] Montefiore Hlth Syst & Albert Einstein Coll Med, Dept Oncol, Bronx, NY USA
[4] Montefiore Hlth Syst & Albert Einstein Coll Med, Ctr Hlth Data Innovat, Bronx, NY USA
关键词:
Molecular subtypes;
Tumor subtypes;
Socioeconomic status;
Progression-free survival;
Deep learning;
Health disparity;
BACKGROUND PARENCHYMAL ENHANCEMENT;
CONVOLUTIONAL NEURAL-NETWORK;
LYMPH-NODE METASTASIS;
NEOADJUVANT CHEMOTHERAPY;
SOCIOECONOMIC-STATUS;
RACIAL DISPARITY;
POOLED ANALYSIS;
AMERICAN WOMEN;
PRIMARY TUMOR;
OUTCOMES;
D O I:
10.1186/s13058-023-01762-w
中图分类号:
R73 [肿瘤学];
学科分类号:
100214 ;
摘要:
Background Generalizability of predictive models for pathological complete response (pCR) and overall survival (OS) in breast cancer patients requires diverse datasets. This study employed four machine learning models to predict pCR and OS up to 7.5 years using data from a diverse and underserved inner-city population. Methods Demographics, staging, tumor subtypes, income, insurance status, and data from radiology reports were obtained from 475 breast cancer patients on neoadjuvant chemotherapy in an inner-city health system (01/01/2012 to 12/31/2021). Logistic regression, Neural Network, Random Forest, and Gradient Boosted Regression models were used to predict outcomes (pCR and OS) with fivefold cross validation. Results pCR was not associated with age, race, ethnicity, tumor staging, Nottingham grade, income, and insurance status (p > 0.05). ER-/HER2+ showed the highest pCR rate, followed by triple negative, ER+/HER2+, and ER+/HER2- (all p < 0.05), tumor size (p < 0.003) and background parenchymal enhancement (BPE) (p < 0.01). Machine learning models ranked ER+/HER2-, ER-/HER2+, tumor size, and BPE as top predictors of pCR (AUC = 0.74-0.76). OS was associated with race, pCR status, tumor subtype, and insurance status (p < 0.05), but not ethnicity and incomes (p > 0.05). Machine learning models ranked tumor stage, pCR, nodal stage, and triple-negative subtype as top predictors of OS (AUC = 0.83-0.85). When grouping race and ethnicity by tumor subtypes, neither OS nor pCR were different due to race and ethnicity for each tumor subtype (p > 0.05). Conclusion Tumor subtypes and imaging characteristics were top predictors of pCR in our inner-city population. Insurance status, race, tumor subtypes and pCR were associated with OS. Machine learning models accurately predicted pCR and OS.
引用
收藏
页数:14
相关论文