Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework

被引:9
作者
Jiang, Cong [1 ]
Xiu, Yuting [1 ]
Qiao, Kun [1 ]
Yu, Xiao [1 ]
Zhang, Shiyuan [1 ]
Huang, Yuanxi [1 ]
机构
[1] Harbin Med Univ, Dept Breast Surg, Canc Hosp, Harbin, Peoples R China
来源
FRONTIERS IN ONCOLOGY | 2022年 / 12卷
关键词
machine learning; SHAP; IMPC; nomogram; lymph node metastasis; FOLLOW-UP; CANCER; ESTROGEN; RECEPTOR; TUMORS;
D O I
10.3389/fonc.2022.981059
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background and purpose: Machine learning (ML) is applied for outcome prediction and treatment support. This study aims to develop different ML models to predict risk of axillary lymph node metastasis (LNM) in breast invasive micropapillary carcinoma (IMPC) and to explore the risk factors of LNM. MethodsFrom the Surveillance, Epidemiology, and End Results (SEER) database and the records of our hospital, a total of 1547 patients diagnosed with breast IMPC were incorporated in this study. The ML model is built and the external validation is carried out. SHapley Additive exPlanations (SHAP) framework was applied to explain the optimal model; multivariable analysis was performed with logistic regression (LR); and nomograms were constructed according to the results of LR analysis. ResultsAge and tumor size were correlated with LNM in both cohorts. The luminal subtype is the most common in patients, with the tumor size <=20mm. Compared to other models, Xgboost was the best ML model with the biggest AUC of 0.813 (95% CI: 0.7994 - 0.8262) and the smallest Brier score of 0.186 (95% CI: 0.799-0.826). SHAP plots demonstrated that tumor size was the most vital risk factor for LNM. In both training and test sets, Xgboost had better AUC (0.761 vs 0.745; 0.813 vs 0.775; respectively), and it also achieved a smaller Brier score (0.202 vs 0.204; 0.186 vs 0.191; 0.220 vs 0.221; respectively) than the nomogram model based on LR in those three different sets. After adjusting for five most influential variables (tumor size, age, ER, HER-2, and PR), prediction score based on the Xgboost model was still correlated with LNM (adjusted OR:2.73, 95% CI: 1.30-5.71, P=0.008). ConclusionsThe Xgboost model outperforms the traditional LR-based nomogram model in predicting the LNM of IMPC patients. Combined with SHAP, it can more intuitively reflect the influence of different variables on the LNM. The tumor size was the most important risk factor of LNM for breast IMPC patients. The prediction score obtained by the Xgboost model could be a good indicator for LNM.
引用
收藏
页数:12
相关论文
共 44 条
  • [1] Invasive Micropapillary Carcinoma of the Breast: Mammographic, Sonographic, and MRI Features
    Adrada, Beatriz
    Arribas, Elsa
    Gilcrease, Michael
    Yang, Wei Tse
    [J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 2009, 193 (01) : W58 - W63
  • [2] Comparison of nomogram with machine learning techniques for prediction of overall survival in patients with tongue cancer
    Alabi, Rasheed Omobolaji
    Makitie, Antti A.
    Pirinen, Matti
    Elmusrati, Mohammed
    Leivo, Ilmo
    Almangush, Alhadi
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 145
  • [3] AlJame Maryam, 2020, Inform Med Unlocked, V21, P100449, DOI 10.1016/j.imu.2020.100449
  • [4] Machine learning prediction of axillary lymph node metastasis in breast cancer: 2D versus 3D radiomic features
    Arefan, Dooman
    Chai, Ruimei
    Sun, Min
    Zuley, Margarita L.
    Wu, Shandong
    [J]. MEDICAL PHYSICS, 2020, 47 (12) : 6334 - 6342
  • [5] Human epidermal growth factor receptor 2 status correlates with lymph node involvement in patients with estrogen receptor (ER) -negative, but with grade in those with ER-Positive early-stage breast cancer suitable for cytotoxic chemotherapy
    Bartlett, John M. S.
    Ellis, Ian O.
    Dowsett, Mitch
    Mallon, Elizabeth A.
    Cameron, David A.
    Johnston, Stephen
    Hall, Emma
    A'Hern, Roger
    Peckitt, Clare
    Bliss, Judith M.
    Johnson, Lindsay
    Barrett-Lee, Peter
    Ellis, Paul
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2007, 25 (28) : 4423 - 4430
  • [6] Böcker W, 2002, VERH DEUT G, V86, P116
  • [7] Breast carcinoma with micropapillary features: Clinicopathologic study and long-term follow-up of 100 cases
    Chen, Ling
    Fan, Yu
    Lang, Rong-gang
    Guo, Xiao-jing
    Sun, Yu-lan
    Cui, Li-fang
    Liu, Fang-fang
    Wei, Jia
    Zhang, Xin-min
    Fu, Li
    [J]. INTERNATIONAL JOURNAL OF SURGICAL PATHOLOGY, 2008, 16 (02) : 155 - 163
  • [8] An interpretable machine learning prognostic system for locoregionally advanced nasopharyngeal carcinoma based on tumor burden features
    Chen, Xi
    Li, Yingxue
    Li, Xiang
    Cao, Xun
    Xiang, Yanqun
    Xia, Weixiong
    Li, Jianpeng
    Gao, Mingyong
    Sun, Yuyao
    Liu, Kuiyuan
    Qiang, Mengyun
    Liang, Chixiong
    Miao, Jingjing
    Cai, Zhuochen
    Guo, Xiang
    Li, Chaofeng
    Xie, Guotong
    Lv, Xing
    [J]. ORAL ONCOLOGY, 2021, 118
  • [9] FISHER ER, 1980, AM J CLIN PATHOL, V73, P313
  • [10] Fu Li, 2004, Zhonghua Bing Li Xue Za Zhi, V33, P21