Convolutional neural network for biomarker discovery for triple negative breast cancer with RNA sequencing data

被引:0
|
作者
Chen, Xiangning [1 ]
Balko, Justin M. [2 ,3 ,4 ,5 ,6 ]
Ling, Fei [7 ]
Jin, Yabin [8 ]
Gonzalez, Anneliese [9 ]
Zhao, Zhongming [10 ,11 ]
Chen, Jingchun [12 ]
机构
[1] 410 AI LLC, 10 Plummer Ct, Germantown, MD 20876 USA
[2] Vanderbilt Univ, Med Ctr, Vanderbilt Ingram Canc Ctr, Dept Med, 2101 End Ave, Nashville, TN 37240 USA
[3] Vanderbilt Univ, Med Ctr, Vanderbilt Ingram Canc Ctr, Breast Canc Res Program, 2101 W End Ave, Nashville, TN 37240 USA
[4] Vanderbilt Univ, Vanderbilt Ingram Canc Ctr, Med Ctr, Dept Pathol, Nashville, TN USA
[5] Vanderbilt Univ, Med Ctr, Vanderbilt Ingram Canc Ctr, Dept Microbiol, Nashville, TN USA
[6] Vanderbilt Univ, Med Ctr, Vanderbilt Ingram Canc Ctr, Dept Immunol, Nashville, TN USA
[7] South China Univ Technol, Sch Biol & Biol Engn, Guangzhou, Guangdong, Peoples R China
[8] First Peoples Hosp Foshan, Clin Res Inst, Foshan, Peoples R China
[9] Univ Texas Hlth Sci Ctr Houston, McGovern Med Sch, Dept Internal Med, Houston, TX 77030 USA
[10] Univ Texas Hlth Sci Ctr Houston, Ctr Precis Hlth, Sch Biomed Informat, Houston, TX 77030 USA
[11] Univ Texas Houston, McGovern Med Sch, Dept Psychiat & Behav Sci, Houston, TX 77030 USA
[12] Univ Nevada Vegas, Nevada Inst Personalized Med, Las Vegas, NV 89154 USA
关键词
Convolutional neural network; Triple negative breast cancer; Biomarker discovery; RNA sequencing; Machine learning; CLASSIFICATION; EXPRESSION; SUBTYPES; PATHWAY; IMAGE; SMOTE;
D O I
10.1016/j.heliyon.2023.e14819
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Triple negative breast cancers (TNBCs) are tumors with a poor treatment response and prognosis. In this study, we propose a new approach, candidate extraction from convolutional neural network (CNN) elements (CECE), for discovery of biomarkers for TNBCs. We used the GSE96058 and GSE81538 datasets to build a CNN model to classify TNBCs and non-TNBCs and used the model to make TNBC predictions for two additional datasets, the cancer genome atlas (TCGA) breast cancer RNA sequencing data and the data from Fudan University Shanghai Cancer Center (FUSCC). Using correctly predicted TNBCs from the GSE96058 and TCGA datasets, we calculated saliency maps for these subjects and extracted the genes that the CNN model used to separate TNBCs from non-TNBCs. Among the TNBC signature patterns that the CNN models learned from the training data, we found a set of 21 genes that can classify TNBCs into two major classes, or CECE subtypes, with distinct overall survival rates (P = 0.0074). We replicated this subtype classification in the FUSCC dataset using the same 21 genes, and the two subtypes had similar differential overall survival rates (P = 0.0490). When all TNBCs were combined from the 3 datasets, the CECE II subtype had a hazard ratio of 1.94 (95% CI, 1.25-3.01; P = 0.0032). The results demonstrate that the spatial patterns learned by the CNN models can be utilized to discover interacting biomarkers otherwise unlikely to be identified by traditional approaches.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Novel biomarkers identified in triple-negative breast cancer through RNA-sequencing
    Chen, Yan-li
    Wang, Ke
    Xie, Fei
    Zhuo, Zhong-ling
    Liu, Chang
    Yang, Yu
    Wang, Shu
    Zhao, Xiao-tao
    CLINICA CHIMICA ACTA, 2022, 531 : 302 - 308
  • [2] Automatic identification of triple negative breast cancer in ultrasonography using a deep convolutional neural network
    Ye, Heng
    Hang, Jing
    Zhang, Meimei
    Chen, Xiaowei
    Ye, Xinhua
    Chen, Jie
    Zhang, Weixin
    Xu, Di
    Zhang, Dong
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [3] A PROPOSED CONVOLUTIONAL NEURAL NETWORK FOR BREAST CANCER DIAGNOSES
    Kadhim, Noor Kareem
    Al-khateeb, Belal
    Ahmed, Huda Wadah
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2023, 21 (01) : 9 - 18
  • [4] Artificial image objects for classification of breast cancer biomarkers with transcriptome sequencing data and convolutional neural network algorithms
    Xiangning Chen
    Daniel G. Chen
    Zhongming Zhao
    Justin M. Balko
    Jingchun Chen
    Breast Cancer Research, 23
  • [5] Artificial image objects for classification of breast cancer biomarkers with transcriptome sequencing data and convolutional neural network algorithms
    Chen, Xiangning
    Chen, Daniel G.
    Zhao, Zhongming
    Balko, Justin M.
    Chen, Jingchun
    BREAST CANCER RESEARCH, 2021, 23 (01)
  • [6] A Survey of Convolutional Neural Network in Breast Cancer
    Zhu, Ziquan
    Wang, Shui-Hua
    Zhang, Yu-Dong
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 136 (03): : 2127 - 2172
  • [7] A Novel Role for Cathepsin S as a Potential Biomarker in Triple Negative Breast Cancer
    Wilkinson, Richard D. A.
    Burden, Roberta E.
    McDowell, Sara H.
    McArt, Darragh G.
    McQuaid, Stephen
    Bingham, Victoria
    Williams, Rich
    Cox, Orla T.
    O'Connor, Rosemary
    McCabe, Nuala
    Kennedy, Richard D.
    Buckley, Niamh E.
    Scott, Christopher J.
    JOURNAL OF ONCOLOGY, 2019, 2019
  • [8] Endocan as a prognostic biomarker of triple-negative breast cancer
    Sagara, Atsunobu
    Igarashi, Katsuhide
    Otsuka, Maky
    Kodama, Akihiro
    Yamashita, Mutsumi
    Sugiura, Rei
    Karasawa, Takeshi
    Arakawa, Kazuhiko
    Narita, Michiko
    Kuzumaki, Naoko
    Narita, Minoru
    Kato, Yoshinori
    BREAST CANCER RESEARCH AND TREATMENT, 2017, 161 (02) : 269 - 278
  • [9] Potential Biomarker for Triple-Negative Breast Cancer Invasiveness by Optical Redox Imaging
    Feng, Min
    Xu, He N.
    Jiang, Jinxia
    Li, Lin Z.
    OXYGEN TRANSPORT TO TISSUE XLII, 2021, 1269 : 247 - 251
  • [10] Breast Cancer Classification Using Convolutional Neural Network
    Alshanbari, Eman
    Alamri, Hanaa
    Alzahrani, Walaa
    Alghamdi, Manal
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (06): : 101 - 106