Convolutional neural network for biomarker discovery for triple negative breast cancer with RNA sequencing data

被引:0
|
作者
Chen, Xiangning [1 ]
Balko, Justin M. [2 ,3 ,4 ,5 ,6 ]
Ling, Fei [7 ]
Jin, Yabin [8 ]
Gonzalez, Anneliese [9 ]
Zhao, Zhongming [10 ,11 ]
Chen, Jingchun [12 ]
机构
[1] 410 AI LLC, 10 Plummer Ct, Germantown, MD 20876 USA
[2] Vanderbilt Univ, Med Ctr, Vanderbilt Ingram Canc Ctr, Dept Med, 2101 End Ave, Nashville, TN 37240 USA
[3] Vanderbilt Univ, Med Ctr, Vanderbilt Ingram Canc Ctr, Breast Canc Res Program, 2101 W End Ave, Nashville, TN 37240 USA
[4] Vanderbilt Univ, Vanderbilt Ingram Canc Ctr, Med Ctr, Dept Pathol, Nashville, TN USA
[5] Vanderbilt Univ, Med Ctr, Vanderbilt Ingram Canc Ctr, Dept Microbiol, Nashville, TN USA
[6] Vanderbilt Univ, Med Ctr, Vanderbilt Ingram Canc Ctr, Dept Immunol, Nashville, TN USA
[7] South China Univ Technol, Sch Biol & Biol Engn, Guangzhou, Guangdong, Peoples R China
[8] First Peoples Hosp Foshan, Clin Res Inst, Foshan, Peoples R China
[9] Univ Texas Hlth Sci Ctr Houston, McGovern Med Sch, Dept Internal Med, Houston, TX 77030 USA
[10] Univ Texas Hlth Sci Ctr Houston, Ctr Precis Hlth, Sch Biomed Informat, Houston, TX 77030 USA
[11] Univ Texas Houston, McGovern Med Sch, Dept Psychiat & Behav Sci, Houston, TX 77030 USA
[12] Univ Nevada Vegas, Nevada Inst Personalized Med, Las Vegas, NV 89154 USA
关键词
Convolutional neural network; Triple negative breast cancer; Biomarker discovery; RNA sequencing; Machine learning; CLASSIFICATION; EXPRESSION; SUBTYPES; PATHWAY; IMAGE; SMOTE;
D O I
10.1016/j.heliyon.2023.e14819
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Triple negative breast cancers (TNBCs) are tumors with a poor treatment response and prognosis. In this study, we propose a new approach, candidate extraction from convolutional neural network (CNN) elements (CECE), for discovery of biomarkers for TNBCs. We used the GSE96058 and GSE81538 datasets to build a CNN model to classify TNBCs and non-TNBCs and used the model to make TNBC predictions for two additional datasets, the cancer genome atlas (TCGA) breast cancer RNA sequencing data and the data from Fudan University Shanghai Cancer Center (FUSCC). Using correctly predicted TNBCs from the GSE96058 and TCGA datasets, we calculated saliency maps for these subjects and extracted the genes that the CNN model used to separate TNBCs from non-TNBCs. Among the TNBC signature patterns that the CNN models learned from the training data, we found a set of 21 genes that can classify TNBCs into two major classes, or CECE subtypes, with distinct overall survival rates (P = 0.0074). We replicated this subtype classification in the FUSCC dataset using the same 21 genes, and the two subtypes had similar differential overall survival rates (P = 0.0490). When all TNBCs were combined from the 3 datasets, the CECE II subtype had a hazard ratio of 1.94 (95% CI, 1.25-3.01; P = 0.0032). The results demonstrate that the spatial patterns learned by the CNN models can be utilized to discover interacting biomarkers otherwise unlikely to be identified by traditional approaches.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Triple negative breast cancer: the kiss of death
    Jitariu, Adriana-Andreea
    Cimpean, Anca Maria
    Ribatti, Domenico
    Raica, Marius
    ONCOTARGET, 2017, 8 (28) : 46652 - 46662
  • [22] LUCAT1-Mediated Competing Endogenous RNA (ceRNA) Network in Triple-Negative Breast Cancer
    Verma, Deepak
    Siddharth, Sumit
    Yende, Ashutosh S.
    Wu, Qitong
    Sharma, Dipali
    CELLS, 2024, 13 (22)
  • [23] Identification of candidate RNA signatures in triple-negative breast cancer by the construction of a competing endogenous RNA network with integrative analyses of Gene Expression Omnibus and The Cancer Genome Atlas data
    Yan, Ping
    Tang, Lingfeng
    Liu, Li
    Tu, Gang
    ONCOLOGY LETTERS, 2020, 19 (03) : 1915 - 1927
  • [24] Reassessment of Reliability and Reproducibility for Triple-Negative Breast Cancer Subtyping
    Yu, Xinjian
    Liu, Yongjing
    Chen, Ming
    CANCERS, 2022, 14 (11)
  • [25] Analysis of single-cell RNA-sequencing data identifies a hypoxic tumor subpopulation associated with poor prognosis in triple-negative breast cancer
    Shi, Yi
    Huang, Xiaoqian
    Du, Zhaolan
    Tan, Jianjun
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (06) : 5793 - 5812
  • [26] Circulating cell-free miRNAs as biomarker for triple-negative breast cancer
    Shin, V. Y.
    Siu, J. M.
    Cheuk, I.
    Ng, E. K. O.
    Kwong, A.
    BRITISH JOURNAL OF CANCER, 2015, 112 (11) : 1751 - 1759
  • [27] PIM-1 kinase: a potential biomarker of triple-negative breast cancer
    Chen, Jieying
    Tang, Guangyu
    ONCOTARGETS AND THERAPY, 2019, 12 : 6267 - 6273
  • [28] Biology of the Triple-Negative Breast Cancer: Immunohistochemical, RNA, and DNA Features
    Herrera Juarez, Mercedes
    Tolosa Ortega, Pablo
    Sanchez de Torre, Ana
    Ciruelos Gil, Eva
    BREAST CARE, 2020, 15 (03) : 208 - 216
  • [29] Potential Management of Circulating Tumor DNA as a Biomarker in Triple-Negative Breast Cancer
    Shang, Mao
    Chang, Chunxiao
    Pei, Yanqing
    Guan, Yin
    Chang, Jin
    Li, HuiHui
    JOURNAL OF CANCER, 2018, 9 (24): : 4627 - 4634
  • [30] Prolactin receptor expression as a novel prognostic biomarker for triple negative breast cancer patients
    Motamedi, Behnaz
    Rafiee-Pour, Hossain-Ali
    Khosravi, Mohammad-Reza
    Kefayat, Amirhosein
    Baradaran, Azar
    Amjadi, Elham
    Goli, Parvin
    ANNALS OF DIAGNOSTIC PATHOLOGY, 2020, 46