Genes selection using deep learning and explainable artificial intelligence for chronic lymphocytic leukemia predicting the need and time to therapy

被引:5
作者
Morabito, Fortunato [1 ]
Adornetto, Carlo [2 ]
Monti, Paola [3 ]
Amaro, Adriana [4 ]
Reggiani, Francesco [4 ]
Colombo, Monica [5 ]
Rodriguez-Aldana, Yissel [2 ]
Tripepi, Giovanni [6 ]
D'Arrigo, Graziella [6 ]
Vener, Claudia [7 ]
Torricelli, Federica [8 ]
Rossi, Teresa [8 ]
Neri, Antonino [9 ]
Ferrarini, Manlio [10 ]
Cutrona, Giovanna [5 ]
Gentile, Massimo [11 ,12 ]
Greco, Gianluigi [2 ]
机构
[1] A Sforza Foundat, Biotechnol Res Unit, Cosenza, Italy
[2] Univ Calabria, Dept Math & Comp Sci, Cosenza, Italy
[3] Osped Policlin San Martino, Mutagenesis & Canc Prevent Unit, Ist Ricovero & Cura Carattere Sci IRCCS, Genoa, Italy
[4] Osped Policlin San Martino, Tumor Epigenet Unit, Ist Ricovero & Cura Carattere Sci IRCCS, Genoa, Italy
[5] Osped Policlin San Martino, Mol Pathol Unit, Ist Ricovero & Cura Carattere Sci IRCCS, Genoa, Italy
[6] CNR, Consiglio Nazl Ric CNR, Ist Fisiol Clin, Reggio Di Calabria, Italy
[7] Univ Milan, Dept Oncol & Hematooncol, Milan, Italy
[8] Ist Ricovero & Cura Crabtree Sci USL IRCCS Reggio, Azienda Unita Sanit Locale, Lab Translat Res, Reggio Emilia, Italy
[9] Ist Ricovero & Cura Carattere Sci USL IRCCS Reggio, Azienda Unita Sanit Locale, Sci Directorate, Reggio Emilia, Italy
[10] Osped Policlin San Martino, Unita Operar UO Mol Pathol, Ist Ricovero & Cura Carattere Sci IRCCS, Genoa, Italy
[11] Aienda Osped AO Cosenza, Dept Oncohematol, Hematol Unit, Cosenza, Italy
[12] Univ Calabria, Dept Pharm & Hlth & Nutr Sci, Cosenza, Italy
来源
FRONTIERS IN ONCOLOGY | 2023年 / 13卷
关键词
chronic lymphocytic leukemia; gene expression profile; deep learning; explainable artificial intelligence; feature selection; GROWTH-FACTOR-I; B-CELLS; EXPRESSION SIGNATURE; PROGNOSTIC INDEX; CLL-IPI; SURVIVAL; RECEPTOR; PATHWAY; RNA; PROGRESSION;
D O I
10.3389/fonc.2023.1198992
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Analyzing gene expression profiles (GEP) through artificial intelligence provides meaningful insight into cancer disease. This study introduces DeepSHAP Autoencoder Filter for Genes Selection (DSAF-GS), a novel deep learning and explainable artificial intelligence-based approach for feature selection in genomics-scale data. DSAF-GS exploits the autoencoder's reconstruction capabilities without changing the original feature space, enhancing the interpretation of the results. Explainable artificial intelligence is then used to select the informative genes for chronic lymphocytic leukemia prognosis of 217 cases from a GEP database comprising roughly 20,000 genes. The model for prognosis prediction achieved an accuracy of 86.4%, a sensitivity of 85.0%, and a specificity of 87.5%. According to the proposed approach, predictions were strongly influenced by CEACAM19 and PIGP, moderately influenced by MKL1 and GNE, and poorly influenced by other genes. The 10 most influential genes were selected for further analysis. Among them, FADD, FIBP, FIBP, GNE, IGF1R, MKL1, PIGP, and SLC39A6 were identified in the Reactome pathway database as involved in signal transduction, transcription, protein metabolism, immune system, cell cycle, and apoptosis. Moreover, according to the network model of the 3D protein-protein interaction (PPI) explored using the NetworkAnalyst tool, FADD, FIBP, IGF1R, QTRT1, GNE, SLC39A6, and MKL1 appear coupled into a complex network. Finally, all 10 selected genes showed a predictive power on time to first treatment (TTFT) in univariate analyses on a basic prognosticmodel including IGHV mutational status, del(11q) and del(17p), NOTCH1mutations, beta 2-microglobulin, Rai stage, and B-lymphocytosis known to predict TTFT in CLL. However, only IGF1R [hazard ratio (HR) 1.41, 95% CI 1.08-1.84, P=0.013), COL28A1 (HR 0.32, 95% CI 0.10-0.97, P=0.045), and QTRT1 (HR 7.73, 95% CI 2.48-24.04, P<0.001) genes were significantly associated with TTFT in multivariable analyses when combined with the prognostic factors of the basic model, ultimately increasing the Harrell's c-index and the explained variation to 78.6% (versus 76.5% of the basic prognostic model) and 52.6% (versus 42.2% of the basic prognostic model), respectively. Also, the goodness of model fit was enhanced (chi(2) = 20.1, P=0.002), indicating its improved performance above the basic prognostic model. In conclusion, DSAF-GS identified a group of significant genes for CLL prognosis, suggesting future directions for bio-molecular research.
引用
收藏
页数:17
相关论文
共 90 条
  • [81] Diagnosis and classification of hematologic malignancies on the basis of genetics
    Taylor, Justin
    Xiao, Wenbin
    Abdel-Wahab, Omar
    [J]. BLOOD, 2017, 130 (04) : 410 - 423
  • [82] Risk prediction models
    Tripepi, Giovanni
    Heinze, Georg
    Jager, Kitty J.
    Stel, Vianda S.
    Dekker, Friedo W.
    Zoccali, Carmine
    [J]. NEPHROLOGY DIALYSIS TRANSPLANTATION, 2013, 28 (08) : 1975 - 1980
  • [83] Genomic predictors of central nervous system relapse in primary testicular diffuse large B-cell lymphoma
    Twa, David D. W.
    Lee, Derrick G.
    Tan, King L.
    Slack, Graham W.
    Ben-Neriah, Susana
    Villa, Diego
    Connors, Joseph M.
    Sehn, Laurie H.
    Mottok, Anja
    Gascoyne, Randy D.
    Scott, David W.
    Steidl, Christian
    Savage, Kerry J.
    [J]. BLOOD, 2021, 137 (09) : 1256 - 1259
  • [84] Werner Haim, 2009, Pediatr Endocrinol Rev, V7, P2
  • [85] A deep auto-encoder model for gene expression prediction
    Xie, Rui
    Wen, Jia
    Quitadamo, Andrew
    Cheng, Jianlin
    Shi, Xinghua
    [J]. BMC GENOMICS, 2017, 18
  • [86] Insulin-like growth factor-1 receptor (IGF1R) as a novel target in chronic lymphocytic leukemia
    Yaktapour, Niuscha
    Uebelhart, Rudolf
    Schueler, Julia
    Aumann, Konrad
    Dierks, Christine
    Burger, Meike
    Pfeifer, Dietmar
    Jumaa, Hassan
    Veelken, Hendrik
    Brummer, Tilman
    Zirlik, Katja
    [J]. BLOOD, 2013, 122 (09) : 1621 - 1633
  • [87] GHR is involved in gastric cell growth and apoptosis via PI3K/AKT signalling
    Yan, Hong-Zhu
    Wang, Hua-Feng
    Yin, Yueling
    Zou, Jue
    Xiao, Feng
    Yi, Li-Na
    He, Ying
    He, Bo-Sheng
    [J]. JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2021, 25 (05) : 2450 - 2458
  • [88] Zhang L, 2016, INT J CLIN EXP PATHO, V9, P9126
  • [89] NetworkAnalyst 3.0: a visual analytics platform for comprehensive gene expression profiling and meta-analysis
    Zhou, Guangyan
    Soufan, Othman
    Ewald, Jessica
    Hancock, Robert E. W.
    Basu, Niladri
    Xia, Jianguo
    [J]. NUCLEIC ACIDS RESEARCH, 2019, 47 (W1) : W234 - W241
  • [90] Identification of Six Diagnostic Biomarkers for Chronic Lymphocytic Leukemia Based on Machine Learning Algorithms
    Zhu, Yidong
    Gan, Xinjin
    Qin, Ruoyan
    Lin, Zhikang
    [J]. JOURNAL OF ONCOLOGY, 2022, 2022