Radiogenomics in Clear Cell Renal Cell Carcinoma: Machine Learning-Based High-Dimensional Quantitative CT Texture Analysis in Predicting PBRM1 Mutation Status

被引:91
作者
Kocak, Burak [1 ]
Durmaz, Emine Sebnem [2 ]
Ates, Ece [1 ]
Ulusan, Melis Baykara [1 ]
机构
[1] Istanbul Training & Res Hosp, Dept Radiol, Istanbul, Turkey
[2] Istanbul Univ, Cerrahpasa Med Fac, Dept Radiol, Istanbul, Turkey
关键词
CT; clear cell renal cell carcinoma; genomics; machine learning; PBRM1; mutation; TUMOR HETEROGENEITY; SELECTION; GENE; INFORMATION; FEATURES; IMAGES;
D O I
10.2214/AJR.18.20443
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
OBJECTIVE. The purpose of this study is to evaluate the potential value of machine learning (ML)-based high-dimensional quantitative CT texture analysis in predicting the mutation status of the gene encoding the protein polybromo-1 (PBRM1) in patients with clear cell renal cell carcinoma (RCC). MATERIALS AND METHODS. In this retrospective study, 45 patients with clear cell RCC (29 without the PBRM1 mutation and 16 with the PBRM1 mutation) were identified in The Cancer Genome Atlas-Kidney Renal Clear Cell Carcinoma database. To create stable ML models and balanced classes, the data were augmented to a total of 161 labeled segmentations (87 without the PBRM1 mutation and 74 with the PBRM1 mutation) by obtaining three to five different samples per patient. Texture features were extracted from corticomedullary phase contrast-enhanced CT images with the use of an open-source software package for the extraction of radiomic data from medical images. Reproducibility analysis (intraclass correlation) was performed by two radiologists. Attribute selection and model optimization were done using a wrapper-based classifier-specific algorithm with nested cross-validation. ML classifiers were an artificial neural network (ANN) algorithm and a random forest (RF) algorithm. The models were validated using 10-fold cross-validation. The reference standard was the PBRM1 mutation status. The main performance metric was the AUC value. RESULTS. Of 828 extracted texture features, 759 had excellent reproducibility. Using 10 selected features, the ANN algorithm correctly classified 88.2% (142 of 161) of the clear cell RCCs in terms of PBRM1 mutation status (AUC value, 0.925). Using five selected features, the RF algorithm correctly classified 95.0% (153 of 161) of the clear cell RCCs (AUC value, 0.987). Overall, the RF algorithm performed better than the ANN algorithm (z score = -2.677; p = 0.007). CONCLUSION. ML-based high-dimensional quantitative CT texture analysis might be a feasible and potential method for predicting PBRM1 mutation status in patients with clear cell RCC.
引用
收藏
页码:W55 / W63
页数:9
相关论文
共 52 条
  • [1] A feature selection technique for classificatory analysis
    Ahmad, A
    Dey, L
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (01) : 43 - 56
  • [2] Akin O, RADIOLOGY DATA CANC
  • [3] Alessandrino F, 2018, ABDOM RADIOL NY
  • [4] Update on Radiogenomics of Clear Cell Renal Cell Carcinoma
    Alessandrino, Francesco
    Krajewski, Katherine M.
    Shinagare, Atul B.
    [J]. EUROPEAN UROLOGY FOCUS, 2016, 2 (06): : 572 - 573
  • [5] [Anonymous], EUR RADIOL
  • [6] IMPROVING INCREMENTAL WRAPPER-BASED SUBSET SELECTION VIA REPLACEMENT AND EARLY STOPPING
    Bermejo, Pablo
    Gamez, Jose A.
    Puerta, Jose M.
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2011, 25 (05) : 605 - 625
  • [7] Molecular Genetics of Clear-Cell Renal Cell Carcinoma
    Brugarolas, James
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2014, 32 (18) : 1968 - +
  • [8] Carlo MI, 2017, KIDNEY CANCER, V1, P49, DOI 10.3233/KCA-160003
  • [9] Cawley GC, 2010, J MACH LEARN RES, V11, P2079
  • [10] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)