Phenotyping the Histopathological Subtypes of Non-Small-Cell Lung Carcinoma: How Beneficial Is Radiomics?

被引:16
|
作者
Pasini, Giovanni [1 ,2 ]
Stefano, Alessandro [2 ]
Russo, Giorgio [2 ]
Comelli, Albert [2 ,3 ]
Marinozzi, Franco [1 ]
Bini, Fabiano [1 ]
机构
[1] Sapienza Univ Rome, Dept Mech & Aerosp Engn, Eudossiana 18, I-00184 Rome, Italy
[2] Natl Res Council IBFM CNR, Inst Mol Bioimaging & Physiol, I-90015 Cefalu, Italy
[3] Ri MED Fdn, Via Bandiera 11, I-90133 Palermo, Italy
关键词
radiomics; CT; non-small-cell lung carcinoma; NSCLC; phenotyping; multicenter; harmonization; machine learning; CLASSIFICATION; HARMONIZATION; FRAMEWORK; SCANNERS; SYSTEM; IMAGES; TUMORS;
D O I
10.3390/diagnostics13061167
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
The aim of this study was to investigate the usefulness of radiomics in the absence of well-defined standard guidelines. Specifically, we extracted radiomics features from multicenter computed tomography (CT) images to differentiate between the four histopathological subtypes of non-small-cell lung carcinoma (NSCLC). In addition, the results that varied with the radiomics model were compared. We investigated the presence of the batch effects and the impact of feature harmonization on the models' performance. Moreover, the question on how the training dataset composition influenced the selected feature subsets and, consequently, the model's performance was also investigated. Therefore, through combining data from the two publicly available datasets, this study involves a total of 152 squamous cell carcinoma (SCC), 106 large cell carcinoma (LCC), 150 adenocarcinoma (ADC), and 58 no other specified (NOS). Through the matRadiomics tool, which is an example of Image Biomarker Standardization Initiative (IBSI) compliant software, 1781 radiomics features were extracted from each of the malignant lesions that were identified in CT images. After batch analysis and feature harmonization, which were based on the ComBat tool and were integrated in matRadiomics, the datasets (the harmonized and the non-harmonized) were given as an input to a machine learning modeling pipeline. The following steps were articulated: (i) training-set/test-set splitting (80/20); (ii) a Kruskal-Wallis analysis and LASSO linear regression for the feature selection; (iii) model training; (iv) a model validation and hyperparameter optimization; and (v) model testing. Model optimization consisted of a 5-fold cross-validated Bayesian optimization, repeated ten times (inner loop). The whole pipeline was repeated 10 times (outer loop) with six different machine learning classification algorithms. Moreover, the stability of the feature selection was evaluated. Results showed that the batch effects were present even if the voxels were resampled to an isotropic form and whether feature harmonization correctly removed them, even though the models' performances decreased. Moreover, the results showed that a low accuracy (61.41%) was reached when differentiating between the four subtypes, even though a high average area under curve (AUC) was reached (0.831). Further, a NOS subtype was classified as almost completely correct (true positive rate similar to 90%). The accuracy increased (77.25%) when only the SCC and ADC subtypes were considered, as well as when a high AUC (0.821) was obtained-although harmonization decreased the accuracy to 58%. Moreover, the features that contributed the most to models' performance were those extracted from wavelet decomposed and Laplacian of Gaussian (LoG) filtered images and they belonged to the texture feature class.. In conclusion, we showed that our multicenter data were affected by batch effects, that they could significantly alter the models' performance, and that feature harmonization correctly removed them. Although wavelet features seemed to be the most informative features, an absolute subset could not be identified since it changed depending on the training/testing splitting. Moreover, performance was influenced by the chosen dataset and by the machine learning methods, which could reach a high accuracy in binary classification tasks, but could underperform in multiclass problems. It is, therefore, essential that the scientific community propose a more systematic radiomics approach, focusing on multicenter studies, with clear and solid guidelines to facilitate the translation of radiomics to clinical practice.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Assessing the Robustness and Reproducibility of CT Radiomics Features in Non-small-cell Lung Carcinoma
    Pasini, Giovanni
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT II, 2024, 14366 : 39 - 48
  • [2] Non-small cell lung carcinoma histopathological subtype phenotyping using high-dimensional multinomial multiclass CT radiomics signature
    Khodabakhshi, Zahra
    Mostafaei, Shayan
    Arabi, Hossein
    Oveisi, Mehrdad
    Shiri, Isaac
    Zaidi, Habib
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 136
  • [3] Developing a Radiomics Framework for Classifying Non-Small Cell Lung Carcinoma Subtypes
    Yu, Dongdong
    Zang, Yali
    Dong, Di
    Zhou, Mu
    Gevaert, Olivier
    Fang, Mengji
    Shi, Jingyun
    Tian, Jie
    MEDICAL IMAGING 2017: COMPUTER-AIDED DIAGNOSIS, 2017, 10134
  • [4] Exploratory Study of a CT Radiomics Model for the Classification of Small Cell Lung Cancer and Non-small-Cell Lung Cancer
    Liu, Shihe
    Liu, Shunli
    Zhang, Chuanyu
    Yu, Hualong
    Liu, Xuejun
    Hu, Yabin
    Xu, Wenjian
    Tang, Xiaoyan
    Fu, Qing
    FRONTIERS IN ONCOLOGY, 2020, 10
  • [5] Intratumoral and peritumoral CT-based radiomics strategy reveals distinct subtypes of non-small-cell lung cancer
    Tang, Xing
    Huang, Haolin
    Du, Peng
    Wang, Lijuan
    Yin, Hong
    Xu, Xiaopan
    JOURNAL OF CANCER RESEARCH AND CLINICAL ONCOLOGY, 2022, 148 (09) : 2247 - 2260
  • [6] Deciphering unclassified tumors of non-small-cell lung cancer through radiomics
    Saad, Maliazurina
    Choi, Tae-Sun
    COMPUTERS IN BIOLOGY AND MEDICINE, 2017, 91 : 222 - 230
  • [7] Robust Identification of Subtypes in Non-Small Cell Lung Cancer Using Radiomics
    Wang, Ke
    An, Ying
    Li, Ni
    Zhou, Jiancun
    Chen, Xianlai
    TRAITEMENT DU SIGNAL, 2022, 39 (04) : 1399 - 1406
  • [8] Matrix metalloproteinase pharmacogenomics in non-small-cell lung carcinoma
    Chetty, Chandramu
    Rao, Jasti S.
    Lakka, Sajani S.
    PHARMACOGENOMICS, 2011, 12 (04) : 535 - 546
  • [9] MRI-based radiomics analysis in differentiating solid non-small-cell from small-cell lung carcinoma: a pilot study
    Dang, S.
    Guo, Y.
    Han, D.
    Ma, G.
    Yu, N.
    Yang, Q.
    Duan, X.
    Duan, H.
    Ren, J.
    CLINICAL RADIOLOGY, 2022, 77 (10) : E749 - E757
  • [10] Machine Learning-Based Radiomics Analysis for Identifying KRAS Mutations in Non-Small-Cell Lung Cancer from CT Images: Challenges, Insights and Implications
    Schoeneck, Mirjam
    Rehbach, Nicolas
    Lotter-Becker, Lars
    Persigehl, Thorsten
    Lennartz, Simon
    Caldeira, Liliana Lourenco
    LIFE-BASEL, 2025, 15 (01):