Parameter tuning in machine learning based on radiomics biomarkers of lung cancer

被引:2
作者
Luo, Yuan [1 ]
Li, Yifan [1 ]
Zhang, Yuwei [1 ]
Zhang, Jianwei [2 ]
Liang, Meng [1 ]
Jiang, Lin [1 ]
Guo, Li [1 ]
机构
[1] Tianjin Med Univ, Sch Med Imaging, Tianjin 300203, Peoples R China
[2] Tianjin Baodi Hosp, Dept Radiol, Tianjin, Peoples R China
关键词
Lung neoplasms; machine learning; radiomics; parameter analysis; lung nodule classification; FEATURES; ADENOCARCINOMA; NODULES; IMAGES;
D O I
10.3233/XST-211096
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
BACKGROUND: Lung cancer is one of the most common cancers, and early diagnosis and intervention can improve cancer cure rate. OBJECTIVE: To improve predictive performance of radiomics features for lung cancer by tuning the machine learning model parameters. METHODS: Using a dataset involving 263 cases (125 benign and 138 malignant) acquired from our hospital, each classifier model is trained and tested using 237 and 26 cases, respectively. We initially extract 867 radiomics features of CT images for model development and then test 10 feature selections and 7 models to determine the best method. We further tune the parameter of the final model to reach the best performance. The adjusted final model is then validated using 224 cases acquired from Lung Image Database Consortium (LIDC) dataset (64 benign and 160 malignant) with the same set of selected radiomics features. RESULTS: During model development, the feature selection via concave minimization method showthe best performance of area under ROC curve (AUC=0.765), followed by l0-norm regularization (AUC=0.741) and Fisher discrimination criterion (AUC=0.734). Support vector machine (SVM) and random forest (RF) are the top two machine learning algorithms showing the best performance (AUC=0.765 and 0.734, respectively), using by the default parameter. After parameter tuning, SVM with linear kernel achieves the best performance (AUC=0.837), whereas the best tuned RF with the number of trees is 510 and yields a slightly lower performance (AUC=0.775) in 26 test samples data. During model validation, the SVM and RF models yield AUC=0.78 and 0.77, respectively. CONCLUSION: Appropriate quantitative radiomics features and accurate parameters can improve the model's performance to predict lung cancer.
引用
收藏
页码:477 / 490
页数:14
相关论文
共 36 条
  • [31] A PET/CT nomogram incorporating SUVmax and CT radiomics for preoperative nodal staging in non-small cell lung cancer
    Xie, Yunming
    Zhao, Hongguang
    Guo, Yan
    Meng, Fanyang
    Liu, Xiangchun
    Zhang, Yiying
    Huai, Xiaochen
    Wong, Qianting
    Fu, Yu
    Zhang, Huimao
    [J]. EUROPEAN RADIOLOGY, 2021, 31 (08) : 6030 - 6038
  • [32] Machine Learning for Histologic Subtype Classification of Non-Small Cell Lung Cancer: A Retrospective Multicenter Radiomics Study
    Yang, Fengchang
    Chen, Wei
    Wei, Haifeng
    Zhang, Xianru
    Yuan, Shuanghu
    Qiao, Xu
    Chen, Yen-Wei
    [J]. FRONTIERS IN ONCOLOGY, 2021, 10
  • [33] Yin R., 2021, J X-RAY SCI TECHNOL, V29, P1149
  • [34] Prediction of pathologic stage in non-small cell lung cancer using machine learning algorithm based on CT image feature analysis
    Yu, Lingming
    Tao, Guangyu
    Zhu, Lei
    Wang, Gang
    Li, Ziming
    Yi, Jianding
    Chen, Qunhui
    [J]. BMC CANCER, 2019, 19 (1)
  • [35] Computer Tomography Radiomics-Based Nomogram in the Survival Prediction for Brain Metastases From Non-Small Cell Lung Cancer Underwent Whole Brain Radiotherapy
    Zhang, Ji
    Jin, Juebin
    Ai, Yao
    Zhu, Kecheng
    Xiao, Chengjian
    Xie, Congying
    Jin, Xiance
    [J]. FRONTIERS IN ONCOLOGY, 2021, 10
  • [36] Differentiation of focal organising pneumonia and peripheral adenocarcinoma in solid lung lesions using thin-section CT-based radiomics
    Zhang, T.
    Yuan, M.
    Zhong, Y.
    Zhang, Y. -D.
    Li, H.
    Wu, J. -F.
    Yu, T. -F.
    [J]. CLINICAL RADIOLOGY, 2019, 74 (01) : 78.e23 - 78.e30