A Genetic Programming Approach to Radiomic-Based Feature Construction for Survival Prediction in Non-Small Cell Lung Cancer

被引:1
作者
Scalco, Elisa [1 ]
Gomez-Flores, Wilfrido [2 ]
Rizzo, Giovanna [3 ]
机构
[1] Italian Natl Res Council, Inst Biomed Technol, Via Fratelli Cervi 93, I-20054 Segrate, Italy
[2] Ctr Invest & Estudios Avanzados IPN, Unidad Tamaulipas, Km 5-5 Carretera Cd Victoria Soto Marina,Parque Ci, Ciudad Victoria 87138, Mexico
[3] Italian Natl Res Council, Inst Intelligent Ind Technol & Syst, Via Alfonso Corti 12, I-20133 Milan, Italy
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 16期
关键词
computer tomography; feature construction; genetic programming; radiomics; non-small cell lung cancer; CLASSIFICATION; INFORMATION; 2D;
D O I
10.3390/app14166923
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Machine learning (ML) is commonly used to develop survival-predictive radiomic models in non-small cell lung cancer (NSCLC) patients, which helps assist treatment decision making. Radiomic features derived from computer tomography (CT) lung images aim to capture quantitative tumor characteristics. However, these features are determined by humans, which poses a risk of including irrelevant or redundant variables, thus reducing the model's generalization. To address this issue, we propose using genetic programming (GP) to automatically construct new features with higher discriminant power than the original radiomic features. To achieve this goal, we introduce a fitness function that measures the classification performance ratio of output to input. The constructed features are then input for various classifiers to predict the two-year survival of NSCLC patients from two public CT datasets. Our approach is compared against two popular feature selection methods in radiomics to choose relevant radiomic features, and two GP-based feature construction methods whose fitness functions are based on measuring the constructed features' quality. The experimental results show that survival prediction models trained on GP-based constructed features outperform feature selection methods. Also, maximizing the classification performance gain output-to-input ratio produces features with higher discriminative power than only maximizing the classification accuracy from constructed features. Furthermore, a survival analysis demonstrated statistically significant differences between survival and non-survival groups in the Kaplan-Meier curves. Therefore, the proposed approach can be used as a complementary method for oncologists in determining the clinical management of NSCLC patients.
引用
收藏
页数:19
相关论文
共 55 条
[1]   Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach [J].
Aerts, Hugo J. W. L. ;
Velazquez, Emmanuel Rios ;
Leijenaar, Ralph T. H. ;
Parmar, Chintan ;
Grossmann, Patrick ;
Cavalho, Sara ;
Bussink, Johan ;
Monshouwer, Rene ;
Haibe-Kains, Benjamin ;
Rietveld, Derek ;
Hoebers, Frank ;
Rietbergen, Michelle M. ;
Leemans, C. Rene ;
Dekker, Andre ;
Quackenbush, John ;
Gillies, Robert J. ;
Lambin, Philippe .
NATURE COMMUNICATIONS, 2014, 5
[2]   Machine-Learning-Based Disease Diagnosis: A Comprehensive Review [J].
Ahsan, Md Manjurul ;
Luna, Shahana Akter ;
Siddique, Zahed .
HEALTHCARE, 2022, 10 (03)
[3]   A radiogenomic dataset of non-small cell lung cancer [J].
Bakr, Shaimaa ;
Gevaert, Olivier ;
Echegaray, Sebastian ;
Ayers, Kelsey ;
Zhou, Mu ;
Shafiq, Majid ;
Zheng, Hong ;
Benson, Jalen Anthony ;
Zhang, Weiruo ;
Leung, Ann N. C. ;
Kadoch, Michael ;
Hoang, Chuong D. ;
Shrager, Joseph ;
Quon, Andrew ;
Rubin, Daniel L. ;
Plevritis, Sylvia K. ;
Napel, Sandy .
SCIENTIFIC DATA, 2018, 5
[4]  
Bhardwaj H, 2018, 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), P2186, DOI 10.1109/SSCI.2018.8628935
[5]   Genetic programming for multiple-feature construction on high-dimensional classification [J].
Binh Tran ;
Xue, Bing ;
Zhang, Mengjie .
PATTERN RECOGNITION, 2019, 93 :404-417
[6]   Radiomics and deep learning methods for the prediction of 2-year overall survival in LUNG1 dataset [J].
Braghetto, Anna ;
Marturano, Francesca ;
Paiusco, Marta ;
Baiesi, Marco ;
Bettinelli, Andrea .
SCIENTIFIC REPORTS, 2022, 12 (01)
[7]   Predicting survival time of lung cancer patients using radiomic analysis [J].
Chaddad, Ahmad ;
Desrosiers, Christian ;
Toews, Matthew ;
Abdulkarim, Bassam .
ONCOTARGET, 2017, 8 (61) :104393-104407
[8]   The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository [J].
Clark, Kenneth ;
Vendt, Bruce ;
Smith, Kirk ;
Freymann, John ;
Kirby, Justin ;
Koppel, Paul ;
Moore, Stephen ;
Phillips, Stanley ;
Maffitt, David ;
Pringle, Michael ;
Tarbox, Lawrence ;
Prior, Fred .
JOURNAL OF DIGITAL IMAGING, 2013, 26 (06) :1045-1057
[9]   Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies [J].
Da-ano, R. ;
Masson, I ;
Lucia, F. ;
Dore, M. ;
Robin, P. ;
Alfieri, J. ;
Rousseau, C. ;
Mervoyer, A. ;
Reinhold, C. ;
Castelli, J. ;
De Crevoisier, R. ;
Ramee, J. F. ;
Pradier, O. ;
Schick, U. ;
Visvikis, D. ;
Hatt, M. .
SCIENTIFIC REPORTS, 2020, 10 (01)
[10]   A fast and elitist multiobjective genetic algorithm: NSGA-II [J].
Deb, K ;
Pratap, A ;
Agarwal, S ;
Meyarivan, T .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2002, 6 (02) :182-197