Improved NSGA-II algorithms for multi-objective biomarker discovery

被引:4
作者
Cattelani, Luca [1 ]
Fortino, Vittorio [1 ]
机构
[1] Univ Eastern Finland, Sch Med, Inst Biomed, Kuopio, Finland
基金
芬兰科学院;
关键词
BREAST-CANCER;
D O I
10.1093/bioinformatics/btac463
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In modern translational research, the development of biomarkers heavily relies on use of omics technologies, but implementations with basic data mining algorithms frequently lead to false positives. Non-dominated Sorting Genetic Algorithm II (NSGA2) is an extremely effective algorithm for biomarker discovery but has been rarely evaluated against large-scale datasets. The exploration of the feature search space is the key to NSGA2 success but in specific cases NSGA2 expresses a shallow exploration of the space of possible feature combinations, possibly leading to models with low predictive performances. Results We propose two improved NSGA2 algorithms for finding subsets of biomarkers exhibiting different trade-offs between accuracy and feature number. The performances are investigated on gene expression data of breast cancer patients. The results are compared with NSGA2 and LASSO. The benchmarking dataset includes internal and external validation sets. The results show that the proposed algorithms generate a better approximation of the optimal trade-offs between accuracy and set size. Moreover, validation and test accuracies are better than those provided by NSGA2 and LASSO. Remarkably, the GA-based methods provide biomarkers that achieve a very high prediction accuracy (>80%) with a small number of features (<10), representing a valid alternative to known biomarker models, such as Pam50 and MammaPrint.
引用
收藏
页码:ii20 / ii26
页数:7
相关论文
共 16 条
[1]   Systematic review of the clinical and economic value of gene expression profiles for invasive early breast cancer available in Europe [J].
Blok, E. J. ;
Bastiaannet, E. ;
van den Hout, W. B. ;
Liefers, G. J. ;
Smit, V. T. H. B. M. ;
Kroep, J. R. ;
van de Velde, C. J. H. .
CANCER TREATMENT REVIEWS, 2018, 62 :74-90
[2]   Clinical Value of RNA Sequencing-Based Classifiers for Prediction of the Five Conventional Breast Cancer Biomarkers: A Report From the Population-Based Multicenter Sweden Cancerome Analysis Network-Breast Initiative [J].
Brueffer, Christian ;
Vallon-Christersson, Johan ;
Grabau, Dorthe ;
Ehinger, Anna ;
Hakkinen, Jari ;
Hegardt, Cecilia ;
Malina, Janne ;
Chen, Yilun ;
Bendahl, Par-Ola ;
Manjer, Jonas ;
Malmberg, Martin ;
Larsson, Christer ;
Loman, Niklas ;
Ryden, Lisa ;
Borg, Ake ;
Saal, Lao H. .
JCO PRECISION ONCOLOGY, 2018, 2 :1-18
[3]   A fast and elitist multiobjective genetic algorithm: NSGA-II [J].
Deb, K ;
Pratap, A ;
Agarwal, S ;
Meyarivan, T .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2002, 6 (02) :182-197
[4]  
Elian Fahed A, 2018, Oncotarget, V9, P8165, DOI 10.18632/oncotarget.22742
[5]   Feature set optimization in biomarker discovery from genome-scale data [J].
Fortino, V. ;
Scala, G. ;
Greco, D. .
BIOINFORMATICS, 2020, 36 (11) :3393-3400
[6]   Association between female breast cancer and cutaneous melanoma [J].
Goggins, W ;
Gao, W ;
Tsao, H .
INTERNATIONAL JOURNAL OF CANCER, 2004, 111 (05) :792-794
[7]   SOX10 Promotes Melanoma Cell Invasion by Regulating Melanoma Inhibitory Activity [J].
Graf, Saskia A. ;
Busch, Christian ;
Bosserhoff, Anja-Katrin ;
Besch, Robert ;
Berking, Carola .
JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2014, 134 (08) :2212-2220
[8]   Stable feature selection for biomarker discovery [J].
He, Zengyou ;
Yu, Weichuan .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2010, 34 (04) :215-225
[9]  
Holland J.H., 1992, Adaptation in Natural and Artificial Systems
[10]   Breast Cancer Consensus Subtypes: A system for subtyping breast cancer tumors based on gene expression [J].
Horr, Christina ;
Buechler, Steven A. .
NPJ BREAST CANCER, 2021, 7 (01)