A methodology for evaluating multi-objective evolutionary feature selection for classification in the context of virtual screening

Cited by: 8
Authors
Jimenez, Fernando [1]
Perez-Sanchez, Horacio [2]
Palma, Jose [1]
Sanchez, Gracia [1]
Martinez, Carlos [3]
Affiliations
[1] Univ Murcia, Fac Informat, Dept Informat & Commun Engn, E-30100 Murcia, Spain
[2] Catholic Univ San Antonio Murcia UCAM, Comp Engn Dept, Bioinformat & High Performance Comp Res Grp BIOHP, Murcia 30107, Spain
[3] Univ Murcia, Int Doctorate Sch, E-30100 Murcia, Spain
Keywords
Feature selection; Multi-objective evolutionary algorithms; Classification; Decision trees; Virtual screening; Drug discovery; FEATURE SUBSET-SELECTION; DRUG DISCOVERY; DIFFERENTIAL EVOLUTION; GENETIC ALGORITHM; SCORING FUNCTIONS; DOCKING; OPTIMIZATION; DESIGN; MODELS
DOI
10.1007/s00500-018-3479-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Virtual screening (VS) methods have been shown to increase success rates in many drug discovery campaigns when they complement experimental approaches such as high-throughput screening or classical medicinal chemistry. Nevertheless, the predictive capability of VS is not yet optimal, mainly due to limitations in the underlying physical principles describing drug binding phenomena. One way to improve VS methods is with the aid of machine learning. When enough experimental data are available to train such methods, predictive capability can increase considerably. In this work we show how a multi-objective evolutionary search strategy for feature selection, which can provide small and accurate decision trees that chemists can easily understand, can drastically increase the applicability and predictive ability of these techniques and therefore aid considerably in the drug discovery problem. With the proposed methodology, we find classification models with accuracy between 0.9934 and 1.00 and area under the ROC curve between 0.96 and 1.00 when evaluated on the full training sets, and accuracy between 0.9849 and 0.9940 and area under the ROC curve between 0.89 and 0.93 when evaluated with tenfold cross-validation over 30 iterations, while substantially reducing the model size.
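The trade-off the abstract describes, high accuracy with a small model, can be illustrated with a minimal Pareto-dominance sketch. This is not the authors' algorithm: the two objectives (accuracy to maximize, number of selected features to minimize), the names `Candidate`, `dominates` and `pareto_front`, and all numeric values are assumptions made purely for illustration.

```python
from typing import List, Tuple

# One candidate feature subset summarized as (accuracy, n_features).
# Hypothetical representation; the paper's actual encoding is not given here.
Candidate = Tuple[float, int]


def dominates(a: Candidate, b: Candidate) -> bool:
    """Return True if a Pareto-dominates b.

    Accuracy is maximized and the number of features is minimized:
    a must be no worse than b on both objectives and strictly
    better on at least one.
    """
    no_worse = a[0] >= b[0] and a[1] <= b[1]
    strictly_better = a[0] > b[0] or a[1] < b[1]
    return no_worse and strictly_better


def pareto_front(cands: List[Candidate]) -> List[Candidate]:
    """Keep only the candidates that no other candidate dominates."""
    return [c for c in cands if not any(dominates(o, c) for o in cands)]


# Made-up (accuracy, n_features) pairs, not results from the paper.
candidates = [(0.95, 12), (0.97, 20), (0.95, 15), (0.90, 3), (0.89, 5)]
front = pareto_front(candidates)
print(front)
```

A multi-objective evolutionary search would typically maintain and refine such a non-dominated set over generations; the filter above only shows the selection criterion, not the search itself.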
Pages: 8775-8800
Page count: 26
Related papers
50 in total
[41]   Multi-Objective Hyperparameter Tuning and Feature Selection using Filter Ensembles [J].
Binder, Martin ;
Moosbauer, Julia ;
Thomas, Janek ;
Bischl, Bernd .
GECCO'20: PROCEEDINGS OF THE 2020 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2020, :471-479
[42]   Multi-Objective Feature Selection in QSAR Using a Machine Learning Approach [J].
Soto, Axel J. ;
Cecchini, Rocio L. ;
Vazquez, Gustavo E. ;
Ponzoni, Ignacio .
QSAR & COMBINATORIAL SCIENCE, 2009, 28 (11-12) :1509-1523
[43]   A Multi-objective Evolutionary Algorithm with Symmetrical Uncertainty Crossover Operator for Feature Selection in Disease Classification [J].
Hou, Lei ;
Guo, Xiaofang ;
Guan, Shihao .
2024 5TH INTERNATIONAL CONFERENCE ON COMPUTERS AND ARTIFICIAL INTELLIGENCE TECHNOLOGY, CAIT, 2024, :185-190
[44]   Multi-task Optimisation for Multi-objective Feature Selection in Classification [J].
Lin, Jiabin ;
Chen, Qi ;
Xue, Bing ;
Zhang, Mengjie .
PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, :264-267
[45]   Adaptive crossover operator based multi-objective binary genetic algorithm for feature selection in classification [J].
Xue, Yu ;
Zhu, Haokai ;
Liang, Jiayu ;
Slowik, Adam .
KNOWLEDGE-BASED SYSTEMS, 2021, 227
[46]   Advancing text classification: a novel two-stage multi-objective feature selection framework [J].
Liu, Yan ;
Cheng, Xian ;
Stephen, Liao Shaoyi ;
Wei, Shansen .
INFORMATION TECHNOLOGY & MANAGEMENT, 2025,
[47]   A new framework of multi-objective evolutionary algorithms for feature selection and multi-label classification of video data [J].
Gizem Nur Karagoz ;
Adnan Yazici ;
Tansel Dokeroglu ;
Ahmet Cosar .
International Journal of Machine Learning and Cybernetics, 2021, 12 :53-71
[48]   Multi-objective genetic algorithm for multi-view feature selection [J].
Imani, Vandad ;
Sevilla-Salcedo, Carlos ;
Moradi, Elaheh ;
Fortino, Vittorio ;
Tohka, Jussi .
APPLIED SOFT COMPUTING, 2024, 167
[49]   Handling Different Preferences Between Objectives for Multi-objective Feature Selection in Classification [J].
Jiao, Ruwang ;
Xue, Bing ;
Zhang, Mengjie .
AI 2022: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13728 :237-251
[50]   Multi-Objective Evolutionary Algorithms for Feature Selection: Application in Bankruptcy Prediction [J].
Gaspar-Cunha, Antonio ;
Mendes, Fernando ;
Duarte, Joao ;
Vieira, Armando ;
Ribeiro, Bernardete ;
Ribeiro, Andre ;
Neves, Joao .
SIMULATED EVOLUTION AND LEARNING, 2010, 6457 :319-+