Toward improving the performance of learning by joining feature selection and ensemble classification techniques: an application for cancer diagnosis

被引:0
作者
Wang, Dan [1 ]
机构
[1] Zaozhuang Hosp Tradit Chinese Med, Zaozhuang 277000, Shandong, Peoples R China
关键词
Cancer diagnosis; Feature selection; Ensemble classification; Adaptive Differential Evolution; Learning Vector Quantization; SUPPORT VECTOR MACHINE; BREAST-CANCER; SYSTEMS;
D O I
10.1007/s00432-023-05422-6
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
IntroductionBreast cancer is known as the most common type of cancer in women, and this has raised the importance of its diagnosis in medical science as one of the most important issues. In addition to reducing costs, the diagnosis of benign or malignant breast cancer is very important in determining the treatment method.ObjectiveThe purpose of this paper is to present a model based on data mining techniques including feature selection and ensemble classification that can accurately predict breast cancer patients in the early stages.MethodologyThe proposed breast cancer detection model is developed by joining Adaptive Differential Evolution (ADE) algorithm for feature selection and Learning Vector Quantization (LVQ) neural network for classification. Our proposed model as ADE-LVQ has the ability to automatically and quickly diagnose breast cancer patients into two classes, benign and malignant. As a new evolutionary approach, ADE performs optimal configuration for LVQ neural network in addition to selecting effective features from breast cancer data. Meanwhile, we configure an ensemble classification technique based on LVQ, which significantly improves the prediction performance.ResultsADE-LVQ has been analyzed from different perspectives on different datasets from Wisconsin breast cancer database. We apply different approaches to handle missing values and improve data quality on this database. The results of the simulations showed that the ADE-LVQ model is more successful than the equivalent and state-of-the-art models in diagnosing breast cancer patients. Also, ADE-LVQ provides better performance with less complexity, considering feature selection and ensemble learning. In particular, ADE-LVQ improves accuracy (up to 3.4%) and runtime (up to 2.3%) on average compared to the existing best method.ConclusionCombined methods based on data mining techniques for breast cancer diagnosis can help doctors in making better decisions for disease treatment.
引用
收藏
页码:16993 / 17006
页数:14
相关论文
共 53 条
[1]   Normalized Neural Networks for Breast Cancer Classification [J].
Alickovic, Emina ;
Subasi, Abdulhamit .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING, CMBEBIH 2019, 2020, 73 :519-524
[2]   Breast cancer diagnosis using GA feature selection and Rotation Forest [J].
Alickovic, Emina ;
Subasi, Abdulhamit .
NEURAL COMPUTING & APPLICATIONS, 2017, 28 (04) :753-763
[3]  
[Anonymous], 2001, P INT DAT AN MED PHA
[4]  
Aruna S., 2011, Comput. Sci. Inf. Technol., P37, DOI DOI 10.5121/CSIT.2011.1205
[5]   An intelligent ensemble classification method based on multi-layer perceptron neural network and evolutionary algorithms for breast cancer diagnosis [J].
Azad, Saeed Talatian ;
Ahmadi, Gholamreza ;
Rezaeipanah, Amin .
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2022, 34 (06) :949-969
[6]  
Bazazeh D, 2016, INT C ELECT DEVICE S
[7]  
Bilalovic O, 2018, 2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), P279, DOI 10.23919/MIPRO.2018.8400053
[8]   webTWAS: a resource for disease candidate susceptibility genes identified by transcriptome-wide association study [J].
Cao, Chen ;
Wang, Jianhua ;
Kwok, Devin ;
Cui, Feifei ;
Zhang, Zilong ;
Zhao, Da ;
Li, Mulin Jun ;
Zou, Quan .
NUCLEIC ACIDS RESEARCH, 2022, 50 (D1) :D1123-D1130
[9]   Fault Detection and Performance Recovery Design With Deferred Actuator Replacement via a Low-Computation Method [J].
Cheng, Fabin ;
Niu, Ben ;
Xu, Ning ;
Zhao, Xudong ;
Ahmad, Adil M. .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (03) :4646-4656
[10]   Breast cancer classification with reduced feature set using association rules and support vector machine [J].
Ed-daoudy, Abderrahmane ;
Maalmi, Khalil .
NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2020, 9 (01)