Transforming Cancer Classification: The Role of Advanced Gene Selection

被引:5
作者
Yaqoob, Abrar [1 ]
Mir, Mushtaq Ahmad [2 ]
Rao, G. V. V. Jagannadha [3 ]
Tejani, Ghanshyam G. [4 ,5 ]
机构
[1] VIT Bhopal Univ, Sch Adv Sci & Language, Bhopal 466114, India
[2] King Khalid Univ, Coll Appl Med Sci, Dept Clin Lab Sci, Abha 61421, Saudi Arabia
[3] Kalinga Univ, Dept Math, Raipur 492001, India
[4] Yuan Ze Univ, Dept Ind Engn & Management, Taoyuan City 320315, Taiwan
[5] Jadara Univ, Jadara Res Ctr, Irbid 21110, Jordan
关键词
cancer classification; Mutual Information; Particle Swarm Optimization; Support Vector Machine; ALGORITHM;
D O I
10.3390/diagnostics14232632
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background/Objectives: Accurate classification in cancer research is vital for devising effective treatment strategies. Precise cancer classification depends significantly on selecting the most informative genes from high-dimensional datasets, a task made complex by the extensive data involved. This study introduces the Two-stage MI-PSA Gene Selection algorithm, a novel approach designed to enhance cancer classification accuracy through robust gene selection methods. Methods: The proposed method integrates Mutual Information (MI) and Particle Swarm Optimization (PSO) for gene selection. In the first stage, MI acts as an initial filter, identifying genes rich in cancer-related information. In the second stage, PSO refines this selection to pinpoint an optimal subset of genes for accurate classification. Results: The experimental findings reveal that the MI-PSA method achieves a best classification accuracy of 99.01% with a selected subset of 19 genes, substantially outperforming the MI and SVM methods, which attain best accuracies of 93.44% and 91.26%, respectively, for the same gene count. Furthermore, MI-PSA demonstrates superior performance in terms of average and worst-case accuracy, underscoring its robustness and reliability. Conclusions: The MI-PSA algorithm presents a powerful approach for identifying critical genes essential for precise cancer classification, advancing both our understanding and management of this complex disease.
引用
收藏
页数:19
相关论文
共 44 条
[1]   Uncertainty quantification in skin cancer classification using three-way decision-based Bayesian deep learning [J].
Abdar, Moloud ;
Samami, Maryam ;
Mahmoodabad, Sajjad Dehghani ;
Doan, Thang ;
Mazoure, Bogdan ;
Hashemifesharaki, Reza ;
Liu, Li ;
Khosravi, Abbas ;
Acharya, U. Rajendra ;
Makarenkov, Vladimir ;
Nahavandi, Saeid .
COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135
[2]   Deep learning-based computer aided diagnosis model for skin cancer detection and classification [J].
Adla, Devakishan ;
Reddy, G. Venkata Rami ;
Nayak, Padmalaya ;
Karuna, G. .
DISTRIBUTED AND PARALLEL DATABASES, 2022, 40 (04) :717-736
[3]   Binary biogeography-based optimization based SVM-RFE for feature selection [J].
Albashish, Dheeb ;
Hammouri, Abdelaziz, I ;
Braik, Malik ;
Atwan, Jaffar ;
Sahran, Shahnorbanun .
APPLIED SOFT COMPUTING, 2021, 101
[4]   Regularized logistic regression with adjusted adaptive elastic net for gene selection in high dimensional cancer classification [J].
Algamal, Zakariya Yahya ;
Lee, Muhammad Hisyam .
COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 67 :136-145
[5]   Computer-aided diagnosis for breast cancer classification using deep neural networks and transfer learning [J].
Aljuaid, Hanan ;
Alturki, Nazik ;
Alsubaie, Najah ;
Cavallaro, Lucia ;
Liotta, Antonio .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 223
[6]   mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling [J].
Alshamlan, Hala ;
Badr, Ghada ;
Alohali, Yousef .
BIOMED RESEARCH INTERNATIONAL, 2015, 2015
[7]   Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification [J].
Alshamlan, Hala M. ;
Badr, Ghada H. ;
Alohali, Yousef A. .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 56 :49-60
[8]   Overview and comparative study of dimensionality reduction techniques for high dimensional data [J].
Ayesha, Shaeela ;
Hanif, Muhammad Kashif ;
Talib, Ramzan .
INFORMATION FUSION, 2020, 59 :44-58
[9]  
Aziz R., 2018, Ann. Data Sci., V5, P615, DOI [10.1007/s40745-018-0155-2, DOI 10.1007/S40745-018-0155-2]
[10]   LGBM: a machine learning approach for Ethereum fraud detection [J].
Aziz R.M. ;
Baluch M.F. ;
Patel S. ;
Ganie A.H. .
International Journal of Information Technology, 2022, 14 (7) :3321-3331