Hybridizing Artificial Neural Networks Through Feature Selection Based Supervised Weight Initialization and Traditional Machine Learning Algorithms for Improved Colon Cancer Prediction

被引:0
作者
Sajjad Ahmed Nadeem, Malik [1 ,2 ]
Hammad Waseem, Muhammad [1 ,2 ]
Aziz, Wajid [1 ]
Habib, Usman [3 ]
Masood, Anum [4 ]
Attique Khan, Muhammad [5 ]
机构
[1] Univ Azad Jammu & Kashmir, Dept Comp Sci & Informat Technol, Muzaffarabad 13100, Pakistan
[2] Univ Azad Jammu & Kashmir, Dept Comp Sci, Neelum Campus, Athmuqam 13230, Pakistan
[3] Natl Univ Comp & Emerging Sci, FAST Sch Comp, Software Engn Dept, Islamabad 44000, Pakistan
[4] Norwegian Univ Sci & Technol, Dept Phys, N-7491 Trondheim, Norway
[5] Lebanese American Univ, Dept Comp Sci & Math, Beirut 11022801, Lebanon
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Accuracy; Feature extraction; Classification algorithms; Task analysis; Classification tree analysis; Prediction algorithms; Machine learning; Artificial neural networks; Computer aided diagnosis; Cancer; classification; feature selection; genetic profile; machine learning; hybrid classifiers; artificial neural network; SUPPORT VECTOR MACHINE; GENE-EXPRESSION DATA; COMPREHENSIVE SURVEY; CLASSIFICATION; CHALLENGES; KERNEL;
D O I
10.1109/ACCESS.2024.3422317
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Computer-aided decision support systems (DSSs) are becoming popular in a variety of professions. Notably, medical DSSs assist healthcare professionals (decision makers) choose the optimal course of action (decisions) while treating patients. Such systems help decision-makers in situations when there is uncertainty in manual decisions due to lack of information or expertise. Choosing a suitable learning algorithm in a DSS is essential and affects its performance. Among machine learning (ML) algorithms, artificial neural networks (ANNs) are considered the most suitable framework for many classification tasks. In healthcare, an ML-based prediction system/DSS employs data (genetic profile or clinical characteristics) and learning algorithms to forecast target values, which may give promising results. However, improving prediction accuracy is a crucial step in making informed decisions. One can apply various preprocessing methods (cross validation, feature selection, bagging, boosting, etc.) to achieve this. For complex classification tasks like cancer, decision-makers can utilize the hybridization of classifiers to increase prediction accuracy. The presented study investigates the possibilities of improvements in the design of hybridized systems for DSSs to assist healthcare professionals in robust decision-making before, during, and after cancer diagnosis. Since the network weights and the activation functions are the two crucial elements in the learning process of an ANN, this study is organized to investigates the improvement in the hybrid system by selecting suitable features from gene expression microarray data and using these features to compute the more realistic initial weights instead of using random guesses as initial weights for ANN. The use of the proposed framework gives promising results (upto 6.67% gain in accuracy when compared to previous study (Table-5) while 10.43% increase in accuracy when compared to conventional ML classifiers (Table-4).
引用
收藏
页码:97099 / 97114
页数:16
相关论文
共 75 条
  • [1] Abdulqader D.M., 2020, Machine Learning, V62, P233
  • [2] Performance Comparison of Support Vector Machine, Random Forest, and Extreme Learning Machine for Intrusion Detection
    Ahmad, Iftikhar
    Basheri, Mohammad
    Iqbal, Muhammad Javed
    Rahim, Aneel
    [J]. IEEE ACCESS, 2018, 6 : 33789 - 33795
  • [3] An Efficient Deep Learning-Based Skin Cancer Classifier for an Imbalanced Dataset
    Alam, Talha Mahboob
    Shaukat, Kamran
    Khan, Waseem Ahmad
    Hameed, Ibrahim A.
    Abd Almuqren, Latifah
    Raza, Muhammad Ahsan
    Aslam, Memoona
    Luo, Suhuai
    [J]. DIAGNOSTICS, 2022, 12 (09)
  • [4] Feature selection methods on gene expression microarray data for cancer classification: A systematic review
    Alhenawi, Esra'a
    Al-Sayyed, Rizik
    Hudaib, Amjad
    Mirjalili, Seyedali
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 140
  • [5] AlJame I., 2020, Informat. Med.Unlocked, V21
  • [6] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [7] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [8] [Anonymous], 2021, Med. J., V134, P783
  • [9] [Anonymous], 2023, IETE J. Res., V69, P9
  • [10] [Anonymous], 2003, Int. J. Pattern Recognit. Artif. Intell., V17, P459