Detection of colon cancer based on microarray dataset using machine learning as a feature selection and classification techniques

被引:22
|
作者
Shafi, A. S. M. [1 ,2 ]
Molla, M. M. Imran [2 ]
Jui, Julakha Jahan [3 ]
Rahman, Mohammad Motiur [1 ]
机构
[1] Mawlana Bhashani Sci & Technol Univ, Dept Comp Sci & Engn, Tangail 1902, Bangladesh
[2] Khwaja Yunus Ali Univ, Fac Comp Sci & Engn, Sirajgonj 6751, Bangladesh
[3] Univ Malaysia Pahang, Fac Elect & Elect Engn, Pekan 26600, Pahang, Malaysia
来源
SN APPLIED SCIENCES | 2020年 / 2卷 / 07期
关键词
Colon cancer; Microarray data; Feature selection; Machine learning; Random forest; Cross validation; PARTICLE SWARM OPTIMIZATION; GENE; PREDICTION;
D O I
10.1007/s42452-020-3051-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microarray data is an increasingly important tool for providing information on gene expression for analysis and interpretation. Researchers attempt to utilize the smallest possible set of relevant gene expression profiles in most gene expression studies to enhance tumor identification accuracy. This research aims to analyze and predicts colon cancer data employing a machine learning approach and feature selection technique based on a random forest classifier. More particularly, our proposed method can reduce the burden of high dimensional data and allow faster calculations by combining the "Mean Decrease Accuracy" and "Mean Decrease Gini" as feature selection methods into a renowned classifier namely Random Forest, with the aim of increasing the prediction model's accuracy level. In addition, we have also shown a comparative model analysis with selection of features and model without selection of features. The extensive experimental results have demonstrated that the proposed model with feature selection is favorable and effective which triumphs the best performance of accuracy.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Machine Learning and Ensemble Learning Techniques for Intrusion Detection Systems: A Performance Analysis Based on Feature Selection Methods
    Basarslan, Muhammet Sinan
    Turgut, Zeynep
    INTELLIGENT AND FUZZY SYSTEMS, VOL 3, INFUS 2024, 2024, 1090 : 117 - 124
  • [42] Ensemble Classification Model With CFS-IGWO-Based Feature Selection for Cancer Detection Using Microarray Data
    Panda, Pinakshi
    Bisoy, Sukant Kishoro
    Kautish, Sandeep
    Ahmad, Reyaz
    Irshad, Asma
    Sarwar, Nadeem
    INTERNATIONAL JOURNAL OF TELEMEDICINE AND APPLICATIONS, 2024, 2024
  • [43] Feature Selection Approach for Phishing Detection Based on Machine Learning
    Wei, Yi
    Sekiya, Yuji
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON APPLIED CYBER SECURITY (ACS) 2021, 2022, 378 : 61 - 70
  • [44] Gene selection and classification for cancer microarray data based on machine learning and similarity measures
    Liu, Qingzhong
    Sung, Andrew H.
    Chen, Zhongxue
    Liu, Jianzhong
    Chen, Lei
    Qiao, Mengyu
    Wang, Zhaohui
    Huang, Xudong
    Deng, Youping
    BMC GENOMICS, 2011, 12
  • [45] Predicting Students Performance Using Supervised Machine Learning Based on Imbalanced Dataset and Wrapper Feature Selection
    Alija S.
    Beqiri E.
    Gaafar A.S.
    Hamoud A.K.
    Informatica (Slovenia), 2023, 47 (01): : 11 - 20
  • [46] Feature selection and classification in breast cancer prediction using IoT and machine learning
    Gopal, V. Nanda
    Al-Turjman, Fadi
    Kumar, R.
    Anand, L.
    Rajesh, M.
    MEASUREMENT, 2021, 178
  • [47] Robust machine learning based Intrusion detection system using simple statistical techniques in feature selection
    Kaushik, Sunil
    Bhardwaj, Akashdeep
    Almogren, Ahmad
    Bharany, Salil
    Altameem, Ayman
    Rehman, Ateeq Ur
    Hussen, Seada
    Hamam, Habib
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [48] Feature selection with Fast Correlation-Based Filter for Breast cancer prediction and Classification using Machine Learning Algorithms
    Khourdifi, Youness
    Bahaj, Mohamed
    2018 INTERNATIONAL SYMPOSIUM ON ADVANCED ELECTRICAL AND COMMUNICATION TECHNOLOGIES (ISAECT), 2018,
  • [49] Android Malware Detection Using Machine Learning with Feature Selection Based on the Genetic Algorithm
    Lee, Jaehyeong
    Jang, Hyuk
    Ha, Sungmin
    Yoon, Yourim
    MATHEMATICS, 2021, 9 (21)
  • [50] Feature selection with ensemble learning for prostate cancer diagnosis from microarray gene expression
    Gumaei, Abdu
    Sammouda, Rachid
    Al-Rakhami, Mabrook
    AlSalman, Hussain
    El-Zaart, Ali
    HEALTH INFORMATICS JOURNAL, 2021, 27 (01)