Detection of colon cancer based on microarray dataset using machine learning as a feature selection and classification techniques

被引:22
|
作者
Shafi, A. S. M. [1 ,2 ]
Molla, M. M. Imran [2 ]
Jui, Julakha Jahan [3 ]
Rahman, Mohammad Motiur [1 ]
机构
[1] Mawlana Bhashani Sci & Technol Univ, Dept Comp Sci & Engn, Tangail 1902, Bangladesh
[2] Khwaja Yunus Ali Univ, Fac Comp Sci & Engn, Sirajgonj 6751, Bangladesh
[3] Univ Malaysia Pahang, Fac Elect & Elect Engn, Pekan 26600, Pahang, Malaysia
来源
SN APPLIED SCIENCES | 2020年 / 2卷 / 07期
关键词
Colon cancer; Microarray data; Feature selection; Machine learning; Random forest; Cross validation; PARTICLE SWARM OPTIMIZATION; GENE; PREDICTION;
D O I
10.1007/s42452-020-3051-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microarray data is an increasingly important tool for providing information on gene expression for analysis and interpretation. Researchers attempt to utilize the smallest possible set of relevant gene expression profiles in most gene expression studies to enhance tumor identification accuracy. This research aims to analyze and predicts colon cancer data employing a machine learning approach and feature selection technique based on a random forest classifier. More particularly, our proposed method can reduce the burden of high dimensional data and allow faster calculations by combining the "Mean Decrease Accuracy" and "Mean Decrease Gini" as feature selection methods into a renowned classifier namely Random Forest, with the aim of increasing the prediction model's accuracy level. In addition, we have also shown a comparative model analysis with selection of features and model without selection of features. The extensive experimental results have demonstrated that the proposed model with feature selection is favorable and effective which triumphs the best performance of accuracy.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Detection of colon cancer based on microarray dataset using machine learning as a feature selection and classification techniques
    A. S. M. Shafi
    M. M. Imran Molla
    Julakha Jahan Jui
    Mohammad Motiur Rahman
    SN Applied Sciences, 2020, 2
  • [2] Machine learning approaches for classification of colorectal cancer with and without feature selection method on microarray data
    Nazari, Elham
    Aghemiri, Mehran
    Avan, Amir
    Mehrabian, Amin
    Tabesh, Hamed
    GENE REPORTS, 2021, 25
  • [3] Feature selection for classification based on machine learning algorithms for prostate cancer
    Swathypriyadharsini, P.
    Rupashini, P. R.
    Premalatha, K.
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2025, 11 (03):
  • [4] Osteoporosis Detection Using Machine Learning Techniques and Feature Selection
    Iliou, Theodoros
    Anagnostopoulos, Christos-Nikolaos
    Anastassopoulos, George
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2014, 23 (05)
  • [5] Review on intrusion detection using feature selection with machine learning techniques
    Kalimuthan, C.
    Renjit, J. Arokia
    MATERIALS TODAY-PROCEEDINGS, 2020, 33 : 3794 - 3802
  • [6] FEATURE SELECTION AND MACHINE LEARNING CLASSIFICATION FOR MALWARE DETECTION
    Khammas, Ban Mohammed
    Monemi, Alireza
    Bassi, Joseph Stephen
    Ismail, Ismahani
    Nor, Sulaiman Mohd
    Marsono, Muhammad Nadzir
    JURNAL TEKNOLOGI, 2015, 77 (01):
  • [7] Optimized feature selection method using particle swarm intelligence with ensemble learning for cancer classification based on microarray datasets
    Alrefai, Nashat
    Ibrahim, Othman
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16) : 13513 - 13528
  • [8] Android malware detection applying feature selection techniques and machine learning
    Mohammad Reza Keyvanpour
    Mehrnoush Barani Shirzad
    Farideh Heydarian
    Multimedia Tools and Applications, 2023, 82 : 9517 - 9531
  • [9] Optimizing Microarray Gene Selection in Colon Cancer: An Enhanced Metaheuristic Algorithm for Feature Selection
    Benghazouani, Salsabila
    Nouh, Said
    Zakrani, Abdelali
    INFORMATION TECHNOLOGIES AND THEIR APPLICATIONS, PT II, ITTA 2024, 2025, 2226 : 76 - 86
  • [10] Gene Microarray Cancer Classification using Correlation Based Feature Selection Algorithm and Rules Classifiers
    Al-Batah, Mohammad
    Zaqaibeh, Belal
    Alomari, Saleh Ali
    Alzboon, Mowafaq Salem
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2019, 15 (08) : 62 - 73