A hybrid convolutional neural network approach for feature selection and disease classification

被引:5
作者
Debata, Prajna Paramita [1 ]
Mohapatra, Puspanjali [2 ]
机构
[1] Int Inst Informat Technol, Dept Comp Sci & Engn, Bhubaneswar, India
[2] Int Inst Informat Technol, Dept Comp Sci & Engn, Bhubaneswar, India
关键词
High dimensional gene expression data; deep learning approach; improvised meta-heuristic algorithm; kernel-based Fisher score; convolutional neural network; GENE-EXPRESSION DATA; MICROARRAY DATA; CANCER; DISCOVERY; ALGORITHM; TUMOR;
D O I
10.3906/elk-2105-43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many researchers have analyzed the high dimensional gene expression data for disease classification using several conventional and machine learning-based approaches, but still there exists some issues which make this task nontrivial. Due to the growing complexities of the unstructured data, the researchers focus on the deep learning approach, which is the latest form of machine learning algorithm. In the presented work, a kernel-based Fisher score (KFS) approach is implemented to extract the notable genes, and an improvised chaotic Jaya (CJaya) algorithm optimized convolutional neural network (CJaya-CNN) model is applied to classify high dimensional gene expression or microarray data. This model is tested on two binary class and two multi class standard microarray datasets. Here, the presented hybrid deep learning model (KFS based CJaya-CNN) has been compared with other standard machine learning classification models like CJaya hybridized multi-layer perceptron (CJaya-MLP), CJaya hybridized extreme learning machine (CJayaELM), and CJaya hybridized kernel extreme learning machine (CJaya-KELM). The suggested model is evaluated by classification accuracy percentage, number of significant genes selected, sensitivity and specificity values with receiver operating characteristic (ROC) curves. Eventually, the experimental outcomes obtained from the presented model has also been compared with the recent existing feature selection and classification models for a suitable research in analysing high dimensional microarray data. The presented model offered the classification accuracy percentage of 98.2, 99.96, 99.78, and 99.87 for colon cancer, leukemia, lymphoma-3, and small round blue cell tumor (SRBCT) datasets, respectively. All the experimental outcomes reveal that the KFS based CJaya-CNN model is outperforming. Hence, the presented method can be used as a dependable framework for disease classification.
引用
收藏
页码:2580 / 2599
页数:20
相关论文
共 39 条
[1]  
Aiguo Wang, 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), P74, DOI 10.1109/BIBM.2014.6999251
[2]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[3]   mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling [J].
Alshamlan, Hala ;
Badr, Ghada ;
Alohali, Yousef .
BIOMED RESEARCH INTERNATIONAL, 2015, 2015
[4]   Genetic Bee Colony (GBC) algorithm: A new gene selection method for microarray cancer classification [J].
Alshamlan, Hala M. ;
Badr, Ghada H. ;
Alohali, Yousef A. .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 56 :49-60
[5]   Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection [J].
Ang, Jun Chin ;
Mirzal, Andri ;
Haron, Habibollah ;
Hamed, Haza Nuzly Abdull .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (05) :971-989
[6]   A fuzzy based feature selection from independent component subspace for machine learning classification of microarray data [J].
Aziz, Rabia ;
Verma, C. K. ;
Srivastava, Namita .
GENOMICS DATA, 2016, 8 :4-15
[7]   Analysis of high-dimensional genomic data using MapReduce based probabilistic neural network [J].
Baliarsingh, Santos Kumar ;
Vipsita, Swati ;
Gandomi, Amir H. ;
Panda, Abhijeet ;
Bakshi, Sambit ;
Ramasubbareddy, Somula .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 195
[8]   Analysis of high-dimensional genomic data employing a novel bio-inspired algorithm [J].
Baliarsingh, Santos Kumar ;
Vipsita, Swati ;
Muhammad, Khan ;
Dash, Bodhisattva ;
Bakshi, Sambit .
APPLIED SOFT COMPUTING, 2019, 77 :520-532
[9]   Deep learning approach for microarray cancer data classification [J].
Basavegowda, Hema Shekar ;
Dagnew, Guesh .
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2020, 5 (01) :22-33
[10]  
Bochinski E, 2017, IEEE IMAGE PROC, P3924