Robust microarray data feature selection using a correntropy based distance metric learning approach

被引:4
作者
Vahabzadeh, Venus [1 ]
Moattar, Mohammad Hossein [1 ]
机构
[1] Islamic Azad Univ, Dept Software Engn, Mashhad Branch, Mashhad, Iran
关键词
Microarray data classifications; Feature selection; Distance metric learning; Robustness; Correntropy; GENE SELECTION; CLASSIFICATION; ALGORITHM;
D O I
10.1016/j.compbiomed.2023.107056
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Classification of high-dimensional microarray data is a challenge in bioinformatics and genetic data processing. One of the challenging issues of feature selection is the presence of outliers. The Euclidean distance metric is sensitive to outliers. In this study, a distance metric learning based feature selection approach that uses the correntropy function as the discrimination metric is proposed. For this purpose, the metric learning problem is formulated as an optimization problem and solved using the Lagrange method. The output of the approach signifies the most important and robust features. After feature selection, different classification methods such as SVM, decision trees, and NN classifiers are used to investigate the classification accuracy of the proposed method as well as precision, recall, and F-measure. Experiments are carried out on 13 high-dimensional datasets and show that the proposed method outperforms the previous models in terms of accuracy and robustness.
引用
收藏
页数:10
相关论文
共 59 条
[1]   A feature selection technique for classificatory analysis [J].
Ahmad, A ;
Dey, L .
PATTERN RECOGNITION LETTERS, 2005, 26 (01) :43-56
[2]   Feature selection methods on gene expression microarray data for cancer classification: A systematic review [J].
Alhenawi, Esra'a ;
Al-Sayyed, Rizik ;
Hudaib, Amjad ;
Mirjalili, Seyedali .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 140
[3]   Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection [J].
Ang, Jun Chin ;
Mirzal, Andri ;
Haron, Habibollah ;
Hamed, Haza Nuzly Abdull .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (05) :971-989
[4]  
[Anonymous], 2013, INT J COMPUT APPL, DOI DOI 10.5120/11482-7181
[5]  
Arunkumar C., 2018, Future Computing and Informatics Journal, V3, P131, DOI [10.1016/j.fcij.2018.02.002, 10.1016/j.fcij.2018.02.002]
[6]  
Awada W, 2012, 2012 IEEE 13TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), P356, DOI 10.1109/IRI.2012.6303031
[7]   Feature selection using Joint Mutual Information Maximisation [J].
Bennasar, Mohamed ;
Hicks, Yulia ;
Setchi, Rossitza .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) :8520-8532
[8]   Supervised Variational Relevance Learning, An Analytic Geometric Feature Selection with Applications to Omic Datasets [J].
Boareto, Marcelo ;
Cesar, Jonatas ;
Leite, Vitor B. P. ;
Caticha, Nestor .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (03) :705-711
[9]   A review of microarray datasets and applied feature selection methods [J].
Bolon-Canedo, V. ;
Sanchez-Marono, N. ;
Alonso-Betanzos, A. ;
Benitez, J. M. ;
Herrera, F. .
INFORMATION SCIENCES, 2014, 282 :111-135
[10]   A hybrid feature selection approach for Microarray datasets using graph theoretic-based method [J].
Chamlal, Hasna ;
Ouaderhman, Tayeb ;
Rebbah, Fatima Ezzahra .
INFORMATION SCIENCES, 2022, 615 :449-474