Texture classification using feature selection and kernel-based techniques

被引:0
作者
Carlos Fernandez-Lozano
Jose A. Seoane
Marcos Gestal
Tom R. Gaunt
Julian Dorado
Colin Campbell
机构
[1] University of A Coruña,Information and Communications Technologies Department, Faculty of Computer Science
[2] University of Bristol,Bristol Genetic Epidemiology Laboratories, School of Social and Community Medicine
[3] Stanford University,Stanford Cancer Institute, Stanford School of Medicine
[4] University of Bristol,MRC Integrative Epidemiology Unit, School of Social and Community Medicine
[5] University of Bristol,Intelligent Systems Laboratory
来源
Soft Computing | 2015年 / 19卷
关键词
Multiple kernel learning; Support vector machines; Feature selection; Texture analysis; Recursive feature elimination;
D O I
暂无
中图分类号
学科分类号
摘要
The interpretation of the results in a classification problem can be enhanced, specially in image texture analysis problems, by feature selection techniques, knowing which features contribute more to the classification performance. This paper presents an evaluation of a number of feature selection techniques for classification in a biomedical image texture dataset (2-DE gel images), with the aim of studying their performance and the stability in the selection of the features. We analyse three different techniques: subgroup-based multiple kernel learning (MKL), which can perform a feature selection by down-weighting or eliminating subsets of features which shares similar characteristic, and two different conventional feature selection techniques such as recursive feature elimination (RFE), with different classifiers (naive Bayes, support vector machines, bagged trees, random forest and linear discriminant analysis), and a genetic algorithm-based approach with an SVM as decision function. The different classifiers were compared using a ten times tenfold cross-validation model, and the best technique found is SVM-RFE, with an AUROC score of (95.88±0.39%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$95.88 \pm 0.39\,\%$$\end{document}). However, this method is not significantly better than RFE-TREE, RFE-RF and grouped MKL, whilst MKL uses lower number of features, increasing the interpretability of the results. MKL selects always the same features, related to wavelet-based textures, while RFE methods focuses specially co-occurrence matrix-based features, but with high instability in the number of features selected.
引用
收藏
页码:2469 / 2480
页数:11
相关论文
共 254 条
[51]  
Gestal M(1992)Segmenting ultrasound images of the prostate using neural networks Ultrason Imaging 14 159-1370
[52]  
Pedreira N(1979)Using weighted rankings in the analysis of complete blocks with additive block effects J Am Stat Assoc 74 680-2521
[53]  
Dorado J(2003)Variable selection using SVM based criteria J Mach Learn Res 3 1357-140
[54]  
Pazos A(2008)Simple MKL J Mach Learn Res 9 2491-688
[55]  
Fernandez-Lozano C(1996)Automated texture-based segmentation of ultrasound images of the prostate Comput Med Imaging Graph 20 131-2517
[56]  
Fernandez-Blanco E(2011)pROC: an open-source package for R and S+ to analyze and compare ROC curves BMC Bioinform 12 77-1684
[57]  
Dave K(2009)Development of tolerant features for characterization of masses in mammograms Comput Biol Med 39 678-611
[58]  
Pedreira N(2007)A review of feature selection techniques in bioinformatics Bioinformatics 23 2507-347
[59]  
Gestal M(1996)Image feature selection by a genetic algorithm: application to classification of mass and normal breast tissue Med Phys 23 1671-1565
[60]  
Dorado J(1965)An analysis of variance test for normality (complete samples) Biometrika 52 591-29