Support vector machine ensembles for discriminant analysis for ranking principal components

被引:0
作者
Tiene A. Filisbino
Gilson A. Giraldi
Carlos E. Thomaz
机构
[1] National Laboratory for Scientific Computing,Coordination of Mathematical and Computational Methods
[2] Quitandinha,undefined
[3] Department of Electrical Engineering,undefined
[4] FEI,undefined
来源
Multimedia Tools and Applications | 2020年 / 79卷
关键词
PCA; Ranking PCA components; Separating hyperplanes; Ensemble methods; AdaBoost; Face image analysis;
D O I
暂无
中图分类号
学科分类号
摘要
The problemof ranking linear subspaces in principal component analysis (PCA), for multi-class classification tasks, has been addressed by building support vector machine (SVM) ensembles and AdaBoost.M2 technique. This methodology, named multi-class discriminant principal components analysis (Multi-Class.M2 DPCA), is motivated by the fact that the first PCA components do not necessarily represent important discriminant directions to separate sample groups. The Multi-Class.M2 DPCA proposal presents fundamental issues related to the weakening methodology, parametrization, strategy for SVM bias, and classification versus reconstruction performance. Also, it is observed a lack of comparisons between Multi-Class.M2 DPCA and feature weighting techniques. Motivated by these facts, this paper firstly presents a unified formulation to generate weakened SVM approaches and to derive different strategies of the literature. These strategies are analyzed within Multi-Class.M2 DPCA methodology and its parametrization to realize the best one for ranking PCA features in face image analysis. Moreover, this work proposes variants to improve that Multi-Class.M2 DPCA configuration using strategies that incorporate SVM bias and sensitivity analysis results. The obtained Multi-Class.M2 DPCA setups are applied in the computational experiments for both classification and reconstruction problems. The results show that Multi-Class.M2 DPCA achieves higher recognition rates using less PCA features, as well as robust reconstruction and interpretation of the data.
引用
收藏
页码:25277 / 25313
页数:36
相关论文
共 124 条
  • [1] Ang JC(2016)Supervised, unsupervised, and semi-supervised feature selection: a review on gene selection IEEE/ACM T Comput Biology Bioinform 13 971-989
  • [2] Mirzal A(2000)Generalized discriminant analysis using a kernel approach Neural Comput 12 2385-2404
  • [3] Haron H(2009)A simple and elementary proof of the karush–kuhn–tucker theorem for inequality-constrained optimization Optim Lett 3 7-10
  • [4] Hamed HNA(2014)A survey on feature selection methods Comput Electr Eng 40 16-28
  • [5] Baudat G(2019)A survey on image data augmentation for deep learning J Big Data 6 1-48
  • [6] Anouar F(2015)Linear dimensionality reduction: survey, insights, and generalizations J Mach Learn Res 16 2859-2900
  • [7] Brezhneva OA(2005)Minimum redundancy feature selection from microarray gene expression data J Bioinforma Comput Biol 03 185-205
  • [8] Tret’yakov AA(2006)Gene selection and classification of microarray data using random forest BMC Bioinforma 7 3-51373
  • [9] Wright SE(2018)A hybrid seasonal mechanism with a chaotic cuckoo search algorithm with a support vector regression model for electric load forecasting Energies 11 1009-307
  • [10] Chandrashekar G(2018)Automatic multicarrier waveform classification via PCA and convolutional neural networks IEEE Access 6 51365-11