Linear discriminant analysis: A detailed tutorial

被引:587
作者
Tharwat, Alaa [1 ,2 ,6 ]
Gaber, Tarek [3 ,6 ]
Ibrahim, Abdelhameed [4 ,6 ]
Hassanien, Aboul Ella [5 ,6 ]
机构
[1] Frankfurt Univ Appl Sci, Dept Comp Sci & Engn, Frankfurt, Germany
[2] Suez Canal Univ, Fac Engn, Ismailia, Egypt
[3] Suez Canal Univ, Fac Comp & Informat, Ismailia, Egypt
[4] Mansoura Univ, Fac Engn, Mansoura, Egypt
[5] Cairo Univ, Fac Comp & Informat, Cairo, Egypt
[6] Sci Res Grp Egypt, Cairo, Egypt
关键词
Dimensionality reduction; PCA; LDA; Kernel Functions; Class-Dependent LDA; Class-Independent LDA; SSS (Small Sample Size) problem; eigenvectors artificial intelligence; PRINCIPAL COMPONENT ANALYSIS; SAMPLE-SIZE PROBLEM; FEATURE-SELECTION; FACE RECOGNITION; NULL SPACE; FOURIER-TRANSFORM; DIRECT LDA; CLASSIFICATION; REDUCTION; MATRIX;
D O I
10.3233/AIC-170729
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Linear Discriminant Analysis (LDA) is a very common technique for dimensionality reduction problems as a preprocessing step for machine learning and pattern classification applications. At the same time, it is usually used as a black box, but (sometimes) not well understood. The aim of this paper is to build a solid intuition for what is LDA, and how LDA works, thus enabling readers of all levels be able to get a better understanding of the LDA and to know how to apply this technique in different applications. The paper first gave the basic definitions and steps of how LDA technique works supported with visual explanations of these steps. Moreover, the two methods of computing the LDA space, i.e. class-dependent and class-independent methods, were explained in details. Then, in a step-by-step approach, two numerical examples are demonstrated to show how the LDA space can be calculated in case of the class-dependent and class-independent methods. Furthermore, two of the most common LDA problems (i.e. Small Sample Size (SSS) and non-linearity problems) were highlighted and illustrated, and state-of-the-art solutions to these problems were investigated and explained. Finally, a number of experiments was conducted with different datasets to (1) investigate the effect of the eigenvectors that used in the LDA space on the robustness of the extracted feature for the classification accuracy, and (2) to show when the SSS problem occurs and how it can be addressed.
引用
收藏
页码:169 / 190
页数:22
相关论文
共 85 条
  • [21] COMPUTER-AIDED CLASSIFICATION OF MAMMOGRAPHIC MASSES AND NORMAL TISSUE - LINEAR DISCRIMINANT-ANALYSIS IN TEXTURE FEATURE SPACE
    CHAN, HP
    WEI, DT
    HELVIE, MA
    SAHINER, B
    ADLER, DD
    GOODSITT, MM
    PETRICK, N
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 1995, 40 (05) : 857 - 876
  • [22] A new LDA-based face recognition system which can solve the small sample size problem
    Chen, LF
    Liao, HYM
    Ko, MT
    Lin, JC
    Yu, GJ
    [J]. PATTERN RECOGNITION, 2000, 33 (10) : 1713 - 1726
  • [23] A rapid method to screen for cell-wall mutants using discriminant analysis of Fourier transform infrared spectra
    Chen, LM
    Carpita, NC
    Reiter, WD
    Wilson, RH
    Jeffries, C
    McCann, MC
    [J]. PLANT JOURNAL, 1998, 16 (03) : 385 - 392
  • [24] COOMANS D, 1979, ANAL CHIM ACTA-COMP, V3, P97
  • [25] Face recognition by regularized discriminant analysis
    Dai, Dao-Qing
    Yuen, Pong C.
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (04): : 1080 - 1085
  • [26] Donoho D, 2004, ADV NEUR IN, V16, P1141
  • [27] Comparison of discrimination methods for the classification of tumors using gene expression data
    Dudoit, S
    Fridlyand, J
    Speed, TP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
  • [28] REGULARIZED DISCRIMINANT-ANALYSIS
    FRIEDMAN, JH
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1989, 84 (405) : 165 - 175
  • [29] Plant Identification: Two Dimensional-Based Vs. One Dimensional-Based Feature Extraction Methods
    Gaber, Tarek
    Tharwat, Alaa
    Snasel, Vaclav
    Hassanien, Aboul Ella
    [J]. 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS, 2015, 368 : 375 - 385
  • [30] Biometric cattle identification approach based on Weber's Local Descriptor and AdaBoost classifier
    Gaber, Tarek
    Tharwat, Alaa
    Hassanien, Aboul Ella
    Snasel, Vaclav
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2016, 122 : 55 - 66