On sparse linear discriminant analysis algorithm for high-dimensional data classification

被引:12
|
作者
Ng, Michael K. [1 ,2 ]
Liao, Li-Zhi [2 ]
Zhang, Leihong [3 ]
机构
[1] Hong Kong Baptist Univ, Ctr Math Imaging & Vis, Kowloon Tong, Hong Kong, Peoples R China
[2] Hong Kong Baptist Univ, Dept Math, Kowloon Tong, Hong Kong, Peoples R China
[3] Shanghai Univ Finance & Econ, Dept Appl Math, Shanghai 200433, Peoples R China
关键词
linear discriminant analysis; sparsity; weighting; high-dimensional data; REDUCTION;
D O I
10.1002/nla.736
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we present a sparse linear discriminant analysis (LDA) algorithm for high-dimensional objects in subspaces. In high dimensional data, groups of objects often exist in subspaces rather than in the entire space. For example, in text data classification, groups of documents of different types are categorized by different subsets of terms. The terms for one group may not occur in the samples of other groups. In the new algorithm, we consider a LDA to calculate a weight for each dimension and use the weight values to identify the subsets of important dimensions in the discriminant vectors that categorize different groups. This is achieved by including the weight sparsity term in the objective function that is minimized in the LDA. We develop an iterative algorithm for computing such sparse and orthogonal vectors in the LDA. Experiments on real data sets have shown that the new algorithm can generate better classification results and identify relevant dimensions. Copyright (C) 2010 John Wiley & Sons, Ltd.
引用
收藏
页码:223 / 235
页数:13
相关论文
共 50 条
  • [1] Graph-based sparse linear discriminant analysis for high-dimensional classification
    Liu, Jianyu
    Yu, Guan
    Liu, Yufeng
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 171 : 250 - 269
  • [2] Modified linear discriminant analysis approaches for classification of high-dimensional microarray data
    Xu, Ping
    Brock, Guy N.
    Parrish, Rudolph S.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (05) : 1674 - 1687
  • [3] A modified linear discriminant analysis for high-dimensional data
    Hyodo, Masashi
    Yamada, Takayuki
    Himeno, Tetsuto
    Seo, Takashi
    HIROSHIMA MATHEMATICAL JOURNAL, 2012, 42 (02) : 209 - 231
  • [4] A Hybrid Dimension Reduction Based Linear Discriminant Analysis for Classification of High-Dimensional Data
    Zorarpaci, Ezgi
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 1028 - 1036
  • [5] SPARSE LINEAR DISCRIMINANT ANALYSIS BY THRESHOLDING FOR HIGH DIMENSIONAL DATA
    Shao, Jun
    Wang, Yazhen
    Deng, Xinwei
    Wang, Sijian
    ANNALS OF STATISTICS, 2011, 39 (02): : 1241 - 1265
  • [6] Generalized Linear Discriminant Analysis for High-Dimensional Genomic Data
    Li, Sisi
    Lewinger, Juan Pablo
    GENETIC EPIDEMIOLOGY, 2017, 41 (07) : 704 - 704
  • [7] Generalized linear discriminant analysis for high-dimensional genomic data
    Li, Sisi
    Lewinger, Juan Pablo
    GENETIC EPIDEMIOLOGY, 2018, 42 (07) : 713 - 713
  • [8] Optimal Linear Discriminant Analysis for High-Dimensional Functional Data
    Xue, Kaijie
    Yang, Jin
    Yao, Fang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (546) : 1055 - 1064
  • [9] Weighted linear programming discriminant analysis for high-dimensional binary classification
    Wu, Yufei
    Yu, Guan
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (05) : 437 - 450
  • [10] AN EFFICIENT GREEDY SEARCH ALGORITHM FOR HIGH-DIMENSIONAL LINEAR DISCRIMINANT ANALYSIS
    Yang, Hannan
    Lin, Danyu
    Li, Quefeng
    STATISTICA SINICA, 2023, 33 : 1343 - 1364