A method of dimensionality reduction by selection of components in principal component analysis for text classification

Cited by: 5
Authors
Zhang, Yangwu [1, 2]
Li, Guohe [1, 3]
Zong, Heng [2]
Affiliations
[1] China Univ Petr, Coll Geophys & Informat Engn, Beijing, Peoples R China
[2] China Univ Polit Sci & Law, Dept Sci & Technol Teaching, Beijing, Peoples R China
[3] China Univ Petr, Beijing Key Lab Data Min Petr Data, Beijing, Peoples R China
Keywords
Principal components analysis; Dimensionality reduction; Text classification
DOI
10.2298/FIL1805499Z
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Dimensionality reduction, which includes feature extraction and feature selection, is a key step in text classification. In this paper, we propose a hybrid dimensionality-reduction method that combines principal component analysis (PCA) with a subsequent selection of components. PCA is a feature-extraction method. Not all of its components contribute to classification, because the PCA objective is not a form of discriminant analysis (see, e.g., Jolliffe, 2002). We therefore present a component-selection function that returns the components useful for classification, judged by performance indicators measured on different subsets of the components. Compared with traditional feature-selection methods, SVM classifiers trained on the selected components show improved classification performance and reduced computational overhead.
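The idea in the abstract, that high-variance components need not be the discriminative ones, can be illustrated with a minimal sketch. This is not the authors' selection function: the abstract does not specify the search strategy or the performance indicators, so the greedy forward search, the validation-accuracy criterion, the synthetic data, and all function names below are assumptions. A nearest-centroid classifier stands in for the SVM to keep the example dependent on NumPy only.

```python
import numpy as np

def fit_pca(X):
    """Return the training mean and all principal axes (rows, sorted by variance)."""
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt

def centroid_accuracy(z_tr, y_tr, z_va, y_va):
    """Validation accuracy of a nearest-centroid classifier (stand-in for the SVM)."""
    classes = np.unique(y_tr)
    centroids = np.stack([z_tr[y_tr == c].mean(axis=0) for c in classes])
    dists = ((z_va[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return float((classes[dists.argmin(axis=1)] == y_va).mean())

def select_components(z_tr, y_tr, z_va, y_va):
    """Greedy forward search (an assumed strategy, not taken from the paper):
    repeatedly add the component that gives the best validation accuracy and
    stop as soon as no remaining component improves it."""
    remaining = list(range(z_tr.shape[1]))
    selected, best = [], 0.0
    while remaining:
        candidates = (
            (centroid_accuracy(z_tr[:, selected + [k]], y_tr,
                               z_va[:, selected + [k]], y_va), k)
            for k in remaining
        )
        # Prefer higher accuracy; break ties in favour of the lower index.
        acc, j = max(candidates, key=lambda t: (t[0], -t[1]))
        if acc <= best:
            break
        selected.append(j)
        remaining.remove(j)
        best = acc
    return selected, best

# Hypothetical synthetic data: the largest-variance feature carries no label
# information, while a smaller-variance feature separates the two classes.
rng = np.random.default_rng(0)

def make_split(n):
    y = np.arange(n) % 2
    X = np.column_stack([
        rng.normal(0.0, 10.0, n),                # large variance, unrelated to the label
        (2 * y - 1) + rng.normal(0.0, 0.2, n),   # smaller variance, separates the classes
        rng.normal(0.0, 0.1, n),                 # tiny-variance noise
    ])
    return X, y

X_tr, y_tr = make_split(200)
X_va, y_va = make_split(100)

mean, axes = fit_pca(X_tr)
z_tr = (X_tr - mean) @ axes.T   # project with the axes fitted on training data only
z_va = (X_va - mean) @ axes.T

selected, best_acc = select_components(z_tr, y_tr, z_va, y_va)
print("selected components:", selected, "validation accuracy:", best_acc)
```

On this toy data the first principal component mostly captures the uninformative high-variance feature, so a variance-ranked truncation would keep it, whereas the performance-driven search is expected to favour the second component. In practice the validation split used for selection should be kept separate from the data used for the final evaluation.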
Pages: 1499-1506
Number of pages: 8