Dimensionality reduction by combining category information and latent semantic index for text categorization

被引:0
作者
Zheng, Wenbin [1 ]
An, Lixin [1 ,2 ]
Xu, Zhanyi [1 ]
机构
[1] College of Information Engineering, China Jiliang University
[2] College of Textiles, Donghua University
来源
Journal of Information and Computational Science | 2013年 / 10卷 / 08期
关键词
Category information; Dimensionality reduction; Latent semantic indexing; Text categorization;
D O I
10.12733/jics20101814
中图分类号
学科分类号
摘要
The Latent Semantic Indexing (LSI) is a commonly used dimensionality reduction methods in text categorization; however, as a linear reconstructed method, its goal is to obtain the optimal representative feature rather than the optimal classification feature. This paper proposes a novel method in which the categorization information is combined into the latent semantic indexing to obtain more discriminating features than the standard latent semantic indexing. The experimental results show that the proposed method achieves good performance on two benchmark data sets, especially in the case where the dimensionality is greatly reduced. Copyright © 2013 Binary Information Press.
引用
收藏
页码:2463 / 2469
页数:6
相关论文
共 22 条