Dimensionality Reduction with Category Information Fusion and Non-negative Matrix Factorization for Text Categorization

被引:0
|
作者
Zheng, Wenbin [1 ,2 ]
Qian, Yuntao [1 ]
Tang, Hong [3 ,4 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310003, Zhejiang, Peoples R China
[2] China Jiliang Univ, Coll Informat Engn, Hangzhou 310003, Zhejiang, Peoples R China
[3] Zhejiang Univ, Sch Aeronaut & Astronaut, Hangzhou 310003, Zhejiang, Peoples R China
[4] China Jiliang Univ, Coll Metrol Technol & Engn, Hangzhou, Peoples R China
来源
ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT III | 2011年 / 7004卷
关键词
Text Categorization; Dimensionality reduction; Non-negative Matrix Factorization; Category Fusion; CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction can efficiently improve computing performance of classifiers in text categorization, and non-negative matrix factorization could map the high dimensional term space into a low dimensional semantic subspace easily. Meanwhile, the non-negative of the basis vectors could provide a meaningful explanation for the semantic subspace. However, it usually could not achieve a satisfied classification performance because it is sensitive to the noise, data missing and outlier as a linear reconstruction method. This paper proposes a novel approach in which the train text and its category information are fused and a transformation matrix that maps the term space into a semantic subspace is obtained by a basis orthogonality non-negative matrix factorization and truncation. Finally, the dimensionality can be reduced aggressively with these transformations. Experimental results show that the proposed approach remains a good classification performance in a very low dimensional case.
引用
收藏
页码:505 / +
页数:2
相关论文
共 50 条
  • [1] Dimensionality reduction using non-negative matrix factorization for information retrieval
    Tsuge, S
    Shishibori, M
    Kuroiwa, S
    Kita, K
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 960 - 965
  • [2] Structure preserving non-negative matrix factorization for dimensionality reduction
    Li, Zechao
    Liu, Jing
    Lu, Hanqing
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (09) : 1175 - 1189
  • [3] Dimensionality Reduction for Histogram Features Based on Supervised Non-negative Matrix Factorization
    Ambai, Mitsuru
    Utama, Nugraha P.
    Yoshida, Yuichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10) : 1870 - 1879
  • [4] Non-negative matrix factorization with local preservation for hyperspectral image dimensionality reduction
    Xiao, Zhiyong
    REMOTE SENSING LETTERS, 2014, 5 (09) : 793 - 802
  • [5] Dimensionality reduction by combining category information and latent semantic index for text categorization
    Zheng, Wenbin
    An, Lixin
    Xu, Zhanyi
    Journal of Information and Computational Science, 2013, 10 (08): : 2463 - 2469
  • [6] Non-negative Matrix Factorization: A Survey
    Gan, Jiangzhang
    Liu, Tong
    Li, Li
    Zhang, Jilian
    COMPUTER JOURNAL, 2021, 64 (07) : 1080 - 1092
  • [7] Non-Negative Matrix Factorization with Auxiliary Information on Overlapping Groups
    Shiga, Motoki
    Mamitsuka, Hiroshi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (06) : 1615 - 1628
  • [8] A constrained non-negative matrix factorization in information retrieval
    Xu, BW
    Lu, JJ
    Huang, GS
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2003, : 273 - 277
  • [9] Initialization enhancer for non-negative matrix factorization
    Zheng, Zhonglong
    Yang, Jie
    Zhu, Yitan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2007, 20 (01) : 101 - 110
  • [10] Study on Text Classification Algorithm Based on Non-negative Matrix Factorization
    Jing, Yongxia
    Gou, Heping
    Fu, Chuanyi
    Liu, Qiang
    2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2017, : 484 - 487