A Bayesian Classification Approach Using Class-Specific Features for Text Categorization

被引:86
|
作者
Tang, Bo [1 ]
He, Haibo [1 ]
Baggenstoss, Paul M. [2 ]
Kay, Steven [1 ]
机构
[1] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
[2] Frauhnhofer FKIE, Fraunhoferstr 20, D-53343 Wachtberg, Germany
基金
美国国家科学基金会;
关键词
Feature selection; text categorization; class-specific features; PDF projection and estimation; naive Bayes; dimension reduction; SELECTION;
D O I
10.1109/TKDE.2016.2522427
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a Bayesian classification approach for automatic text categorization using class-specific features. Unlike conventional text categorization approaches, our proposed method selects a specific feature subset for each class. To apply these class-specific features for classification, we follow Baggenstoss's PDF Projection Theorem (PPT) to reconstruct the PDFs in raw data space from the class-specific PDFs in low-dimensional feature subspace, and build a Bayesian classification rule. One noticeable significance of our approach is that most feature selection criteria, such as Information Gain (IG) and Maximum Discrimination (MD), can be easily incorporated into our approach. We evaluate our method's classification performance on several real-world benchmarks, compared with the state-of-the-art feature selection approaches. The superior results demonstrate the effectiveness of the proposed approach and further indicate its wide potential applications in data mining.
引用
收藏
页码:1602 / 1606
页数:5
相关论文
共 50 条
  • [31] Mining for class-specific motifs in protein sequence classification
    Srinivasan, Satish M.
    Vural, Suleyman
    King, Brian R.
    Guda, Chittibabu
    BMC BIOINFORMATICS, 2013, 14
  • [32] Class-Specific Sparse Principal Component Analysis for Visual Classification
    Pan, Fei
    Pan, Fei
    Zhang, Zai-Xu
    Liu, Bao-Di
    Xie, Ji-Jun
    IEEE Access, 2020, 8 : 110033 - 110047
  • [33] Automatic Arabic Text Categorization using Bayesian Learning
    Kadhim, Mahmood H.
    Omar, Nazlia
    2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, : 415 - 419
  • [34] Android Applications Categorization Using Bayesian Classification
    Yuan, Cangzhou
    Wei, Shenhong
    Wang, Yutong
    Yue, You
    ZiLiang, ShangGuan
    2016 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY PROCEEDINGS - CYBERC 2016, 2016, : 173 - 176
  • [35] A class-specific feature selection and classification approach using neighborhood rough set and K-nearest neighbor theories
    Sewwandi M.A.N.D.
    Li Y.
    Zhang J.
    Applied Soft Computing, 2023, 143
  • [36] Adversarial Imbalance Classification with Class-specific Diverse Instance Generation
    Shen, Qihang
    Wang, Xinyue
    Cai, Zixin
    Jing, Liping
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [37] Iris recognition using class-specific dictionaries
    Naseem, Imran
    Aleem, Affan
    Togneri, Roberto
    Bennamoun, Mohammed
    COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 178 - 193
  • [38] Automated Classification of Signals with Duration-Dependent Segments via Class-Specific Features and Gibbs Sampling
    Sun, Yan
    Willett, Peter
    2012 IEEE AEROSPACE CONFERENCE, 2012,
  • [39] Class-Specific Model Mixtures for the Classification of Acoustic Time Series
    Baggenstoss, Paul M.
    Harrison, Brian F.
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2016, 52 (04) : 1937 - 1952
  • [40] Improved class-specific vector for biomedical question type classification
    Gupta, Tanu
    Kumar, Ela
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2023, 26 (02) : 182 - 191