A Bayesian Classification Approach Using Class-Specific Features for Text Categorization

被引:86
|
作者
Tang, Bo [1 ]
He, Haibo [1 ]
Baggenstoss, Paul M. [2 ]
Kay, Steven [1 ]
机构
[1] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
[2] Frauhnhofer FKIE, Fraunhoferstr 20, D-53343 Wachtberg, Germany
基金
美国国家科学基金会;
关键词
Feature selection; text categorization; class-specific features; PDF projection and estimation; naive Bayes; dimension reduction; SELECTION;
D O I
10.1109/TKDE.2016.2522427
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a Bayesian classification approach for automatic text categorization using class-specific features. Unlike conventional text categorization approaches, our proposed method selects a specific feature subset for each class. To apply these class-specific features for classification, we follow Baggenstoss's PDF Projection Theorem (PPT) to reconstruct the PDFs in raw data space from the class-specific PDFs in low-dimensional feature subspace, and build a Bayesian classification rule. One noticeable significance of our approach is that most feature selection criteria, such as Information Gain (IG) and Maximum Discrimination (MD), can be easily incorporated into our approach. We evaluate our method's classification performance on several real-world benchmarks, compared with the state-of-the-art feature selection approaches. The superior results demonstrate the effectiveness of the proposed approach and further indicate its wide potential applications in data mining.
引用
收藏
页码:1602 / 1606
页数:5
相关论文
共 50 条
  • [1] Class-Specific Features Using J48 Classifier for Text Classification
    Patil, Rupali
    Barkade, V. M.
    2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [2] Object Categorization Using Class-Specific Representations
    Zhang, Chunjie
    Cheng, Jian
    Li, Liang
    Li, Changsheng
    Tian, Qi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 4528 - 4534
  • [3] Joint segmentation and classification of time series using class-specific features
    Wang, ZJ
    Willett, P
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (02): : 1056 - 1067
  • [4] Generalized Gaussian mixture models as a nonparametric Bayesian approach for clustering using class-specific visual features
    Elguebaly, Tarek
    Bouguila, Nizar
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2012, 23 (08) : 1199 - 1212
  • [5] EEF: Exponentially Embedded Families With Class-Specific Features for Classification
    Tang, Bo
    Kay, Steven
    He, Haibo
    Baggenstoss, Paul M.
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (07) : 969 - 973
  • [6] Towards Effective Image Classification Using Class-Specific Codebooks and Distinctive Local Features
    Altintakan, Umit Lutfu
    Yazici, Adnan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (03) : 323 - 332
  • [7] HIERARCHICAL CLASSIFICATION OF HEP-2 CELL IMAGES USING CLASS-SPECIFIC FEATURES
    Gupta, Vibha
    Gupta, Krati
    Bhavsar, Arnav
    Sao, Anil K.
    PROCEEDINGS OF THE 2016 6TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP), 2016,
  • [8] Using a Bayesian network induction approach for text categorization
    Lam, W
    Low, KF
    Ho, CY
    IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 745 - 750
  • [9] Audio Classification Using Class-Specific Learned Descriptors
    Sonowal, Sukanya
    Sandhan, Tushar
    Choi, Inkyu
    Kim, Nam Soo
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 484 - 487
  • [10] Speech music discrimination using class-specific features
    Beierholm, T
    Baggenstoss, PM
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 379 - 382