An efficient parameter estimation method for generalized Dirichlet priors in naive Bayesian classifiers with multinomial models

被引：7

作者：

Wong, Tzu-Tsung ^{[1
]}

Liu, Chao-Rui ^{[1
]}

机构：

[1] Natl Cheng Kung Univ, Inst Informat Management, 1 Ta Sheuh Rd, Tainan 701, Taiwan

来源：

PATTERN RECOGNITION | 2016年 / 60卷

关键词：

Covariance matrix; Document classification; Generalized Dirichlet distribution; Multinomial model; Naive Bayesian classifier;

D O I：

10.1016/j.patcog.2016.04.019

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Generalized Dirichlet priors have been shown to be an effective way for improving the performance of naive Bayesian classifiers with multinomial models, called multinomial naive Bayesian classifiers, in document classification. For the sake of computational efficiency, a previous study divided distinct words into groups, and proposed a searching mechanism to determine the values of the parameters in a generalized Dirichlet prior group by group. That searching approach increases the computational cost of the multinomial naive Bayesian classifier. In this paper, the covariance matrices for word groups are first calculated from available documents. A parameter estimation method and four strategies for choosing the value of a parameter corresponding to a word group are then proposed to solve for the parameters of the noninformative generalized Dirichlet priors for distinct words. The experimental results on two document sets show that the best strategy is to choose the largest value calculated from the statistics in a row, and that our parameter estimation method can efficiently solve for the parameters of generalized Dirichlet priors to significantly improve the performance of the multinomial naive Bayesian classifier with respect to the searching approach. (C) 2016 Elsevier Ltd. All rights reserved.

引用

页码：62 / 71

页数：10

共 15 条

[1] Intelligent Naive Bayes-based approaches for Web proxy caching [J].

Ali, Waleed ;

Shamsuddin, Siti Mariyam ;

Ismail, Abdul Samad .

KNOWLEDGE-BASED SYSTEMS, 2012, 31 :162-175

[2]

[Anonymous], 1998, LEARNING TEXT CATEGO

[3] Feature selection for text classification with Naive Bayes [J].

Chen, Jingnian ;

Huang, Houkuan ;

Tian, Shengfeng ;

Qu, Youli .

EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :5432-5435

[4] CONCEPTS OF INDEPENDENCE FOR PROPORTIONS WITH A GENERALIZATION OF DIRICHLET DISTRIBUTION [J].

CONNOR, RJ ;

MOSIMANN, JE .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1969, 64 (325) :194-&

[5] Some effective techniques for naive Bayes text classification [J].

Kim, Sang-Bum ;

Han, Kyoung-Soo ;

Rim, Hae-Chang ;

Myaeng, Sung Hyon .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (11) :1457-1466

[6]

Kolcz A, 2007, KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P400

[7]

Lang K., 1995, Machine Learning. Proceedings of the Twelfth International Conference on Machine Learning, P331

[8] AN ALGORITHM FOR SUFFIX STRIPPING [J].

PORTER, MF .

PROGRAM-AUTOMATED LIBRARY AND INFORMATION SYSTEMS, 1980, 14 (03) :130-137

[9]

Rennie JD. M., 1973, Proceedings of the Twentieth International Conference on Machine Learning (ICML)-2003), V20, P616, DOI [10.1186/1477-3155-8-16, DOI 10.1186/1477-3155-8-16]

[10]

Schneider KM, 2003, EACL 2003: 10TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P307

← 1 2 →