Classification optimization for training a large dataset with Naïve Bayes

被引:0
|
作者
Thi Thanh Sang Nguyen
Pham Minh Thu Do
机构
[1] International University – Vietnam National University,School of Computer Science and Engineering
来源
Journal of Combinatorial Optimization | 2020年 / 40卷
关键词
Data mining; Naïve Bayes; Word embedding; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
Book classification is very popular in digital libraries. Book rating prediction is crucial to improve the care of readers. The commonly used techniques are decision tree, Naïve Bayes (NB), neural networks, etc. Moreover, mining book data depends on feature selection, data pre-processing, and data preparation. This paper proposes the solutions of knowledge representation optimization as well as feature selection to enhance book classification and point out appropriate classification algorithms. Several experiments have been conducted and it has been found that NB could provide best prediction results. The accuracy and performance of NB can be improved and outperform other classification algorithms by applying appropriate strategies of feature selections, data type selection as well as data transformation.
引用
收藏
页码:141 / 169
页数:28
相关论文
共 50 条
  • [1] Classification optimization for training a large dataset with Naive Bayes
    Thi Thanh Sang Nguyen
    Pham Minh Thu Do
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2020, 40 (01) : 141 - 169
  • [2] Na?ve Bayes Algorithm for Large Scale Text Classification
    Pirunthavi SIVAKUMAR
    Jayalath EKANAYAKE
    Instrumentation, 2021, 8 (04) : 55 - 62
  • [3] Privacy-preserving Naïve Bayes classification
    Jaideep Vaidya
    Murat Kantarcıoğlu
    Chris Clifton
    The VLDB Journal, 2008, 17 : 879 - 898
  • [4] Extended naïve bayes for group based classification
    Samsudin, Noor Azah
    Bradley, Andrew P.
    Advances in Intelligent Systems and Computing, 2014, 287 : 497 - 506
  • [5] Classification using Hierarchical Naïve Bayes models
    Helge Langseth
    Thomas D. Nielsen
    Machine Learning, 2006, 63 : 135 - 159
  • [6] Spam message classification based on the naïve Bayes classification algorithm
    Ning, Bin
    Junwei, Wu
    Feng, Hu
    IAENG International Journal of Computer Science, 2019, 46 (01)
  • [7] Text classification: Naïve bayes classifier with sentiment Lexicon
    Le, Cong-Cuong
    Prasad, P.W.C.
    Alsadoon, Abeer
    Pham, L.
    Elchouemi, A.
    IAENG International Journal of Computer Science, 2019, 46 (02) : 141 - 148
  • [8] Constrained Naïve Bayes with application to unbalanced data classification
    Rafael Blanquero
    Emilio Carrizosa
    Pepa Ramírez-Cobo
    M. Remedios Sillero-Denamiel
    Central European Journal of Operations Research, 2022, 30 : 1403 - 1425
  • [9] Naïve Bayes Classifiers and accompanying dataset for Pseudomonas syringae isolate characterization
    Chad Fautt
    Estelle Couradeau
    Kevin L. Hockett
    Scientific Data, 11
  • [10] Naïve Bayes Classifiers and accompanying dataset for Pseudomonas syringae isolate characterization
    Fautt, Chad
    Couradeau, Estelle
    Hockett, Kevin L.
    SCIENTIFIC DATA, 2024, 11 (01)