Iterative threshold-based Naive Bayes classifier

Cited by: 1
Authors
Romano, Maurizio [1]
Zammarchi, Gianpaolo [1]
Conversano, Claudio [1]
Affiliations
[1] Univ Cagliari, Dept Econ & Business Sci, Viale Fra Ignazio 17, I-09123 Cagliari, Italy
Source
STATISTICAL METHODS AND APPLICATIONS | 2024 / Vol. 33 / Issue 01
Keywords
Naive Bayes; Post-hoc analysis; Customer satisfaction; Sentiment analysis; Natural language processing; Booking.com; Reviews
DOI
10.1007/s10260-023-00721-1
Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
The iterative Threshold-based Naive Bayes (iTb-NB) classifier is introduced as a simple, improved version of the earlier non-iterative Threshold-based Naive Bayes (Tb-NB) classifier. Starting from a natural-language text corpus, iTb-NB lets the user quantify the sentiment (positive or negative) of a specific text as a numeric value. Unlike Tb-NB, iTb-NB estimates multiple threshold values that together refine Tb-NB's decision rules for classifying a text as positive or negative based on its content. Observations whose sentiment scores lie close to the threshold are marked for reclassification, and a new decision rule is defined for them. This "iterative" process improves the quality of predictions with respect to Tb-NB while still allowing its results to be used as the input of post-hoc analyses. The effectiveness of iTb-NB is evaluated by analyzing hotel guests' reviews of all hotels located in the Sardinia region and available on Booking.com. Furthermore, iTb-NB is compared with Tb-NB in terms of model accuracy, resistance to noise, and computational efficiency.
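The abstract describes the method only at a high level. As a rough illustration of the general idea (score a text, compare the score with a threshold, then apply a second rule to texts whose score falls close to that threshold), here is a minimal Python sketch. Everything in it is an assumption made for illustration: the Laplace-smoothed log-odds score, the accuracy-maximizing threshold search, the fixed `band` width, and all function names are hypothetical and are not taken from the paper. The published iTb-NB algorithm may differ in each of these respects.

```python
import math
from collections import Counter

def word_log_odds(docs, labels, alpha=1.0):
    """Laplace-smoothed log-odds of a word occurring in a positive vs. negative text."""
    pos_docs = sum(labels)
    neg_docs = len(labels) - pos_docs
    pos_counts, neg_counts = Counter(), Counter()
    for tokens, y in zip(docs, labels):
        # Count each word once per document (presence, not frequency).
        (pos_counts if y else neg_counts).update(set(tokens))
    vocab = set(pos_counts) | set(neg_counts)
    return {
        w: math.log((pos_counts[w] + alpha) / (pos_docs + 2 * alpha))
        - math.log((neg_counts[w] + alpha) / (neg_docs + 2 * alpha))
        for w in vocab
    }

def score(tokens, log_odds):
    """Sentiment score: sum of log-odds over the distinct known words of a text."""
    return sum(log_odds.get(w, 0.0) for w in set(tokens))

def best_threshold(scores, labels):
    """Pick the threshold with the highest training accuracy (ties: lowest candidate)."""
    s = sorted(set(scores))
    candidates = [s[0] - 1.0] + [(a + b) / 2 for a, b in zip(s, s[1:])] + [s[-1] + 1.0]
    def correct(t):
        return sum((x > t) == bool(y) for x, y in zip(scores, labels))
    return max(candidates, key=correct)

def fit(docs, labels, band=0.5):
    """First pass: global threshold. Second pass: threshold for near-threshold texts only."""
    log_odds = word_log_odds(docs, labels)
    scores = [score(d, log_odds) for d in docs]
    tau = best_threshold(scores, labels)
    near = [(s, y) for s, y in zip(scores, labels) if abs(s - tau) <= band]
    tau_near = best_threshold(*zip(*near)) if near else tau
    return log_odds, tau, tau_near, band

def predict(tokens, model):
    """Return (score, is_positive), using the second rule inside the band around tau."""
    log_odds, tau, tau_near, band = model
    s = score(tokens, log_odds)
    t = tau_near if abs(s - tau) <= band else tau
    return s, s > t

# Toy, made-up training data (1 = positive, 0 = negative).
train = [
    ("great clean room", 1),
    ("friendly staff great view", 1),
    ("clean quiet friendly", 1),
    ("dirty room rude staff", 0),
    ("noisy dirty bathroom", 0),
    ("rude noisy terrible", 0),
]
docs = [text.split() for text, _ in train]
labels = [y for _, y in train]
model = fit(docs, labels)

for text in ["great friendly staff", "dirty noisy"]:
    s, positive = predict(text.split(), model)
    print(f"{text!r}: score={s:.3f}, positive={positive}")
```

Because the numeric score is kept alongside the predicted label, it remains available for later inspection, which loosely mirrors the abstract's point about reusing results in post-hoc analyses.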
Pages: 235-265
Number of pages: 31