Analyzing the Performance of BERT for the Sentiment Classification Task in Bengali Text

被引:0
作者
Banshal, Sumit Kumar [1 ]
Uddin, Ashraf [2 ]
Piryani, Rajesh [3 ]
机构
[1] Alliance Univ, Dept Comp Sci & Engn, Bangalore, Karnataka, India
[2] Amer Int Univ Bangladesh, Dhaka, Bangladesh
[3] Univ Toulouse III Paul Sabatier UT3, IRIT, Toulouse, France
来源
ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT III | 2024年 / 2092卷
关键词
Sentiment analysis; NLP; BERT; Bengali textual data;
D O I
10.1007/978-3-031-64070-4_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent era has seen significant growth of technologies in the field of Natural Language Processing (NLP). But the scarce resource languages like Bengali have not got much attention from the research community. The BERT language model has laid a very positive impact on the performance of the NLP tasks. Although several others language models came into the scenario, we investigate the performance of BERT model and other conventional methods for the sentiment classification task in Bengali text. The obtained result shows that BERT overperformed other conventional machine learning and lexicon-based methods in all aspects of the performance metrics. Along with BERT, conventional methods namely Logistic Regression, Decision Tree, SVM, Random Forest, Naive Bayes and Neural Network were implemented. Besides these methods a lexicon-based approach was used to see the overall variation in the results. The lexicon resource for Benali was created for this implementation.
引用
收藏
页码:273 / 285
页数:13
相关论文
共 65 条
  • [21] Das A., 2021, AISC, V1324, P1124, DOI [10.1007/978-3-030-, DOI 10.1007/978-3-030]
  • [22] Dawn I., 2020, ICIMSAT 2019. Learning and Analytics in Intelligent Systems, V12, P761, DOI [10.1007/978-3-030-42363-689, DOI 10.1007/978-3-030-42363-689]
  • [23] Deshpande M, 2017, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SUSTAINABLE SYSTEMS (ICISS 2017), P858, DOI 10.1109/ISS1.2017.8389299
  • [24] Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, DOI 10.48550/ARXIV.1810.04805]
  • [25] Dey R.C., 2019, 2019 22 INT C COMP I, DOI DOI 10.1109/ICCIT48885.2019.9038250
  • [26] Ascertaining polarity of public opinions on Bangladesh cricket using machine learning techniques
    Faruque, M. Abdullah
    Rahman, Saifur
    Chakraborty, Partha
    Choudhury, Tanupriya
    Um, Jung-Sup
    Singh, Thipendra Pal
    [J]. SPATIAL INFORMATION RESEARCH, 2022, 30 (01) : 1 - 8
  • [27] Ghosal T, 2015, ANNU IEEE IND CONF
  • [28] Hossain I., 2019, 2019 INT C ELECT COM, P1, DOI DOI 10.1109/ECACE.2019.8679144
  • [29] Hossain M., 2019, Adv. Intell. Syst. Comput., V882, P513, DOI [10.1007/978-981-13-5953-8_43, DOI 10.1007/978-981-13-5953-8_43]
  • [30] Hossain M.S., 2017, Sentiment analysis for Bengali newspaper headlines