Analyzing the Performance of BERT for the Sentiment Classification Task in Bengali Text

被引:0
作者
Banshal, Sumit Kumar [1 ]
Uddin, Ashraf [2 ]
Piryani, Rajesh [3 ]
机构
[1] Alliance Univ, Dept Comp Sci & Engn, Bangalore, Karnataka, India
[2] Amer Int Univ Bangladesh, Dhaka, Bangladesh
[3] Univ Toulouse III Paul Sabatier UT3, IRIT, Toulouse, France
来源
ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT III | 2024年 / 2092卷
关键词
Sentiment analysis; NLP; BERT; Bengali textual data;
D O I
10.1007/978-3-031-64070-4_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent era has seen significant growth of technologies in the field of Natural Language Processing (NLP). But the scarce resource languages like Bengali have not got much attention from the research community. The BERT language model has laid a very positive impact on the performance of the NLP tasks. Although several others language models came into the scenario, we investigate the performance of BERT model and other conventional methods for the sentiment classification task in Bengali text. The obtained result shows that BERT overperformed other conventional machine learning and lexicon-based methods in all aspects of the performance metrics. Along with BERT, conventional methods namely Logistic Regression, Decision Tree, SVM, Random Forest, Naive Bayes and Neural Network were implemented. Besides these methods a lexicon-based approach was used to see the overall variation in the results. The lexicon resource for Benali was created for this implementation.
引用
收藏
页码:273 / 285
页数:13
相关论文
共 65 条
  • [1] Akanda Wahiduzzaman, 2021, Proceedings of 2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD), P466, DOI 10.1109/ICICT4SD50815.2021.9396882
  • [2] Akhtar MS, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P370
  • [3] Improving Word Embedding Coverage in Less-Resourced Languages Through Multi-Linguality and Cross-Linguality: A Case Study with Aspect-Based Sentiment Analysis
    Akhtar, Md Shad
    Sawant, Palaash
    Sen, Sukanta
    Ekbal, Asif
    Bhattacharyya, Pushpak
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (02)
  • [4] Al-Amin M, 2017, 2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION ENGINEERING (ECCE), P186, DOI 10.1109/ECACE.2017.7912903
  • [5] Alam F., 2021, A review of Bangla natural language processing tasks and the utility of transformer models
  • [6] Sentiment Analysis of Iraqi Arabic Dialect on Facebook Based on Distributed Representations of Documents
    Alnawas, Anwar
    Arici, Nursal
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (03)
  • [7] Restaurant recommender system based on sentiment analysis
    Asani, Elham
    Vahdat-Nejad, Hamed
    Sadri, Javad
    [J]. MACHINE LEARNING WITH APPLICATIONS, 2021, 6
  • [8] Banik N, 2018, Evaluation of Naive Bayes and support vector machines on Bangla textual movie reviews, DOI [10.1109/icbslp.2018.8554497, DOI 10.1109/ICBSLP.2018.8554497]
  • [9] Banik Nayan, 2019, 1 INT C ADV SCI ENG, P1, DOI DOI 10.1109/ICASERT.2019.8934481
  • [10] Baruah A., 2020, P 2 WORKSH TROLL AGG, P76