Analyzing the Performance of BERT for the Sentiment Classification Task in Bengali Text

被引:0
作者
Banshal, Sumit Kumar [1 ]
Uddin, Ashraf [2 ]
Piryani, Rajesh [3 ]
机构
[1] Alliance Univ, Dept Comp Sci & Engn, Bangalore, Karnataka, India
[2] Amer Int Univ Bangladesh, Dhaka, Bangladesh
[3] Univ Toulouse III Paul Sabatier UT3, IRIT, Toulouse, France
来源
ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT III | 2024年 / 2092卷
关键词
Sentiment analysis; NLP; BERT; Bengali textual data;
D O I
10.1007/978-3-031-64070-4_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent era has seen significant growth of technologies in the field of Natural Language Processing (NLP). But the scarce resource languages like Bengali have not got much attention from the research community. The BERT language model has laid a very positive impact on the performance of the NLP tasks. Although several others language models came into the scenario, we investigate the performance of BERT model and other conventional methods for the sentiment classification task in Bengali text. The obtained result shows that BERT overperformed other conventional machine learning and lexicon-based methods in all aspects of the performance metrics. Along with BERT, conventional methods namely Logistic Regression, Decision Tree, SVM, Random Forest, Naive Bayes and Neural Network were implemented. Besides these methods a lexicon-based approach was used to see the overall variation in the results. The lexicon resource for Benali was created for this implementation.
引用
收藏
页码:273 / 285
页数:13
相关论文
共 64 条
[1]  
Akanda Wahiduzzaman, 2021, Proceedings of 2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD), P466, DOI 10.1109/ICICT4SD50815.2021.9396882
[2]  
Akhtar MS, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P370
[3]   Improving Word Embedding Coverage in Less-Resourced Languages Through Multi-Linguality and Cross-Linguality: A Case Study with Aspect-Based Sentiment Analysis [J].
Akhtar, Md Shad ;
Sawant, Palaash ;
Sen, Sukanta ;
Ekbal, Asif ;
Bhattacharyya, Pushpak .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (02)
[4]  
Al-Amin M, 2017, 2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION ENGINEERING (ECCE), P186, DOI 10.1109/ECACE.2017.7912903
[5]  
Alam F., 2021, A review of Bangla natural language processing tasks and the utility of transformer models
[6]   Big Data with Integrated Cloud Computing for Prediction of Health Conditions [J].
Alamri, Abdullah .
2019 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON), 2019, :40-45
[7]   Sentiment Analysis of Iraqi Arabic Dialect on Facebook Based on Distributed Representations of Documents [J].
Alnawas, Anwar ;
Arici, Nursal .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (03)
[8]   Restaurant recommender system based on sentiment analysis [J].
Asani, Elham ;
Vahdat-Nejad, Hamed ;
Sadri, Javad .
MACHINE LEARNING WITH APPLICATIONS, 2021, 6
[9]  
Banik N., 2019, 2019 1 INT C ADV SCI, P1, DOI DOI 10.1109/ICASERT.2019.8934481
[10]  
Banik N, 2018, Evaluation of Naive Bayes and support vector machines on Bangla textual movie reviews, DOI [10.1109/icbslp.2018.8554497, DOI 10.1109/ICBSLP.2018.8554497]