Deep Ensemble Network for Sentiment Analysis in Bi-lingual Low-resource Languages

Cited by: 7
Authors
Roy, Pradeep Kumar [1]
Affiliation
[1] Indian Institute of Information Technology, Department of Computer Science and Engineering, Surat 394190, Gujarat, India
Keywords
Sentiment analysis; code-mixed; transformer; BERT; Kannada; Malayalam; ensemble learning; deep learning; machine learning
DOI
10.1145/3600229
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sentiment analysis (SA) is the systematic identification, extraction, quantification, and study of affective states and subjective information using natural language processing (NLP). It is widely used to analyze user feedback, such as reviews or social media posts. Recently, SA has become one of the most popular research domains in NLP due to its wide range of applications, including e-commerce, healthcare, the hotel business, and others. Many machine learning and deep learning-based models exist to predict the sentiment of a user's post. However, sentiment analysis in low-resource languages such as Kannada, Malayalam, Telugu, and Tamil has received less attention due to language complexity and the limited availability of required resources. This research addresses that gap by proposing an ensemble model for predicting the sentiment of code-mixed Kannada and Malayalam text. The ensemble of transformer-based models achieved a promising weighted F1-score of 0.66 on the Kannada code-mixed dataset. In contrast, the ensemble model built on the deep learning framework performed best on the Malayalam dataset, achieving a weighted F1-score of 0.72 and outperforming existing research.
Pages: 16