RETRACTED: An ensemble deep learning classifier for sentiment analysis on code-mix Hindi-English data (Retracted Article)

被引:10
作者
Pradhan, Rahul [1 ]
Sharma, Dilip Kumar [1 ]
机构
[1] GLA Univ, Mathura, India
关键词
Sentiment analysis; Code-mixing; Sentiment analysis on indian languages; XLM-R; Word embedding; Universal sentence encoder; SA; Transfer learning; Data analytics; Classifier optimisation; RECOGNITION;
D O I
10.1007/s00500-022-07091-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Code-mixing on social media is a trend in many countries where people speak multiple languages, such as India, where Hindi and English are major communication languages. Sentiment analysis is beneficial in understanding users' opinions and thoughts on social, economic, and political issues. It eliminates the manual monitoring of each and every review, which is a cumbersome task. However, performing sentiment analysis on code-mix data is challenging, as it involves various out of vocabulary terms and numerous issues, making it a new field in natural language processing. This work includes dealing with such text and ensembling a classifier to detect sentiment polarity. Our classifier ensembles a multilingual variant of RoBERTa and a sentence-level embedding from Universal Sentence Encoder to identify the sentiments of these code-mixed tweets with higher accuracy. This ensemble optimises the classifier's performance by using the strength of both for transfer learning. Experiments were conducted on real-life benchmark datasets and revealed their sentiment. The performance of the proposed classifier framework is compared with other baselines and deep learning models on five datasets to show the superiority of our results. Results showed improved and increased performance in the proposed classifier's accuracy, precision, and recall. The accuracy achieved by our classifier on code-mix datasets is 66% on Joshi et al. 2016, 60% on SAIL 2017, and 67% on SemEval 2020 Task-9 dataset, which is on average around 3% as compared to contemporary baselines.
引用
收藏
页码:11053 / 11053
页数:1
相关论文
共 67 条
  • [1] All-in-One: Emotion, Sentiment and Intensity Prediction Using a Multi-Task Ensemble Framework
    Akhtar, Md Shad
    Ghosal, Deepanway
    Ekbal, Asif
    Bhattacharyya, Pushpak
    Kurohashi, Sadao
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (01) : 285 - 297
  • [2] Using Artificial Intelligence to Understand What Causes Sentiment Changes on Social Media
    Alattar, Fuad
    Shaalan, Khaled
    [J]. IEEE ACCESS, 2021, 9 : 61756 - 61767
  • [3] Combination of Recursive and Recurrent Neural Networks for Aspect-Based Sentiment Analysis Using Inter-Aspect Relations
    Aydln, Cem Rlfkl
    Gungor, Tunga
    [J]. IEEE ACCESS, 2020, 8 : 77820 - 77832
  • [4] Banerjee Somnath, 2018, Text Processing. FIRE 2016 International Workshop. Revised Selected Papers: LNCS 10478, P39, DOI 10.1007/978-3-319-73606-8_3
  • [5] Bao H., 2020, INT C MACHINE LEARNI, P642
  • [6] Barnett R., 2000, International Journal of Bilingualism, V4, P131
  • [7] Cer D, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P169
  • [8] Chakravarthi BR, 2021, ARXIV PREPRINT ARXIV
  • [9] Semantic and Sentiment Analysis of Selected Bhagavad Gita Translations Using BERT-Based Language Framework
    Chandra, Rohitash
    Kulkarni, Venkatesh
    [J]. IEEE ACCESS, 2022, 10 : 21291 - 21315
  • [10] Coarse-Grained plus /-Effect Word Sense Disambiguation for Implicit Sentiment Analysis
    Choi, Yoonjung
    Wiebe, Janyce
    Mihalcea, Rada
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2017, 8 (04) : 471 - 479