Evaluating Ensembled Transformers for Multilingual Code-Switched Sentiment Analysis

Cited by: 0
Authors
Aryal, Saurav K. [1]
Prioleau, Howard [1]
Washington, Gloria [1]
Burge, Legand [1]
Affiliations
[1] Howard Univ, Comp Sci, Washington, DC 20059 USA
Source
2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023 | 2023
Funding
National Institutes of Health (USA)
Keywords
Code Switching; Ensembling; BERT; Transformers
DOI
10.1109/CSCI62032.2023.00032
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sentiment analysis is essential for understanding human-authored texts, especially in multilingual communities where code-switching is common. Most existing research focuses on sentiment analysis for a single language pair. We introduce a three-step approach for sentiment analysis on code-switched data: translating the code-switched data into English at the word and sentence levels, training Transformer models, and using a stacking classifier to ensemble the models for sentiment classification. We establish a performance benchmark for binary and ternary sentiment classification by applying this approach to five datasets featuring English mixed with Spanish, Tamil, Telugu, Hindi, and Malayalam. Our method highlights the potential of ensembled Transformer models in this domain, paving the way for future advancements.
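The stacking step named in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the base-model probabilities below are made-up values standing in for held-out outputs of fine-tuned Transformers, and the choice of logistic regression as the meta-learner, along with the function names `train_meta_learner` and `predict`, is an assumption made for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_meta_learner(features, labels, epochs=500, lr=0.5):
    """Fit a logistic-regression meta-learner with batch gradient descent.

    Each row of `features` holds the positive-class probabilities that the
    base models assigned to one example (hypothetical values here).
    """
    n_features = len(features[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(features, labels):
            score = sum(w * xi for w, xi in zip(weights, x)) + bias
            error = sigmoid(score) - y
            for j, xi in enumerate(x):
                grad_w[j] += error * xi
            grad_b += error
        n = len(features)
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def predict(weights, bias, x):
    """Return 1 (positive) or 0 (negative) for one row of stacked outputs."""
    score = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if sigmoid(score) >= 0.5 else 0


# Hypothetical held-out probabilities from three base models, for example
# models trained on word-level and sentence-level translations.
base_probs = [
    [0.9, 0.8, 0.7],
    [0.8, 0.6, 0.9],
    [0.7, 0.9, 0.6],
    [0.2, 0.3, 0.1],
    [0.1, 0.4, 0.3],
    [0.3, 0.2, 0.2],
]
labels = [1, 1, 1, 0, 0, 0]

weights, bias = train_meta_learner(base_probs, labels)
predictions = [predict(weights, bias, x) for x in base_probs]
```

In practice the meta-learner would be fit on base-model outputs for examples the base models did not see during training, so that it learns how much to trust each model rather than memorizing their training-set behavior. For the ternary setting, each base model would contribute one probability per class instead of a single value.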
Pages: 165-173
Page count: 9
Related papers
50 in total
  • [1] Sentiment Analysis of Code-Switched Tunisian Dialect: Exploring RNN-Based Techniques
    Jerbi, Mohamed Amine
    Achour, Hadhemi
    Souissi, Emna
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 122 - 131
  • [2] Approaches for Multilingual Phone Recognition in Code-switched and Non-code-switched Scenarios Using Indian Languages
    Manjunath, K. E.
    Raghavan, Srinivasa K. M.
    Rao, K. Sreenivasa
    Jayagopi, Dinesh Babu
    Ramasubramanian, V
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (04)
  • [3] A First South African Corpus of Multilingual Code-switched Soap Opera Speech
    van der Westhuizen, Ewald
    Niesler, Thomas
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2854 - 2859
  • [4] Adapting Deep Learning for Sentiment Classification of Code-Switched Informal Short Text
    Shakeel, Muhammad Haroon
    Karim, Asim
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 903 - 906
  • [5] Improving Low Resource Code-switched ASR using Augmented Code-switched TTS
    Sharma, Yash
    Abraham, Basil
    Taneja, Karan
    Jyothi, Preethi
    INTERSPEECH 2020, 2020, : 4771 - 4775
  • [6] The phonetics of code-switched vowels
    Muldner, Kasia
    Hoiting, Leah
    Sanger, Leyna
    Blumenfeld, Lev
    Toivonen, Ida
    INTERNATIONAL JOURNAL OF BILINGUALISM, 2019, 23 (01) : 37 - 52
  • [7] A novel socio-pragmatic framework for sentiment analysis in Dravidian-English code-switched texts
    Prakash, V. Jothi
    Vijay, S. Arul Antran
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [8] Augmenting sentiment prediction capabilities for code-mixed tweets with multilingual transformers
    Hashmi, Ehtesham
    Yayilgan, Sule Yildirim
    Shaikh, Sarang
    SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [9] CONVERSATION ANALYSIS OF CODE-SWITCHED UTTERANCES IN THE SPEECH OF KAZAKH BILINGUALS
    Akynova, Damira
    Agmanova, Atirkul
    Zhuravleva, Yevgeniya
    Bayekeyeva, Zhuldyz
    PROCEEDINGS OF INTCESS 2019- 6TH INTERNATIONAL CONFERENCE ON EDUCATION AND SOCIAL SCIENCES, 2019, : 901 - 907
  • [10] Normalization of code-switched text for speech synthesis
    Manghat, Sreeram
    Manghat, Sreeja
    Schultz, Tanja
    INTERSPEECH 2022, 2022, : 4297 - 4301