Evaluating Ensembled Transformers for Multilingual Code-Switched Sentiment Analysis

被引:0
|
作者
Aryal, Saurav K. [1 ]
Prioleau, Howard [1 ]
Washington, Gloria [1 ]
Burge, Legand [1 ]
机构
[1] Howard Univ, Comp Sci, Washington, DC 20059 USA
来源
2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023 | 2023年
基金
美国国家卫生研究院;
关键词
Code Switching; Ensembling; BERT; Transformers;
D O I
10.1109/CSCI62032.2023.00032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is essential for understanding human-authored texts, especially in multilingual communities where code-switching is common. Most existing research focuses on single-language pair sentiment analysis. We introduce a three-step approach for sentiment analysis on code-switched data: translating the code-switched data into English at word and sentence levels, training on Transformer models, and utilizing a stacking classifier to ensemble the models for sentiment classification. We establish a performance benchmark for binary and ternary sentiment classification by applying this to five datasets featuring English mixed with Spanish, Tamil, Telugu, Hindi, and Malayalam. Our method emphasizes the potential of ensembled Transformer models in this domain, paving the way for future advancements.
引用
收藏
页码:165 / 173
页数:9
相关论文
共 50 条
  • [41] Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition
    Wiesner, Matthew
    Sarma, Mousmita
    Arora, Ashish
    Raj, Desh
    Gao, Dongji
    Huang, Ruizhe
    Preet, Supreet
    Johnson, Moris
    Iqbal, Zikra
    Goel, Nagendra
    Trmal, Jan
    Garcia, Paola
    Khudanpur, Sanjeev
    INTERSPEECH 2021, 2021, : 2906 - 2910
  • [42] BioBridge: Unified Bio-Embedding With Bridging Modality in Code-Switched EMR
    Jeon, Jangyeong
    Cho, Sangyeon
    Lee, Dongjoon
    Lee, Changhee
    Kim, Junyeong
    IEEE ACCESS, 2024, 12 : 141866 - 141877
  • [43] Supervised sentiment analysis in multilingual environments
    Vilares, David
    Alonso, Miguel A.
    Gomez-Rodriguez, Carlos
    INFORMATION PROCESSING & MANAGEMENT, 2017, 53 (03) : 595 - 607
  • [44] Twitter Dataset and Evaluation of Transformers for Turkish Sentiment Analysis
    Koksal, Abdullatif
    Ozgur, Arzucan
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [45] Code-switched end-to-end Marathi speech recognition for especially abled people
    Hore, Praveen
    Sharma, Amit
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2022, 25 (03) : 771 - 784
  • [46] AdapterFusion-based multi-task learning for code-mixed and code-switched text classification
    Rathnayake, Himashi
    Sumanapala, Janani
    Rukshani, Raveesha
    Ranathunga, Surangika
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [47] CODE-SWITCHED LANGUAGE MODELLING USING A CODE PREDICTIVE LSTM IN UNDER-RESOURCED SOUTH AFRICAN LANGUAGES
    van Vuren, Joshua Jansen
    Niesler, Thomas
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 785 - 791
  • [48] A Study of Lexical and Prosodic Cues to Segmentation in a Hindi-English Code-switched Discourse
    Rao, Preeti
    Pandya, Mugdha
    Sabu, Kamini
    Kumar, Kanhaiya
    Bondale, Nandini
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1918 - 1922
  • [49] CODE-SWITCHED SPEECH SYNTHESIS USING BILINGUAL PHONETIC POSTERIORGRAM WITH ONLY MONOLINGUAL CORPORA
    Cao, Yuewen
    Liu, Songxiang
    Wu, Xixin
    Kang, Shiyin
    Liu, Peng
    Wu, Zhiyong
    Liu, Xunying
    Su, Dan
    Yu, Dong
    Meng, Helen
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7619 - 7623
  • [50] Sentiment Analysis on Algerian Dialect with Transformers
    Benmounah, Zakaria
    Boulesnane, Abdennour
    Fadheli, Abdeladim
    Khial, Mustapha
    APPLIED SCIENCES-BASEL, 2023, 13 (20):