Evaluating Ensembled Transformers for Multilingual Code-Switched Sentiment Analysis

被引:0
|
作者
Aryal, Saurav K. [1 ]
Prioleau, Howard [1 ]
Washington, Gloria [1 ]
Burge, Legand [1 ]
机构
[1] Howard Univ, Comp Sci, Washington, DC 20059 USA
来源
2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023 | 2023年
基金
美国国家卫生研究院;
关键词
Code Switching; Ensembling; BERT; Transformers;
D O I
10.1109/CSCI62032.2023.00032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is essential for understanding human-authored texts, especially in multilingual communities where code-switching is common. Most existing research focuses on single-language pair sentiment analysis. We introduce a three-step approach for sentiment analysis on code-switched data: translating the code-switched data into English at word and sentence levels, training on Transformer models, and utilizing a stacking classifier to ensemble the models for sentiment classification. We establish a performance benchmark for binary and ternary sentiment classification by applying this to five datasets featuring English mixed with Spanish, Tamil, Telugu, Hindi, and Malayalam. Our method emphasizes the potential of ensembled Transformer models in this domain, paving the way for future advancements.
引用
收藏
页码:165 / 173
页数:9
相关论文
共 50 条
  • [31] Use of prompt-based learning for code-mixed and code-switched text classification
    Udawatta, Pasindu
    Udayangana, Indunil
    Gamage, Chathulanka
    Shekhar, Ravi
    Ranathunga, Surangika
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (05):
  • [32] Modeling the auxiliary phrase asymmetry in code-switched Spanish-English
    Tsoukala, Chara
    Frank, Stefan L.
    Van den Bosch, Antal
    Kroff, Jorge Valdes
    Broersma, Mirjam
    BILINGUALISM-LANGUAGE AND COGNITION, 2021, 24 (02) : 271 - 280
  • [33] Improving Code-Switched Language Modeling Performance Using Cognate Features
    Soto, Victor
    Hirschberg, Julia
    INTERSPEECH 2019, 2019, : 3725 - 3729
  • [34] Code-switched English Pronunciation Modeling for Swahili Spoken Term Detection
    Kleynhans, Neil
    Hartman, William
    van Niekerk, Daniel
    van Heerden, Charl
    Schwartz, Rich
    Tsakalidis, Stavros
    Davel, Marelie
    SLTU-2016 5TH WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGIES FOR UNDER-RESOURCED LANGUAGES, 2016, 81 : 128 - 135
  • [35] Identifying Sentiments in Algerian Code-switched User-generated Comments
    Adouane, Wafia
    Touileb, Sarnia
    Bernardy, Jean-Philippe
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2698 - 2705
  • [36] TRANSLITERATION BASED APPROACHES TO IMPROVE CODE-SWITCHED SPEECH RECOGNITION PERFORMANCE
    Emond, Jesse
    Ramabhadran, Bhuvana
    Roark, Brian
    Moreno, Pedro
    Ma, Min
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 448 - 455
  • [37] Code-switched automatic speech recognition in five South African languages
    Biswas, Astik
    Yilmaz, Emre
    van der Westhuizen, Ewald
    de Wet, Febe
    Niesler, Thomas
    COMPUTER SPEECH AND LANGUAGE, 2022, 71
  • [38] A Novel Approach for Effective Recognition of the Code-Switched Data on Monolingual Language Model
    Sreeram, Ganji
    Sinha, Rohit
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1953 - 1957
  • [39] The effect of lexical triggers on Spanish-English code-switched judgment tasks
    Koronkiewicz, Bryan
    Delgado, Rodrigo
    FRONTIERS IN PSYCHOLOGY, 2024, 15
  • [40] THE EFFECTS OF CODE-SWITCHED INSTRUCTION ON L2 LEARNING: A RESEARCH SYNTHESIS
    Pires, Daniel Reschke
    REVISTA DA ANPOLL, 2020, 51 (01) : 139 - 152