Standardization of Dialect Comments in Social Networks in View of Sentiment Analysis : Case of Tunisian Dialect

被引:0
|
作者
Kchaou, Sameh [1 ]
Boujelbane, Rahma [1 ]
Fsih, Emna [1 ]
Belguith, Lamia Hadrich [1 ]
机构
[1] Univ Sfax, MIRACL Lab FSEGS, ANLP Res Grp, Sfax, Tunisia
来源
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年
关键词
Dialect Identification; Neural Machine Translation; Sentiment Analysis; Tunisian Dialect; Modern Standard Arabic;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the growing access to the internet, the spoken Arabic dialect language becomes an informal languages written in social media. Most users post comments using their own dialect. This linguistic situation inhibits mutual understanding between internet users and makes difficult to use computational approaches since most Arabic resources are intended for the formal language: Modern Standard Arabic (MSA). In this paper, we present a pipeline to standardize the written texts in social networks by translating them to the MSA. We fine-tune at first an identification bert-based model to select Tunisian Dialect (TD) comments from MSA and other dialects. Then, the resulting comments are translated using a neural translation model. Each of these steps was evaluated on the same test corpus. In order to test the effectiveness of the approach, we compared two opinion analysis models, the first is intended for the Sentiment Analysis (SA) of dialect texts and the second is for the MSA texts. We concluded that through standardization we obtain the best score.
引用
收藏
页码:5436 / 5443
页数:8
相关论文
共 50 条
  • [31] Enhancing Moroccan Dialect Sentiment Analysis Through Optimized Preprocessing and Transfer Learning Techniques
    Matrane, Yassir
    Benabbou, Faouzia
    Ellaky, Zineb
    IEEE ACCESS, 2024, 12 : 187756 - 187777
  • [32] What influences sentiment analysis on social networks: a case study
    Mambelli, Giacomo
    Prandi, Catia
    Mirri, Silvia
    2020 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2020, : 992 - 997
  • [33] Sentiment Analysis of Social Media Comments in Mauritius
    Sahib, Nuzhah Gooda
    Marianne, Marie Angele Justine
    Gobin-Rahimbux, Baby
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 860 - 865
  • [34] Sentiment Analysis of Algerian Dialect Using Machine Learning and Deep Learning with Word2vec
    Mazari, Ahmed Cherif
    Djeffal, Abdelhamid
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2022, 46 (06): : 67 - 78
  • [35] Improving Sentiment Analysis Performance on Imbalanced Moroccan Dialect Datasets Using Resample and Feature Extraction Techniques
    Nassr, Zineb
    Benabbou, Faouzia
    Sael, Nawal
    Hamim, Touria
    INFORMATION, 2025, 16 (01)
  • [36] Sentiment Analysis of Social Networks Messages
    Tretyakov, Evgeny
    Savic, Dobrica
    Korpusenko, Anastasia
    Ionkina, Kristina
    BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES 2021, 2022, 1032 : 552 - 560
  • [37] Sentiment analysis of positive and negative comments, extracted from social networks and web in Albanian language
    Hoti, Mërgim H.
    Hoti, Hamdi
    Kurhasku, Edisona
    International Journal of Applied Systemic Studies, 2024, 11 (02) : 83 - 96
  • [38] Learning user sentiment orientation in social networks for sentiment analysis
    Chen, Jie
    Song, Nan
    Su, Yansen
    Zhao, Shu
    Zhang, Yanping
    INFORMATION SCIENCES, 2022, 616 : 526 - 538
  • [39] SANA: A Large Scale Multi-Genre, Multi-Dialect Lexicon for Arabic Subjectivity and Sentiment Analysis
    Abdul-Mageed, Muhammad
    Diab, Mona
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1162 - 1169
  • [40] Social Rankings: Visual Sentiment Analysis in Social Networks
    Fernandez, Javi
    Gutierrez, Yoan
    Gomez, Jose M.
    Martinez-Barco, Patricio
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2015, (55): : 199 - 202