Standardization of Dialect Comments in Social Networks in View of Sentiment Analysis : Case of Tunisian Dialect

被引:0
|
作者
Kchaou, Sameh [1 ]
Boujelbane, Rahma [1 ]
Fsih, Emna [1 ]
Belguith, Lamia Hadrich [1 ]
机构
[1] Univ Sfax, MIRACL Lab FSEGS, ANLP Res Grp, Sfax, Tunisia
来源
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年
关键词
Dialect Identification; Neural Machine Translation; Sentiment Analysis; Tunisian Dialect; Modern Standard Arabic;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the growing access to the internet, the spoken Arabic dialect language becomes an informal languages written in social media. Most users post comments using their own dialect. This linguistic situation inhibits mutual understanding between internet users and makes difficult to use computational approaches since most Arabic resources are intended for the formal language: Modern Standard Arabic (MSA). In this paper, we present a pipeline to standardize the written texts in social networks by translating them to the MSA. We fine-tune at first an identification bert-based model to select Tunisian Dialect (TD) comments from MSA and other dialects. Then, the resulting comments are translated using a neural translation model. Each of these steps was evaluated on the same test corpus. In order to test the effectiveness of the approach, we compared two opinion analysis models, the first is intended for the Sentiment Analysis (SA) of dialect texts and the second is for the MSA texts. We concluded that through standardization we obtain the best score.
引用
收藏
页码:5436 / 5443
页数:8
相关论文
共 50 条
  • [1] Sentiment Analysis of Tunisian Users on Social Networks: Overcoming the Challenge of Multilingual Comments in the Tunisian Dialect
    Jaballi, Samawel
    Zrigui, Salah
    Sghaier, Mohamed Ali
    Berchech, Dhaou
    Zrigui, Mounir
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 13501 : 176 - 192
  • [2] Deep Learning for Sentiment Analysis of Tunisian Dialect
    Masmoudi, Abir
    Hamdi, Jamila
    Belguith, Lamia Hadrich
    COMPUTACION Y SISTEMAS, 2021, 25 (01): : 129 - 148
  • [3] Bottom-up approach to translate Tunisian dialect texts in Social Networks
    Kchaou, Sameh
    Boujelbane, Rahma
    Belguith, Lamia Hadrich
    2022 IEEE/ACS 19TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2022,
  • [4] Sentiment Analysis: Effect of Combining BERT as an Embedding Technique with CNN Model for Tunisian Dialect
    Mechti, Seifeddine
    Faiz, Rim
    Khoufi, Nabil
    Antit, Shaima
    Krichen, Moez
    ADVANCES IN INFORMATION SYSTEMS, ARTIFICIAL INTELLIGENCE AND KNOWLEDGE MANAGEMENT, ICIKS 2023, 2024, 486 : 309 - 320
  • [5] Dialect Versus MSA Sentiment Analysis
    Rizkallah, Sandra
    Atiya, Amir
    Mahgoub, Hossam Eldin
    Heragy, Momen
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 605 - 613
  • [6] Sentiment Analysis of Emirati Dialect
    A. Al Shamsi, Arwa
    Abdallah, Sherief
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (02)
  • [7] Sentiment Analysis of Code-Switched Tunisian Dialect: Exploring RNN-Based Techniques
    Jerbi, Mohamed Amine
    Achour, Hadhemi
    Souissi, Emna
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 122 - 131
  • [8] Sentiment Analysis of Users on Social Networks: Overcoming the challenge of the Loose Usages of the Algerian Dialect
    Soumeur, Assia
    Mokdadi, Mheni
    Guessoum, Ahmed
    Dao, Amina
    ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 26 - 37
  • [9] Sentiment Analysis on Algerian Dialect with Transformers
    Benmounah, Zakaria
    Boulesnane, Abdennour
    Fadheli, Abdeladim
    Khial, Mustapha
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [10] Sentiment Analysis of Arabic Jordanian Dialect Tweets
    Atoum, Jalal Omer
    Nouman, Mais
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (02) : 256 - 262