Enhancing Moroccan Dialect Sentiment Analysis Through Optimized Preprocessing and Transfer Learning Techniques

被引:0
|
作者
Matrane, Yassir [1 ]
Benabbou, Faouzia [1 ]
Ellaky, Zineb [1 ]
机构
[1] Hassan II Univ Casablanca, Fac Sci Ben MSick, Lab Informat Technol & Modeling, Casablanca 20000, Morocco
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Sentiment analysis; Accuracy; Natural language processing; Linguistics; Transfer learning; Analytical models; Text categorization; Complexity theory; Tuning; Arabic dialect; Moroccan dialect; preprocessing; deep learning; fine-tuning; ARABIC TEXT CLASSIFICATION; STEMMER;
D O I
10.1109/ACCESS.2024.3514934
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work investigates the challenges of sentiment analysis for Moroccan Arabic dialect (MD), where the lack of dialect-specific preprocessing methods complicates natural language processing tasks and affects sentiment classification performance. The research evaluates various preprocessing techniques, including stemming and feature extraction, using two main transfer learning approaches: feature extraction with deep learning models and fine-tuning pre-trained models. Experimentations were conducted on four MD datasets to assess combinations of stemmers, feature extractors, and architectures. In the feature extraction approach, omitting stemming and employing the QARiB feature extractor with a BiGRU model yielded the highest accuracy on the FB and MAC datasets, reaching 90.45% and 75.50%, respectively. In the fine-tuning approach, DarijaBERT excelled on the FB dataset with an accuracy of 93.37% and an F1-score of 88.55%, while QaRIB and AraBERT performed comparably well on the MAC and MSAC datasets. Results suggest that excluding base form reduction methods, such as stemming and lemmatization, during fine-tuning enhances sentiment analysis performance in MD, highlighting the limitations of Modern Standard Arabic techniques for MD processing. This study provides valuable insights for improving Natural language processing (NLP) applications in Arabic dialects, particularly in sentiment analysis, by optimizing model performance without relying on standard preprocessing methods.
引用
收藏
页码:187756 / 187777
页数:22
相关论文
共 50 条
  • [41] Sentiment Analysis for Women in STEM using Twitter and Transfer Learning Models
    Fouad, Shereen
    Alkooheji, Ezzaldin
    2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 227 - 234
  • [42] Deep Learning Based Techniques for Sentiment Analysis: A Survey
    Etaiwi, Wael
    Suleiman, Dima
    Awajan, Arafat
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (07): : 89 - 96
  • [43] Investigating Machine Learning Techniques for User Sentiment Analysis
    Patel, Nimesh, V
    Chhinkaniwala, Hitesh
    INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2019, 11 (03) : 1 - 12
  • [44] Improving Sentiment Analysis of Moroccan Tweets Using Ensemble Learning
    Oussous, Ahmed
    Ait Lahcen, Ayoub
    Belfkih, Samir
    BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 91 - 104
  • [45] Sentiment Analysis of Code-Switched Tunisian Dialect: Exploring RNN-Based Techniques
    Jerbi, Mohamed Amine
    Achour, Hadhemi
    Souissi, Emna
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 122 - 131
  • [46] Machine learning techniques for sentiment analysis
    Lopez, Jessica Olivares
    Lopez, Abraham Sanchez
    Velazquez, Rogelio Gonzalez
    Diaz, Maria del Carmen Santiago
    Vazquez, Ana Claudia Zenteno
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2024, 15 (05): : 6 - 16
  • [47] Sentiment analysis model for Airline customers' feedback using deep learning techniques
    Samir, Heba Allah
    Abd-Elmegid, Laila
    Marie, Mohamed
    INTERNATIONAL JOURNAL OF ENGINEERING BUSINESS MANAGEMENT, 2023, 15
  • [48] Analysis of Preprocessing Techniques, Keras Tuner, and Transfer Learning on Cloud Street image data
    Joshi, Sharmad
    Owens, Jessie Ann
    Shah, Shlok
    Munasinghe, Thilanka
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4165 - 4168
  • [49] A Comparative Study of Machine Learning and Deep Learning Techniques for Sentiment Analysis
    Jain, Kruttika
    Kaushal, Shivani
    2018 7TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO) (ICRITO), 2018, : 483 - 487
  • [50] Enhancing Human Action Recognition Through Transfer Learning and Body Articulation Analysis
    Jlidi, Nozha
    Jemai, Olfa
    Bouchrika, Tahani
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025, : 4394 - 4422