Automatic Algerian Sarcasm Detection from Texts and Images

被引:2
作者
Bousmaha, Kheira Zineb [1 ]
Hamadouche, Khaoula [1 ]
Djouabi, Hadjer [1 ]
Hadrich-Belguith, Lamia [2 ]
机构
[1] Univ Oran 1 Ahmed Ben Bella, Comp Sci, Oran, Algeria
[2] Univ Sfax, Comp Sci, Sfax, Tunisia
关键词
Sentiment analysis; Sarcasm detection; Algerian Dialect; linguistic features; BERT model; IRONY;
D O I
10.1145/3670403
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the number of Algerian Internet users has significantly increased, providing a valuable opportunity for collecting and utilizing opinions and sentiments expressed online. They now post not just texts but also images. However, to benefit from this wealth of information, it is crucial to address the challenge of sarcasm detection, which poses a limitation in sentiment analysis. Sarcasm often involves the use of nonliteral and ambiguous language, making its detection complex. To enhance the quality and relevance of sentiment analysis, it is essential to develop effective methods for sarcasm detection. By overcoming this limitation, we can fully harness the expressed online opinions and benefit from their valuable insights for a better understanding of trends and sentiments among the Algerian public. In this work, our aim is to develop a comprehensive system that addresses sarcasm detection in Algerian dialect, encompassing both text and image analysis. We propose a hybrid approach that combines linguistic characteristics and machine learning techniques for text analysis. Additionally, for image analysis, we utilized the deep learning model VGG-19 for image classification, and employed the EasyOCR technique for Arabic text extraction. By integrating these approaches, we strive to create a robust system capable of detecting sarcasm in both textual and visual content in the Algerian dialect. Our system achieved an accuracy of 92.79% for the textual models and 89.28% for the visual model.
引用
收藏
页数:25
相关论文
共 64 条
[21]  
Dave AD, 2016, 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), P1985, DOI 10.1109/ICEEOT.2016.7755036
[22]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[23]   Multi-feature fusion framework for sarcasm identification on twitter data: A machine learning based approach [J].
Eke, Christopher Ifeanyi ;
Norman, Azah Anir ;
Shuib, Liyana .
PLOS ONE, 2021, 16 (06)
[24]  
Faraj D., P 6 AR NAT LANG PROC
[25]  
Farha I.A., 2021, P 6 AR NAT LANG PROC, P21
[26]  
Farha I.A., 2022, P FIND ASS COMP LING, P5284, DOI [10.18653/v1/2022.findings-emnlp.387, DOI 10.18653/V1/2022.FINDINGS-EMNLP.387]
[27]   Arabic sarcasm detection: An enhanced fine-tuned language model approach [J].
Galal, Mohamed A. ;
Yousef, Ahmed Hassan ;
Zayed, Hala H. ;
Medhat, Walaa .
AIN SHAMS ENGINEERING JOURNAL, 2024, 15 (06)
[28]  
Gonzalez-Ibanez Roberto, 2011, P 49 ANN M ASS COMP, P581, DOI DOI 10.5555/2002736.2002850
[29]   A machine learning approach in analysing the effect of hyperboles using negative sentiment tweets for sarcasm detection [J].
Govindan, Vithyatheri ;
Balakrishnan, Vimala .
JOURNAL OF KING SAUD UNIVERSITY COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) :5110-5120
[30]  
Gupta R, 2020, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), P633, DOI [10.1109/iciccs48265.2020.9120917, 10.1109/ICICCS48265.2020.9120917]