Enhancing Cross-Lingual Sarcasm Detection by a Prompt Learning Framework with Data Augmentation and Contrastive Learning

被引:1
作者
An, Tianbo [1 ,2 ,3 ]
Yan, Pingping [1 ,2 ,3 ]
Zuo, Jiaai [4 ]
Jin, Xing [5 ,6 ]
Liu, Mingliang [1 ]
Wang, Jingrui [1 ,2 ,3 ]
机构
[1] Changchun Univ, Coll Comp Sci & Technol, Changchun 130022, Peoples R China
[2] Changchun Univ, Key Lab Intelligent Rehabil & Barrier Free Disable, Minist Educ, Changchun 130022, Peoples R China
[3] Jilin Prov Key Lab Human Hlth Status Identificat &, Changchun 130022, Peoples R China
[4] Henan Univ Sci & Technol, Coll Informat Engn, Luoyang 471023, Peoples R China
[5] Hangzhou Dianzi Univ, Sch Cyberspace, Hangzhou 310018, Peoples R China
[6] Hangzhou Dianzi Univ, Expt Ctr Data Sci & Intelligent Decis Making, Hangzhou 310018, Peoples R China
关键词
cross-lingual; prompt learning; sarcasm detection; Chinese; SENTIMENT;
D O I
10.3390/electronics13112163
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given their intricate nature and inherent ambiguity, sarcastic texts often mask deeper emotions, making it challenging to discern the genuine feelings behind the words. The proposal of the sarcasm detection task is to assist us with more accurately understanding the true intention of the speaker. Advanced methods, such as deep learning and neural networks, are widely used in the field of sarcasm detection. However, most research mainly focuses on sarcastic texts in English, as other languages lack corpora and annotated datasets. To address the challenge of low-resource languages in sarcasm detection tasks, a zero-shot cross-lingual transfer learning method is proposed in this paper. The proposed approach is based on prompt learning and aims to assist the model with understanding downstream tasks through prompts. Specifically, the model uses prompt templates to construct training data into cloze-style questions and then trains them using a pre-trained cross-lingual language model. Combining data augmentation and contrastive learning can further improve the capacity of the model for cross-lingual transfer learning. To evaluate the performance of the proposed model, we utilize a publicly accessible sarcasm dataset in English as training data in a zero-shot cross-lingual setting. When tested with Chinese as the target language for transfer, our model achieves F1-scores of 72.14% and 76.7% on two test datasets, outperforming the strong baselines by significant margins.
引用
收藏
页数:14
相关论文
共 68 条
[1]   Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond [J].
Artetxe, Mikel ;
Schwenk, Holger .
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2019, 7 :597-610
[2]  
Babanejad Nastaran, 2020, P 28 INT C COMP LING, P225
[3]  
Brown TB, 2020, ADV NEUR IN, V33
[4]  
Bukhari Syed Husnain Haider, 2023, 2023 3rd International Conference on Artificial Intelligence (ICAI), P26, DOI 10.1109/ICAI58407.2023.10136656
[5]  
Conneau A, 2020, Arxiv, DOI [arXiv:1911.02116, DOI 10.48550/ARXIV.1911.02116]
[6]  
Ding K., 2022, arXiv
[7]   Context-Based Feature Technique for Sarcasm Identification in Benchmark Datasets Using Deep Learning and BERT Model [J].
Eke, Christopher Ifeanyi ;
Norman, Azah Anir ;
Shuib, Liyana .
IEEE ACCESS, 2021, 9 (09) :48501-48518
[8]   Enriching contextualized semantic representation with textual information transmission for COVID-19 fake news detection: A study on English and Persian [J].
Ghayoomi, Masood .
DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2023, 38 (01) :99-110
[9]   Ensemble transfer learning-based multimodal sentiment analysis using weighted convolutional neural networks [J].
Ghorbanali, Alireza ;
Sohrabi, Mohammad Karim ;
Yaghmaee, Farzin .
INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (03)
[10]   Applying Basic Features from Sentiment Analysis for Automatic Irony Detection [J].
Hernandez-Farias, Irazu ;
Benedi, Jose-Miguel ;
Rosso, Paolo .
PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 :337-344