SemiMemes: A Semi-supervised Learning Approach for Multimodal Memes Analysis

Cited by: 1

Authors:

Pham Thai Hoang Tung [1]

Nguyen Tan Viet [1]

Ngo Tien Anh [1]

Phan Duy Hung [1]

Affiliations:

[1] FPT Univ, Hanoi, Vietnam

Source:

COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023 | 2023 / Vol. 14162

Keywords:

Memes analysis; Multimodal learning; Semi-supervised learning;

DOI:

10.1007/978-3-031-41456-5_43

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The prevalence of memes on social media has created a need to analyze the sentiment of their underlying meanings so that harmful content can be censored. Machine-learning-based meme censoring systems call for a semi-supervised learning solution that takes advantage of the large number of unlabeled memes available on the internet and makes the annotation process less demanding. Moreover, the approach needs to use multimodal data, since a meme's meaning usually comes from both its image and its text. This research proposes a multimodal semi-supervised learning approach that outperforms other state-of-the-art multimodal semi-supervised and supervised learning models on two datasets: Multimedia Automatic Misogyny Identification and Hateful Memes. Building on insights from Contrastive Language-Image Pre-training (CLIP), an effective multimodal learning technique, this research introduces SemiMemes, a novel training method that combines an auto-encoder task with a classification task to make use of abundant unlabeled data.
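
The abstract does not spell out the training objective, so the following is only a rough illustration of the general idea of pairing a reconstruction (auto-encoder) loss with a classification loss in a semi-supervised setting. It is not the authors' implementation. The function names, the `recon_weight` parameter, the use of mean squared error and binary cross-entropy, and the dictionary layout of each batch item are all assumptions made for this sketch. The point it shows is that unlabeled items can still contribute through the reconstruction term, while only labeled items contribute to the classification term.

```python
import math

def mse(x, x_hat):
    """Mean squared reconstruction error between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)

def bce(y, p, eps=1e-12):
    """Binary cross-entropy for one label y in {0, 1} and predicted probability p."""
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))

def joint_loss(batch, recon_weight=1.0):
    """Hypothetical joint objective: classification loss on labeled items
    plus a weighted reconstruction loss on all items.

    Each item is a dict with:
      'x'     : input feature vector (e.g. a fused image-text embedding; assumed)
      'x_hat' : the auto-encoder's reconstruction of 'x'
      'p'     : predicted probability of the positive class
      'y'     : 0/1 label, or None when the item is unlabeled
    """
    # Reconstruction term: every item counts, labeled or not.
    recon = sum(mse(item["x"], item["x_hat"]) for item in batch) / len(batch)

    # Classification term: only labeled items count.
    labeled = [item for item in batch if item["y"] is not None]
    if labeled:
        cls = sum(bce(item["y"], item["p"]) for item in labeled) / len(labeled)
    else:
        cls = 0.0

    return cls + recon_weight * recon
```

In a real system the reconstructions and probabilities would come from a trained network and the two terms would be minimized jointly by gradient descent. Here they are passed in as plain numbers so that the loss arithmetic can be checked by hand.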


Pages: 565-577

Page count: 13
