PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models

Cited by: 6

Authors:

Prakash, Nirmalendu [1]

Wang, Han [1]

Hoang, Nguyen Khoi [2]

Hee, Ming Shan [1]

Lee, Roy Ka-Wei [1]

Affiliations:

[1] Singapore University of Technology and Design, Singapore, Singapore

[2] VinUniversity, Hanoi, Vietnam

Source:

Proceedings of the 31st ACM International Conference on Multimedia, MM 2023 | 2023

Keywords:

meme; multimodal; topic modeling; large language models; internet memes

DOI:

10.1145/3581783.3613836

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The proliferation of social media has given rise to a new form of communication: memes. Memes are multimodal and often contain a combination of text and visual elements that convey meaning, humor, and cultural significance. While meme analysis has been an active area of research, little work has been done on unsupervised multimodal topic modeling of memes, which is important for content moderation, social media analysis, and cultural studies. We propose PromptMTopic, a novel multimodal prompt-based model designed to learn topics from both text and visual modalities by leveraging the language modeling capabilities of large language models. Our model effectively extracts and clusters topics learned from memes, considering the semantic interaction between the text and visual modalities. We evaluate our proposed model through extensive experiments on three real-world meme datasets, which demonstrate its superiority over state-of-the-art topic modeling baselines in learning descriptive topics in memes. Additionally, our qualitative analysis shows that PromptMTopic can identify meaningful and culturally relevant topics from memes. Our work contributes to the understanding of the topics and themes of memes, a crucial form of communication in today's society.

Pages: 621-631

Number of pages: 11
