PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models

被引:6
|
作者
Prakash, Nirmalendu [1 ]
Wang, Han [1 ]
Hoang, Nguyen Khoi [2 ]
Hee, Ming Shan [1 ]
Lee, Roy Ka-Wei [1 ]
机构
[1] Singapore Univ Technol & Design, Singapore, Singapore
[2] VinUniversity, Hanoi, Vietnam
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
关键词
meme; multimodal; topic modeling; large language models; INTERNET MEMES;
D O I
10.1145/3581783.3613836
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The proliferation of social media has given rise to a new form of communication: memes. Memes are multimodal and often contain a combination of text and visual elements that convey meaning, humor, and cultural significance. While meme analysis has been an active area of research, little work has been done on unsupervised multimodal topic modeling of memes, which is important for content moderation, social media analysis, and cultural studies. We propose PromptMTopic, a novel multimodal prompt-based model designed to learn topics from both text and visual modalities by leveraging the language modeling capabilities of large language models. Our model effectively extracts and clusters topics learned from memes, considering the semantic interaction between the text and visual modalities. We evaluate our proposed model through extensive experiments on three real-world meme datasets, which demonstrate its superiority over state-of-the-art topic modeling baselines in learning descriptive topics in memes. Additionally, our qualitative analysis shows that PromptMTopic can identify meaningful and culturally relevant topics from memes. Our work contributes to the understanding of the topics and themes of memes, a crucial form of communication in today's society.
引用
收藏
页码:621 / 631
页数:11
相关论文
共 50 条
  • [41] Do Multimodal Large Language Models and Humans Ground Language Similarly?
    Jones, Cameron R.
    Bergen, Benjamin
    Trott, Sean
    COMPUTATIONAL LINGUISTICS, 2024, 50 (04) : 1415 - 1440
  • [42] Revisiting Automated Topic Model Evaluation with Large Language Models
    Stammbach, Dominik
    Zouhar, Vilem
    Hoyle, Alexander
    Sachan, Mrinmaya
    Ash, Elliott
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 9348 - 9357
  • [43] Process Modeling with Large Language Models
    Kourani, Humam
    Berti, Alessandro
    Schuster, Daniel
    van der Aalst, Wil M. P.
    ENTERPRISE, BUSINESS-PROCESS AND INFORMATION SYSTEMS MODELING, BPMDS 2024, EMMSAD 2024, 2024, 511 : 229 - 244
  • [44] Computing Architecture for Large-Language Models (LLMs) and Large Multimodal Models (LMMs)
    Liang, Bor-Sung
    PROCEEDINGS OF THE 2024 INTERNATIONAL SYMPOSIUM ON PHYSICAL DESIGN, ISPD 2024, 2024, : 233 - 234
  • [45] Unsupervised language model adaptation via topic modeling based on named entity hypotheses
    Liu, Yang
    Liu, Feifan
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4921 - 4924
  • [46] Harnessing multimodal approaches for depression detection using large language models and facial expressions
    Misha Sadeghi
    Robert Richer
    Bernhard Egger
    Lena Schindler-Gmelch
    Lydia Helene Rupp
    Farnaz Rahimi
    Matthias Berking
    Bjoern M. Eskofier
    npj Mental Health Research, 3 (1):
  • [47] BigARTM: Open Source Library for Regularized Multimodal Topic Modeling of Large Collections
    Vorontsov, Konstantin
    Frei, Oleksandr
    Apishev, Murat
    Romov, Peter
    Dudarenko, Marina
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2015, 2015, 542 : 370 - 381
  • [48] Understanding Russian Information Operations Using Unsupervised Multilingual Topic Modeling
    Chew, Peter A.
    Turnley, Jessica G.
    SOCIAL, CULTURAL, AND BEHAVIORAL MODELING, 2017, 10354 : 102 - 107
  • [49] Discovery of activity composites using topic models: An analysis of unsupervised methods
    Seiter, Julia
    Amft, Oliver
    Rossi, Mirco
    Troster, Gerhard
    PERVASIVE AND MOBILE COMPUTING, 2014, 15 : 215 - 227
  • [50] SEMI-SUPERVISED LEARNING OF LANGUAGE MODEL USING UNSUPERVISED TOPIC MODEL
    Bai, Shuanhu
    Huang, Chien-Lin
    Ma, Bin
    Li, Haizhou
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5382 - 5385