Multi-label movie genre classification based on multimodal fusion

被引:0
|
作者
Zihui Cai
Hongwei Ding
Jinlu Wu
Ying Xi
Xuemeng Wu
Xiaohui Cui
机构
[1] Wuhan University,Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Multi-label; Movie genre classification; Multimodal fusion; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Determining the genre of a movie based on its relevant information is a challenging multi-label classification task. Previous studies tended to classify movies based on only one or two modalities, ignoring some valuable modalities. Considering this, we propose a multimodal movie genre classification framework which comprehensively considers the data from different modalities including the audio, poster, plot and frame sequences from video. To be specific, it processes the data from various modalities with the help of deep learning technologies, and fuses them in the way of decision-level fusion and intermediate fusion including concatenation and element-wise sum, which can improve the classification performance due to making full use of the information complementarity between multiple modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% improvement in AU(PRC) and 5.3% improvement in AU(PRC)w. It can be seen that the performance of movie genre classification can be effectively improved by means of multimodal fusion.
引用
收藏
页码:36823 / 36840
页数:17
相关论文
共 50 条
  • [11] A Multi-label Multimodal Deep Learning Framework for Imbalanced Data Classification
    Pouyanfar, Samira
    Wang, Tianyi
    Chen, Shu-Ching
    2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 199 - 204
  • [12] Multi-Label Multimodal Emotion Recognition With Transformer-Based Fusion and Emotion-Level Representation Learning
    Le, Hoai-Duy
    Lee, Guee-Sang
    Kim, Soo-Hyung
    Kim, Seungwon
    Yang, Hyung-Jeong
    IEEE ACCESS, 2023, 11 : 14742 - 14751
  • [13] Multi-label classification based on analog reasoning
    Nicolas, Ruben
    Sancho-Asensio, Andreu
    Golobardes, Elisabet
    Fornells, Albert
    Orriols-Puig, Albert
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (15) : 5924 - 5931
  • [14] A Survey of Multi-label Text Classification Based on Deep Learning
    Chen, Xiaolong
    Cheng, Jieren
    Liu, Jingxin
    Xu, Wenghang
    Hua, Shuai
    Tang, Zhu
    Sheng, Victor S.
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 443 - 456
  • [15] Research on Micro-video Multi-Label Classification Based on Deep Multimodal Association Learning
    Li, Yun
    Lu, Zhixiang
    Liu, Shuyi
    Wang, Su
    Lü, Zimin
    Jing, Peiguang
    Data Analysis and Knowledge Discovery, 2024, 8 (07) : 77 - 88
  • [16] Multi-modal, Multi-task and Multi-label for Music Genre Classification and Emotion Regression
    Pandeya, Yagya Raj
    You, Jie
    Bhattarai, Bhuwan
    Lee, Joonwhoan
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1042 - 1045
  • [17] Automatic movie genre classification & emotion recognition via a BiProjection Multimodal Transformer
    Moreno-Galvan, Diego Aaron
    Lopez-Santillan, Roberto
    Gonzalez-Gurrola, Luis Carlos
    Montes-Y-Gomez, Manuel
    Sanchez-Vega, Fernando
    Lopez-Monroy, Adrian Pastor
    INFORMATION FUSION, 2025, 113
  • [18] Multimodal Attentive Representation Learning for Micro-video Multi-label Classification
    Jing, Peiguang
    Liu, Xianyi
    Zhang, Lijuan
    Li, Yun
    Liu, Yu
    Su, Yuting
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (06)
  • [19] Multi-label classification of technical articles based on deep neural network
    Zhao, Qiuhan
    Yang, Wenchuan
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8391 - 8397
  • [20] Mineral Identification Based on Multi-Label Image Classification
    Wu, Baokun
    Ji, Xiaohui
    He, Mingyue
    Yang, Mei
    Zhang, Zhaochong
    Chen, Yan
    Wang, Yuzhu
    Zheng, Xinqi
    MINERALS, 2022, 12 (11)