Multi-label movie genre classification based on multimodal fusion

Cited by: 9
Authors
Cai, Zihui [1]
Ding, Hongwei [1]
Wu, Jinlu [1]
Xi, Ying [1]
Wu, Xuemeng [1]
Cui, Xiaohui [1]
Affiliations
[1] Wuhan Univ, Minist Educ, Key Lab Aerosp Informat Secur & Trusted Comp, Sch Cyber Sci & Engn, Wuhan, Peoples R China
Keywords
Multi-label; Movie genre classification; Multimodal fusion; Deep learning; RECOGNITION; NETWORK
DOI
10.1007/s11042-023-16121-2
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Determining the genre of a movie from its associated information is a challenging multi-label classification task. Previous studies tended to classify movies using only one or two modalities, ignoring other valuable ones. We therefore propose a multimodal movie genre classification framework that draws on data from different modalities, including the audio, poster, plot, and frame sequences from the video. Specifically, the framework processes each modality with deep learning techniques and fuses them through decision-level fusion and intermediate fusion (including concatenation and element-wise sum). This improves classification performance by exploiting the complementary information across modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% in AU(PRC) and 5.3% in AU(PRC)(w). These results indicate that multimodal fusion can effectively improve movie genre classification.
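The three fusion strategies named in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the helper names, the toy feature vectors and probabilities, the 0.5 threshold, and the use of a plain average for decision-level fusion.

```python
from typing import List, Sequence


def concat_fusion(features: Sequence[Sequence[float]]) -> List[float]:
    """Intermediate fusion: join per-modality feature vectors end to end."""
    fused: List[float] = []
    for vec in features:
        fused.extend(vec)
    return fused


def sum_fusion(features: Sequence[Sequence[float]]) -> List[float]:
    """Intermediate fusion: element-wise sum of equal-length feature vectors."""
    length = len(features[0])
    if any(len(vec) != length for vec in features):
        raise ValueError("element-wise sum needs equal-length feature vectors")
    return [sum(values) for values in zip(*features)]


def decision_fusion(
    probs: Sequence[Sequence[float]], threshold: float = 0.5
) -> List[int]:
    """Decision-level fusion (assumed here to be a plain average).

    Each modality supplies one probability per genre. The probabilities are
    averaged across modalities, and every genre is thresholded independently,
    so a movie can receive several genre labels at once.
    """
    n_modalities = len(probs)
    averaged = [sum(values) / n_modalities for values in zip(*probs)]
    return [1 if p >= threshold else 0 for p in averaged]


# Hypothetical 2-dimensional features from two modalities.
audio_feat = [1.0, 2.0]
poster_feat = [3.0, 4.0]

# Hypothetical per-genre probabilities (three genres) from two modalities.
audio_probs = [0.75, 0.25, 0.5]
plot_probs = [0.25, 0.25, 1.0]

concatenated = concat_fusion([audio_feat, poster_feat])
summed = sum_fusion([audio_feat, poster_feat])
labels = decision_fusion([audio_probs, plot_probs])
```

Concatenation keeps each modality's features separate at the cost of a larger fused vector, while element-wise sum keeps the dimension fixed but requires the modalities to be projected to a common size first. Decision-level fusion combines only the per-modality outputs, so each modality can be trained independently.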
Pages: 36823 - 36840
Page count: 18
Related papers
50 in total
  • [1] Multi-label movie genre classification based on multimodal fusion
    Zihui Cai
    Hongwei Ding
    Jinlu Wu
    Ying Xi
    Xuemeng Wu
    Xiaohui Cui
    Multimedia Tools and Applications, 2024, 83 : 36823 - 36840
  • [2] A multimodal approach for multi-label movie genre classification
    Mangolin, Rafael B.
    Pereira, Rodolfo M.
    Britto, Alceu S., Jr.
    Silla, Carlos N., Jr.
    Feltrim, Valeria D.
    Bertolini, Diego
    Costa, Yandre M. G.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (14) : 19071 - 19096
  • [3] A multimodal approach for multi-label movie genre classification
    Rafael B. Mangolin
    Rodolfo M. Pereira
    Alceu S. Britto
    Carlos N. Silla
    Valéria D. Feltrim
    Diego Bertolini
    Yandre M. G. Costa
    Multimedia Tools and Applications, 2022, 81 : 19071 - 19096
  • [4] Evaluating multimodal strategies for multi-label movie genre classification
    Paulino, Marco Aurelio D.
    Costa, Yandre M. G.
    Feltrim, Valeria D.
    2022 29TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2022,
  • [5] Movie genre classification: A multi-label approach based on convolutions through time
    Wehrmann, Jonatas
    Barros, Rodrigo C.
    APPLIED SOFT COMPUTING, 2017, 61 : 973 - 982
  • [6] Multi-label multi-modal classification of movie scenes
    Soykok, Irmak Turkoz
    Guvenir, H. Altay
    KNOWLEDGE-BASED SYSTEMS, 2025, 318
  • [7] A Multi-label and Adaptive Genre Classification of Web Pages
    Jebari, Chaker
    Wani, M. Arif
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 578 - 581
  • [8] Multi-label Movie Genre Detection from a Movie Poster Using Knowledge Transfer Learning
    Kaushil Kundalia
    Yash Patel
    Manan Shah
    Augmented Human Research, 2020, 5 (1)
  • [9] Shot-Based Hybrid Fusion for Movie Genre Classification
    Bi, Tianyu
    Jarnikov, Dimitri
    Lukkien, Johan
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT I, 2022, 13231 : 257 - 269
  • [10] A Combination based on OWA Operators for Multi-label Genre Classification of web pages
    Jebari, Chaker
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2015, (54): : 13 - 20