Multi-label movie genre classification based on multimodal fusion

被引:9
|
作者
Cai, Zihui [1 ]
Ding, Hongwei [1 ]
Wu, Jinlu [1 ]
Xi, Ying [1 ]
Wu, Xuemeng [1 ]
Cui, Xiaohui [1 ]
机构
[1] Wuhan Univ, Minist Educ, Key Lab Aerosp Informat Secur & Trusted Comp, Sch Cyber Sci & Engn, Wuhan, Peoples R China
关键词
Multi-label; Movie genre classification; Multimodal fusion; Deep learning; RECOGNITION; NETWORK;
D O I
10.1007/s11042-023-16121-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Determining the genre of a movie based on its relevant information is a challenging multi-label classification task. Previous studies tended to classify movies based on only one or two modalities, ignoring some valuable modalities. Considering this, we propose a multimodal movie genre classification framework which comprehensively considers the data from different modalities including the audio, poster, plot and frame sequences from video. To be specific, it processes the data from various modalities with the help of deep learning technologies, and fuses them in the way of decision-level fusion and intermediate fusion including concatenation and element-wise sum, which can improve the classification performance due to making full use of the information complementarity between multiple modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% improvement in AU(PRC) and 5.3% improvement in AU(PRC)(w). It can be seen that the performance of movie genre classification can be effectively improved by means of multimodal fusion.
引用
收藏
页码:36823 / 36840
页数:18
相关论文
共 50 条
  • [41] A multi-label classification algorithm based on random walk model
    Zheng W.
    Wang C.-K.
    Liu Z.
    Wang J.-M.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (08): : 1418 - 1426
  • [42] Multi-Label Arabic Text Classification Based On Deep Learning
    Alsukhni, Batool
    2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 475 - 477
  • [43] Multi-label images classification based on convolutional neural network
    Chen M.-S.
    Yu L.-L.
    Su Y.
    Sang A.-J.
    Zhao Y.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2020, 50 (03): : 1077 - 1084
  • [44] Multi-instance multi-label image classification: A neural approach
    Chen, Zenghai
    Chi, Zheru
    Fu, Hong
    Feng, Dagan
    NEUROCOMPUTING, 2013, 99 : 298 - 306
  • [45] Multi-label classification algorithm research based on swarm intelligence
    Qinghua Wu
    Hanmin Liu
    Xuesong Yan
    Cluster Computing, 2016, 19 : 2075 - 2085
  • [46] Multi-label emotion classification based on adversarial multi-task learning
    Lin, Nankai
    Fu, Sihui
    Lin, Xiaotian
    Wang, Lianxi
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (06)
  • [47] An ensemble-based approach for multi-view multi-label classification
    Gibaja E.L.
    Moyano J.M.
    Ventura S.
    Ventura, Sebastián (sventura@uco.es), 2016, Springer Verlag (05) : 251 - 259
  • [48] Multi-label category enhancement fusion distillation based on variational estimation
    Li, Li
    Xu, Jingzhou
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [49] TRANSFORMER-BASED MULTI-MODAL LEARNING FOR MULTI-LABEL REMOTE SENSING IMAGE CLASSIFICATION
    Hoffmann, David Sebastian
    Clasen, Kai Norman
    Demir, Begum
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 4891 - 4894
  • [50] Multi-Label Active Learning with Label Correlation for Image Classification
    Ye, Chen
    Wu, Jian
    Sheng, Victor S.
    Zhao, Pengpeng
    Cui, Zhiming
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3437 - 3441