Multi-label movie genre classification based on multimodal fusion

被引:0
作者
Zihui Cai
Hongwei Ding
Jinlu Wu
Ying Xi
Xuemeng Wu
Xiaohui Cui
机构
[1] Wuhan University,Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Multi-label; Movie genre classification; Multimodal fusion; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Determining the genre of a movie based on its relevant information is a challenging multi-label classification task. Previous studies tended to classify movies based on only one or two modalities, ignoring some valuable modalities. Considering this, we propose a multimodal movie genre classification framework which comprehensively considers the data from different modalities including the audio, poster, plot and frame sequences from video. To be specific, it processes the data from various modalities with the help of deep learning technologies, and fuses them in the way of decision-level fusion and intermediate fusion including concatenation and element-wise sum, which can improve the classification performance due to making full use of the information complementarity between multiple modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% improvement in AU(PRC) and 5.3% improvement in AU(PRC)w. It can be seen that the performance of movie genre classification can be effectively improved by means of multimodal fusion.
引用
收藏
页码:36823 / 36840
页数:17
相关论文
共 50 条
  • [31] Multi-label text classification model based on semantic embedding
    Yan Danfeng
    Ke Nan
    Gu Chao
    Cui Jianfei
    Ding Yiqi
    The Journal of China Universities of Posts and Telecommunications, 2019, 26 (01) : 95 - 104
  • [32] OCBMLC: An Overlapping Clustering Based Multi-Label Classification Algorithm
    Peng, Liwen
    Liu, Yongguo
    Liao, Huan
    Zhang, Peng
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2017), 2017, : 113 - 116
  • [33] Multi-label classification algorithm research based on swarm intelligence
    Wu, Qinghua
    Liu, Hanmin
    Yan, Xuesong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 2075 - 2085
  • [34] Machine Learning Based Embedded Code Multi-Label Classification
    Zhou, Yu
    Cui, Suxia
    Wang, Yonghui
    IEEE ACCESS, 2021, 9 : 150187 - 150200
  • [35] A multi-label classification algorithm based on random walk model
    Zheng W.
    Wang C.-K.
    Liu Z.
    Wang J.-M.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (08): : 1418 - 1426
  • [36] Multi-Label Arabic Text Classification Based On Deep Learning
    Alsukhni, Batool
    2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 475 - 477
  • [37] Multi-label images classification based on convolutional neural network
    Chen M.-S.
    Yu L.-L.
    Su Y.
    Sang A.-J.
    Zhao Y.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2020, 50 (03): : 1077 - 1084
  • [38] Multi-label classification algorithm research based on swarm intelligence
    Qinghua Wu
    Hanmin Liu
    Xuesong Yan
    Cluster Computing, 2016, 19 : 2075 - 2085
  • [39] An ensemble-based approach for multi-view multi-label classification
    Gibaja E.L.
    Moyano J.M.
    Ventura S.
    Ventura, Sebastián (sventura@uco.es), 2016, Springer Verlag (05) : 251 - 259
  • [40] Multi-label category enhancement fusion distillation based on variational estimation
    Li, Li
    Xu, Jingzhou
    KNOWLEDGE-BASED SYSTEMS, 2024, 300