Multi-label movie genre classification based on multimodal fusion

被引:9
|
作者
Cai, Zihui [1 ]
Ding, Hongwei [1 ]
Wu, Jinlu [1 ]
Xi, Ying [1 ]
Wu, Xuemeng [1 ]
Cui, Xiaohui [1 ]
机构
[1] Wuhan Univ, Minist Educ, Key Lab Aerosp Informat Secur & Trusted Comp, Sch Cyber Sci & Engn, Wuhan, Peoples R China
关键词
Multi-label; Movie genre classification; Multimodal fusion; Deep learning; RECOGNITION; NETWORK;
D O I
10.1007/s11042-023-16121-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Determining the genre of a movie based on its relevant information is a challenging multi-label classification task. Previous studies tended to classify movies based on only one or two modalities, ignoring some valuable modalities. Considering this, we propose a multimodal movie genre classification framework which comprehensively considers the data from different modalities including the audio, poster, plot and frame sequences from video. To be specific, it processes the data from various modalities with the help of deep learning technologies, and fuses them in the way of decision-level fusion and intermediate fusion including concatenation and element-wise sum, which can improve the classification performance due to making full use of the information complementarity between multiple modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% improvement in AU(PRC) and 5.3% improvement in AU(PRC)(w). It can be seen that the performance of movie genre classification can be effectively improved by means of multimodal fusion.
引用
收藏
页码:36823 / 36840
页数:18
相关论文
共 50 条
  • [31] Multi-Label Retinal Disease Classification Using Transformers
    Rodriguez, Manuel Alejandro
    AlMarzouqi, Hasan
    Liatsis, Panos
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (06) : 2739 - 2750
  • [32] Independent Feature and Label Components for Multi-label Classification
    Zhong, Yongjian
    Xu, Chang
    Du, Bo
    Zhang, Lefei
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 827 - 836
  • [33] Application of Label Correlation in Multi-Label Classification: A Survey
    Huang, Shan
    Hu, Wenlong
    Lu, Bin
    Fan, Qiang
    Xu, Xinyao
    Zhou, Xiaolei
    Yan, Hao
    APPLIED SCIENCES-BASEL, 2024, 14 (19):
  • [34] Scalable Label Distribution Learning for Multi-Label Classification
    Zhao, Xingyu
    An, Yuexuan
    Qi, Lei
    Geng, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [35] Deep Learning with a Rethinking Structure for Multi-label Classification
    Yang, Yao-Yuan
    Lin, Yi-An
    Chu, Hong-Min
    Lin, Hsuan-Tien
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 125 - 140
  • [36] Multi-Label Fundus Image Classification Using Attention Mechanisms and Feature Fusion
    Li, Zhenwei
    Xu, Mengying
    Yang, Xiaoli
    Han, Yanqi
    MICROMACHINES, 2022, 13 (06)
  • [37] Multi-label text classification model based on semantic embedding
    Yan Danfeng
    Ke Nan
    Gu Chao
    Cui Jianfei
    Ding Yiqi
    The Journal of China Universities of Posts and Telecommunications, 2019, 26 (01) : 95 - 104
  • [38] OCBMLC: An Overlapping Clustering Based Multi-Label Classification Algorithm
    Peng, Liwen
    Liu, Yongguo
    Liao, Huan
    Zhang, Peng
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2017), 2017, : 113 - 116
  • [39] Multi-label classification algorithm research based on swarm intelligence
    Wu, Qinghua
    Liu, Hanmin
    Yan, Xuesong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 2075 - 2085
  • [40] Machine Learning Based Embedded Code Multi-Label Classification
    Zhou, Yu
    Cui, Suxia
    Wang, Yonghui
    IEEE ACCESS, 2021, 9 : 150187 - 150200