Multi-label movie genre classification based on multimodal fusion

被引:0
作者
Zihui Cai
Hongwei Ding
Jinlu Wu
Ying Xi
Xuemeng Wu
Xiaohui Cui
机构
[1] Wuhan University,Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Multi-label; Movie genre classification; Multimodal fusion; Deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
Determining the genre of a movie based on its relevant information is a challenging multi-label classification task. Previous studies tended to classify movies based on only one or two modalities, ignoring some valuable modalities. Considering this, we propose a multimodal movie genre classification framework which comprehensively considers the data from different modalities including the audio, poster, plot and frame sequences from video. To be specific, it processes the data from various modalities with the help of deep learning technologies, and fuses them in the way of decision-level fusion and intermediate fusion including concatenation and element-wise sum, which can improve the classification performance due to making full use of the information complementarity between multiple modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% improvement in AU(PRC) and 5.3% improvement in AU(PRC)w. It can be seen that the performance of movie genre classification can be effectively improved by means of multimodal fusion.
引用
收藏
页码:36823 / 36840
页数:17
相关论文
共 50 条
  • [41] Micro-video multi-label classification method based on multi-modal feature encoding
    Jing P.
    Li Y.
    Su Y.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (04): : 109 - 117
  • [42] TRANSFORMER-BASED MULTI-MODAL LEARNING FOR MULTI-LABEL REMOTE SENSING IMAGE CLASSIFICATION
    Hoffmann, David Sebastian
    Clasen, Kai Norman
    Demir, Begum
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 4891 - 4894
  • [43] Exploiting Label Dependency and Feature Similarity for Multi-Label Classification
    Nedungadi, Prema
    Haripriya, H.
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 2196 - 2200
  • [44] Label-Aware Recurrent Reading for Multi-Label Classification
    Ming, Shenglan
    Liu, Huajun
    Luo, Ziming
    Huang, Peng
    Li, Mark Junjie
    2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 498 - 504
  • [45] Multi-Label Active Learning with Label Correlation for Image Classification
    Ye, Chen
    Wu, Jian
    Sheng, Victor S.
    Zhao, Pengpeng
    Cui, Zhiming
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3437 - 3441
  • [46] Feature Selection for Multi-label Classification Problems
    Doquire, Gauthier
    Verleysen, Michel
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2011, PT I, 2011, 6691 : 9 - 16
  • [47] Set Labelling using Multi-label Classification
    Sanjaya, Ngurah Agus E. R.
    Read, Jesse
    Abdessalem, Talel
    Bressan, Stephane
    IIWAS2018: THE 20TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2014, : 216 - 220
  • [48] Multi-label emotion classification of Urdu tweets
    Ashraf, Noman
    Khan, Lal
    Butt, Sabur
    Chang, Hsien-Tsung
    Sidorov, Grigori
    Gelbukh, Alexander
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [49] MULTI-LABEL TEXT CLASSIFICATION WITH A ROBUST LABEL DEPENDENT REPRESENTATION
    Alfaro, Rodrigo
    Allende, Hector
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 211 - 214
  • [50] Causal multi-label learning for image classification
    Tian, Yingjie
    Bai, Kunlong
    Yu, Xiaotong
    Zhu, Siyu
    NEURAL NETWORKS, 2023, 167 : 626 - 637