Multi-label movie genre classification based on multimodal fusion

Cited by: 9
Authors
Cai, Zihui [1]
Ding, Hongwei [1]
Wu, Jinlu [1]
Xi, Ying [1]
Wu, Xuemeng [1]
Cui, Xiaohui [1]
Affiliations
[1] Wuhan Univ, Minist Educ, Key Lab Aerosp Informat Secur & Trusted Comp, Sch Cyber Sci & Engn, Wuhan, Peoples R China
Keywords
Multi-label; Movie genre classification; Multimodal fusion; Deep learning; RECOGNITION; NETWORK;
DOI
10.1007/s11042-023-16121-2
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Determining a movie's genres from its associated information is a challenging multi-label classification task. Previous studies tended to classify movies using only one or two modalities, ignoring other valuable ones. We therefore propose a multimodal movie genre classification framework that jointly considers data from several modalities: audio, poster, plot, and frame sequences from video. Specifically, it processes each modality with deep learning techniques and fuses them through decision-level fusion and intermediate fusion (concatenation and element-wise sum), which improves classification performance by exploiting the complementary information across modalities. We train and evaluate the proposed framework on the LMTD-9 dataset. The results show that our best multimodal model outperforms state-of-the-art methods by 8.6% in AU(PRC) and 5.3% in AU(PRC)_w, indicating that multimodal fusion can effectively improve movie genre classification.
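The three fusion strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy feature values, the 0.5 threshold, and the use of simple averaging for decision-level fusion are all assumptions made here for illustration, since the abstract does not specify how per-modality decisions are combined.

```python
from typing import List, Sequence

Vector = List[float]


def concat_fusion(features: Sequence[Vector]) -> Vector:
    """Intermediate fusion: join per-modality feature vectors end to end."""
    fused: Vector = []
    for f in features:
        fused.extend(f)
    return fused


def sum_fusion(features: Sequence[Vector]) -> Vector:
    """Intermediate fusion: element-wise sum of equal-length feature vectors."""
    length = len(features[0])
    if any(len(f) != length for f in features):
        raise ValueError("element-wise sum needs equal-length feature vectors")
    return [sum(values) for values in zip(*features)]


def decision_fusion(probabilities: Sequence[Vector]) -> Vector:
    """Decision-level fusion (assumed here: mean of per-modality genre probabilities)."""
    n = len(probabilities)
    return [sum(values) / n for values in zip(*probabilities)]


def predict_genres(probs: Vector, genres: Sequence[str], threshold: float = 0.5) -> List[str]:
    """Multi-label decision: keep every genre whose probability reaches the threshold."""
    return [g for g, p in zip(genres, probs) if p >= threshold]


# Hypothetical toy features for two modalities (values are made up).
poster_features = [0.5, 1.0]
plot_features = [1.5, 2.0]
print(concat_fusion([poster_features, plot_features]))
print(sum_fusion([poster_features, plot_features]))

# Hypothetical per-genre probabilities from two single-modality classifiers.
genres = ["action", "comedy", "drama"]
audio_probs = [0.75, 0.25, 0.5]
plot_probs = [0.25, 0.25, 1.0]
fused_probs = decision_fusion([audio_probs, plot_probs])
print(predict_genres(fused_probs, genres))
```

Concatenation keeps every modality's features but grows the input size, whereas element-wise sum requires the modalities to be projected to a common dimension first; decision-level fusion leaves each modality's classifier independent and merges only the outputs.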
Pages: 36823-36840
Number of pages: 18
Related papers
50 in total
  • [21] Multi-label classification of technical articles based on deep neural network
    Zhao, Qiuhan
    Yang, Wenchuan
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8391 - 8397
  • [22] Mineral Identification Based on Multi-Label Image Classification
    Wu, Baokun
    Ji, Xiaohui
    He, Mingyue
    Yang, Mei
    Zhang, Zhaochong
    Chen, Yan
    Wang, Yuzhu
    Zheng, Xinqi
    MINERALS, 2022, 12 (11)
  • [23] Plant Recommender System Based on Multi-label Classification
    Tharwat, Alaa
    Mahdi, Hani
    Hassanien, Aboul Ella
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 825 - 835
  • [24] Movie tag prediction: An extreme multi-label multi-modal transformer-based solution with explanation
    Guarascio, Massimo
    Minici, Marco
    Pisani, Francesco Sergio
    De Francesco, Erika
    Lambardi, Pasquale
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (04) : 1021 - 1043
  • [25] Multi-label Text Classification Method Based on Label Semantic Information
    Xiao L.
    Chen B.-L.
    Huang X.
    Liu H.-F.
    Jing L.-P.
    Yu J.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04) : 1079 - 1089
  • [26] Multi-module Fusion Relevance Attention Network for Multi-label Text Classification
    Yu, Xinmiao
    Li, Zhengpeng
    Wu, Jiansheng
    Liu, Mingao
    ENGINEERING LETTERS, 2022, 30 (04)
  • [27] MMPosE: Movie-Induced Multi-Label Positive Emotion Classification Through EEG Signals
    Du, Xiaobing
    Deng, Xiaoming
    Qin, Hangyu
    Shu, Yezhi
    Liu, Fang
    Zhao, Guozhen
    Lai, Yu-Kun
    Ma, Cuixia
    Liu, Yong-Jin
    Wang, Hongan
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 2925 - 2938
  • [28] Multi-label Classification based on Association Rules with Application to Scene Classification
    Li, Bo
    Li, Hong
    Wu, Min
    Li, Ping
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 36 - 41
  • [29] Multi-Label Classification for Power Quality Disturbances by Integrated Deep Learning
    Xiao, Xiangui
    Li, Kaicheng
    IEEE ACCESS, 2021, 9 : 152250 - 152260
  • [30] Multi-Label Classification With Hyperdimensional Representations
    Chandrasekaran, Rishikanth
    Asgareinjad, Fatemeh
    Morris, Justin
    Rosing, Tajana
    IEEE ACCESS, 2023, 11 : 108458 - 108474