A unified framework of deep networks for genre classification using movie trailer

Cited by: 35
Authors
Yadav, Ashima [1]
Vishwakarma, Dinesh Kumar [1]
Affiliations
[1] Delhi Technol Univ, Dept Informat Technol, Biometr Res Lab, Delhi, India
Keywords
Affective computing; Deep learning; Emotions; Inception; Sentiments; Video classification; RECOMMENDATION; PREDICTION; VIDEO; MODEL;
DOI
10.1016/j.asoc.2020.106624
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Affective video content analysis has emerged as one of the most challenging and essential research tasks, as it aims to automatically analyze the emotions elicited by videos. However, little progress has been achieved in this field due to the enigmatic nature of emotions, which widens the gap between the human affective state and the structure of the video. In this paper, we propose a novel deep affect-based movie trailer classification framework. We also develop the EmoGDB dataset, which contains 100 Bollywood movie trailers annotated with six popular movie genres (Action, Comedy, Drama, Horror, Romance, Thriller) and six types of induced emotions (Anger, Fear, Happy, Neutral, Sad, Surprise). The affect-based features are learned via the ILDNet architecture trained on the EmoGDB dataset. Our work aims to analyze the relationship between the emotions elicited by movie trailers and how they contribute to solving the multi-label genre classification problem. The proposed framework is validated by cross-dataset testing on three large-scale datasets: LMTD-9, MMTF-14K, and ML-25M. Extensive experiments show that the proposed algorithm significantly outperforms all the state-of-the-art methods, as reported by precision, recall, F1 score, precision-recall curves (PRC), and area under the PRC. © 2020 Elsevier B.V. All rights reserved.
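The abstract reports precision, recall, and F1 score for a multi-label genre problem. As a minimal sketch of how such scores can be computed, the Python example below micro-averages them over all trailer/genre pairs. The function name `micro_prf`, the choice of micro-averaging, and the toy labels are assumptions for illustration only; the record does not say which averaging scheme the authors used, and the data are not taken from the paper.

```python
from typing import List, Tuple

# Genre order follows the six genres named in the abstract.
GENRES = ["Action", "Comedy", "Drama", "Horror", "Romance", "Thriller"]


def micro_prf(y_true: List[List[int]], y_pred: List[List[int]]) -> Tuple[float, float, float]:
    """Micro-averaged precision, recall and F1 for binary multi-label rows.

    Hypothetical helper: each row is one trailer, each column one genre,
    with 1 meaning the genre is present and 0 meaning it is absent.
    """
    tp = fp = fn = 0
    for true_row, pred_row in zip(y_true, y_pred):
        for t, p in zip(true_row, pred_row):
            if t == 1 and p == 1:
                tp += 1  # genre correctly predicted
            elif t == 0 and p == 1:
                fp += 1  # genre predicted but not annotated
            elif t == 1 and p == 0:
                fn += 1  # annotated genre that was missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy labels (made up for illustration, not from EmoGDB or any real dataset).
y_true = [
    [1, 0, 0, 0, 0, 1],  # Action + Thriller
    [0, 1, 0, 0, 1, 0],  # Comedy + Romance
    [0, 0, 1, 0, 0, 0],  # Drama
]
y_pred = [
    [1, 0, 0, 0, 0, 0],  # Thriller missed
    [0, 1, 0, 0, 1, 0],  # both genres correct
    [0, 0, 1, 1, 0, 0],  # spurious Horror
]

precision, recall, f1 = micro_prf(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Micro-averaging pools every trailer/genre decision before computing the ratios, so frequent genres weigh more; macro-averaging (scoring each genre separately, then averaging) would treat all genres equally and may suit imbalanced genre distributions better.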
Pages: 14
Related papers
57 in total
[1]  
[Anonymous], 2015, Depth-gated recurrent neural networks
[2]  
[Anonymous], 2000, P IEEE INNS ENNS INT
[3]  
[Anonymous], 2014, P 2014 C EMP METH NA, DOI DOI 10.3115/V1/D14-1179
[4]  
Bansal S, 2016, INT CONF CONTEMP, P172
[5]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[6]  
Cambria Erik, 2012, Cognitive Behavioural Systems (COST 2012). International Training School. Revised Selected Papers, P144, DOI 10.1007/978-3-642-34584-5_11
[7]   Affective Recommendation of Movies Based on Selected Connotative Features [J].
Canini, Luca ;
Benini, Sergio ;
Leonardi, Riccardo .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (04) :636-647
[8]   Movie scene segmentation using background information [J].
Chen, Liang-Hua ;
Lai, Yu-Chun ;
Liao, Hong-Yuan Mark .
PATTERN RECOGNITION, 2008, 41 (03) :1056-1065
[9]  
Choros K., 2018, INT C COMP COLL INT
[10]   MMTF-14K: A Multifaceted Movie Trailer Feature Dataset for Recommendation and Retrieval [J].
Deldjoo, Yashar ;
Constantin, Mihai Gabriel ;
Ionescu, Bogdan ;
Schedl, Markus ;
Cremonesi, Paolo .
PROCEEDINGS OF THE 9TH ACM MULTIMEDIA SYSTEMS CONFERENCE (MMSYS'18), 2018, :450-455