Multi-label Emotion Classification in Music Videos Using Ensembles of Audio and Video Features

被引:7
作者
Kostiuk, Bruno [1 ]
Costa, Yandre M. G. [1 ,2 ]
Britto Jr, Alceu S. [1 ]
Hu, Xiao [3 ]
Silla Jr, Carlos N. [1 ]
机构
[1] Pontificia Univ Catolica Parana PUCPR, PPGIa, Curitiba, Parana, Brazil
[2] State Univ Maringa UEM, Dept Informat, Maringa, Parana, Brazil
[3] Univ Hong Kong, Div Informat & Technol Studies, Pok Fu Lam, Hong Kong, Peoples R China
来源
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019) | 2019年
关键词
Multi-label Classification; Multimodal Classification; Video Classification; Emotion Classification; Audio and Video Combination;
D O I
10.1109/ICTAI.2019.00078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video as well as music are potent means to convey emotions. However, despite their importance in several applications, few works deal with the issue of emotion classification in videos. The main reason is possibly the lack of available databases. In this work we extend the CAL500 database by including music videos, since the CAL500 was originally proposed as an audio-only database. The main rationale here is that the music videos must be official as they were developed to convey the same emotion as the song. After adapting the database, we have extracted audio and video features to perform our computational experiments. Our main result is that there is a complementarity between the audio and video features as the best result was achieved using their combination.
引用
收藏
页码:517 / 523
页数:7
相关论文
共 23 条
[1]  
Ang Li-Minn, 2016, IEEE T AFFECTIVE COM
[2]  
[Anonymous], 2017, IEEE T AFFECTIVE COM
[3]  
[Anonymous], 2019, YAAF AUD FEAT EXTR Y
[4]  
[Anonymous], 2008, P ECML PKDD 2008 WOR
[5]  
[Anonymous], 2017, IEEE T AFFECTIVE COM
[6]  
Chen OTC, 2012, INT CONF AWARE SCI, P104
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]   Multilabel classification via calibrated label ranking [J].
Fuernkranz, Johannes ;
Huellermeier, Eyke ;
Mencia, Eneldo Loza ;
Brinker, Klaus .
MACHINE LEARNING, 2008, 73 (02) :133-153
[9]  
Hatamikia S, 2014, 2014 21th Iranian Conference on Biomedical Engineering (ICBME), P333, DOI 10.1109/ICBME.2014.7043946
[10]  
Kim Y.E., 2010, 11 INTERNAT SOC MUSI, P937