Automatic music emotion classification model for movie soundtrack subtitling based on neuroscientific premises

被引：7

作者：

Lucia-Mulas, Maria Jose ^{[1
]}

Revuelta-Sanz, Pablo ^{[2
]}

Ruiz-Mezcua, Belen ^{[1
]}

Gonzalez-Carrasco, Israel ^{[1
]}

机构：

[1] Univ Carlos III Madrid, Comp Sci Dept, Av Univ 20, Madrid 28915, Spain

[2] Univ Oviedo, C Luis Ortiz Berrocal S-N, Gijon 33203, Spain

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 22期

关键词：

Music emotion recognition; Automatic subtitling; Convolutional neural network; RECOGNITION;

D O I：

10.1007/s10489-023-04967-w

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The ability of music to induce emotions has been arousing a lot of interest in recent years, especially due to the boom in music streaming platforms and the use of automatic music recommenders. Music Emotion Recognition approaches are based on combining multiple audio features extracted from digital audio samples and different machine learning techniques. In these approaches, neuroscience results on musical emotion perception are not considered. The main goal of this research is to facilitate the automatic subtitling of music. The authors approached the problem of automatic musical emotion detection in movie soundtracks considering these characteristics and using scientific musical databases, which have become a reference in neuroscience research. In the experiments, the Constant-Q-Transform spectrograms, the ones that best represent the relationships between musical tones from the point of view of human perception, are combined with Convolutional Neural Networks. Results show an efficient emotion classification model for 2-second musical audio fragments representative of intense basic feelings of happiness, sadness, and fear. Those emotions are the most interesting to be identified in the case of movie music captioning. The quality metrics have demonstrated that the results of the different models differ significantly and show no homogeneity. Finally, these results pave the way for an accessible and automatic captioning of music, which could automatically identify the emotional intent of the different segments of the movie soundtrack.

引用

页码：27096 / 27109

页数：14

共 40 条

[1]

AENOR, 2012, Norm UNE 153010. Subtitling for deaf and hearing-impaired persons

[2] Recognition of emotion in Japanese, Western, and Hindustani music by Japanese listeners [J].

Balkwill, LL ;

Thompson, WF ;

Matsunaga, R .

JAPANESE PSYCHOLOGICAL RESEARCH, 2004, 46 (04) :337-349

[3] A cross-cultural investigation of the perception of emotion in music: Psychophysical and cultural cues [J].

Balkwill, LL ;

Thompson, WF .

MUSIC PERCEPTION, 1999, 17 (01) :43-64

[4]

Bertin-Mahieux T., 2011, P INT SOC MUS INF RE, P591, DOI DOI 10.7916/D8NZ8J07

[5] Automatic Lecture Subtitle Generation and How It Helps [J].

Che, Xiaoyin ;

Luo, Sheng ;

Yang, Haojin ;

Meinel, Christoph .

2017 IEEE 17TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT), 2017, :34-38

[6]

Donnelly K.J., 2005, SPECTRE SOUND MUSIC

[7] Emotional expression in music: contribution, linearity, and additivity of primary musical cues [J].

Eerola, Tuomas ;

Friberg, Anders ;

Bresin, Roberto .

FRONTIERS IN PSYCHOLOGY, 2013, 4

[8] A comparison of the discrete and dimensional models of emotion in music [J].

Eerola, Tuomas ;

Vuoskoski, Jonna K. .

PSYCHOLOGY OF MUSIC, 2011, 39 (01) :18-49

[9] AN ARGUMENT FOR BASIC EMOTIONS [J].

EKMAN, P .

COGNITION & EMOTION, 1992, 6 (3-4) :169-200

[10]

Feng Y., 2003, Pro ACM Int. Conf. Information Retrieval, P375, DOI [DOI 10.1145/860435, DOI 10.1145/860500.860508]

← 1 2 3 4 →