Evaluation of the affective valence of speech using pitch substructure

被引:21
作者
Cook, ND [1 ]
Fujisawa, TX [1 ]
Takami, K [1 ]
机构
[1] Kansai Univ, Dept Informat, Osaka, Japan
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 01期
关键词
emotion; fundamental frequency; Gaussian clusters; harmony perception; intonation; prosody; speech;
D O I
10.1109/TSA.2005.854115
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In order to study the relationship between emotion and intonation, a new technique is introduced for the extraction of the dominant pitches within speech utterances and the quasi-musical analysis of the multipitch structure. After the distribution of fundamental frequencies over the entire utterance has been obtained, the underlying pitch structure is determined using an unsupervised "cluster" (Gaussian mixtures) algorithm. The technique normally results in 3-6 pitch clusters per utterance that can then be evaluated in terms of their inherent dissonance, harmonic "tension," and "major or minor modality." Stronger dissonance and tension were found in utterances with negative affect, relative to utterances with positive affect. Most importantly, utterances that were evaluated as having positive or negative affect had significantly different modality values. Factor analysis showed that the measures involving multiple pitches were distinct from other acoustical measures, indicating that the pitch substructure is an independent factor contributing to the affective valence of speech prosody.
引用
收藏
页码:142 / 151
页数:10
相关论文
共 40 条
[1]  
BECKMAN ME, 1986, PHONOL YB, V3, P310
[2]  
Brown S, 2000, ORIGINS OF MUSIC, P3
[3]  
Cook N., 2002, TONE VOICE MIND
[4]  
COOK ND, 2001, P 6 ANN M SOC MUS PE
[5]  
COOK ND, 1999, BIOL FDN MUSIC, P382
[6]  
COOK ND, 2002, P 8 INT C FUNCT MAPP
[7]  
COOK ND, 2004, P 8 ICMPC
[8]  
COOK ND, 2000, P 5 ANN M SOC MUS PE
[9]  
COOK ND, 2003, P 6 SMPC
[10]  
DEUTSCH Diana, 2013, The Psychology of Music, V3