Emotional speech classification with prosodic prameters by using neural networks

被引:0
作者
Sato, H [1 ]
Mitsukura, Y [1 ]
Fukumi, M [1 ]
Akamatsu, N [1 ]
机构
[1] Univ Tokushima, Fac Engn, Tokushima 7708506, Japan
来源
ANZIIS 2001: PROCEEDINGS OF THE SEVENTH AUSTRALIAN AND NEW ZEALAND INTELLIGENT INFORMATION SYSTEMS CONFERENCE | 2001年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interestingly, in order to achieve a new Human Interface such that digital computers can deal with the KASEI information, the study of the KANSEI information processing recently has been approached. In this paper, we propose a new classification method of emotional speech by analyzing feature parameters obtained from the emotional speech and by learning them using neural networks, which is regarded as a KANSEI information processing. In the present research, KANSEI information is usually human emotion. The emotion is classified broadly into four patterns such as neutral, anger, sad and joy. The pitch as one of feature parameters governs voice modulation, and can be sensitive to change of emotion. The pitch is extracted from each emotional speech by the cepstrum method. Input values of neural networks (NNs) are then emotional pitch patterns, which are time-varying. It is shown that NNs can achieve classification of emotion by learning each emotional pitch pattern by means of computer simulations.
引用
收藏
页码:395 / 398
页数:4
相关论文
共 9 条
[1]  
COWIE R, 1996, P ICSLP, V3
[2]  
KINJO Y, P IEE IEICE JAP
[3]  
KUNIEDA N, 1997, J I ELECT ENG JPN, P435
[4]  
MORIYAMA T, 1999, EVALUATION RELATION, P703
[5]  
NAGAO M, 2000, INFORMATION PROCESSI
[6]  
NAKAGAWA S, 1994, SPEECH AUDITORY SENS
[7]  
PEREIRA C, 1998, P ICSLP, P3
[8]  
SAITO S, 1985, FUNDAMENTALS SPEECH
[9]  
TSUJI S, 1997, SCI KANSEI