Comparison of Formant Features of Male and Female Emotional Speech in Czech and Slovak

被引:10
作者
Pribil, J. [1 ,2 ]
Pribilova, A. [3 ]
Matousek, J. [1 ]
机构
[1] Univ W Bohemia, Fac Sci Appl, Dept Cybernet, CZ-30614 Plzen, Czech Republic
[2] Slovak Acad Sci, Inst Measurement Sci, SK-84104 Bratislava, Slovakia
[3] Slovak Univ Technol Bratislava, FEI, Inst Elect & Photon, SK-81219 Bratislava, Slovakia
关键词
Speech processing; spectral analysis; speech analysis; emotion recognition; RECOGNITION;
D O I
10.5755/j01.eee.19.8.1739
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The paper describes analysis and comparison of formant features comprising the first three formant positions together with their 3 -dB bandwidths and the formant tilts. These features were determined from the smoothed spectral envelopes or directly calculated from the complex roots of the LPC polynomial. Subsequently, statistical analysis and comparison of the formant features from emotional speech representing joy, sadness, anger, and a neutral state was performed. In this experiment we use the speech material in the form of sentences uttered by male and female professional speakers in Czech and Slovak languages. For detailed analysis, the derived speech database consisting of manually selected sounds corresponding to the stationary parts of five vowels and two nasals was created. The determined formant positions and their value ranges are in correspondence with the general knowledge for male and female voices. Obtained statistical results and values of parameter ratios will be used for emotional speech conversion or they can also be applied for extension of the text-to-speech system enabling expressive speech production.
引用
收藏
页码:83 / 88
页数:6
相关论文
共 18 条
[1]  
[Anonymous], P IEEE REG 8 EUROCON
[2]  
Atassi H, 2010, LECT NOTES COMPUT SC, V5967, P255
[3]  
Boersma P., Praat: Doing Phonetics by Computer
[4]  
Burkhardt F., 2005, INTERSPEECH, V5, P1517, DOI DOI 10.21437/INTERSPEECH.2005-446
[5]  
Ceidaite G, 2010, ELEKTRON ELEKTROTECH, P69
[6]   Time-Scale Feature Extractions for Emotional Speech Characterization [J].
Chetouani, Mohamed ;
Mahdhaoui, Ammar ;
Ringeval, Fabien .
COGNITIVE COMPUTATION, 2009, 1 (02) :194-201
[7]  
Fant G., 2004, SPEECH ACOUSTICS AND
[8]  
Fant G., 1997, ACOUSTICAL ANALYSIS, P1589
[9]  
Gruber M, 2012, LECT NOTES COMPUT SC, V7499, P656, DOI 10.1007/978-3-642-32790-2_80
[10]   Effects of tonsillectomy on speech spectrum [J].
Ilk, HG ;
Erogul, O ;
Satar, B ;
Özkaptan, Y .
JOURNAL OF VOICE, 2002, 16 (04) :580-586