Cepstral analysis of vocal dysperiodicities in disordered connected speech

被引:0
作者
Alpan, A. [1 ]
Schoentgen, J. [1 ,2 ]
Maryn, Y. [3 ]
Grenez, F. [1 ]
Murphy, P. [4 ]
机构
[1] Univ Libre Bruxelles, Lab Images Signals & Telecommun Devices, Brussels, Belgium
[2] Natl Fund Sci Res, Liege, Belgium
[3] St Jan Gen Hosp, Dept Speech Language Pathol & Audiol, Dept Otorhinolaryngol & Head & Neck Surg, Brugge, Belgium
[4] Univ Limburg, Dept Elect & Comp Engn, Limerick, Ireland
来源
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年
关键词
Voice analysis; cepstrum; first rahmonic; correlation analysis; connected disordered speech; DYSPHONIA; SIGNALS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several studies have shown that the amplitude of the first rahmonic peak (RI) in the cepstrum is an indicator of hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier Transform of the log-magnitude spectrum. In the present study, a number of spectral analysis processing steps are implemented, including period-synchronous and period-asynchronous analysis, as well as harmonic-synchronous and harmonic-asynchronous spectral band-limitation prior to computing the cepstrum. The analysis is applied to connected speech signals. The correlation between amplitude RI and perceptual ratings is examined for a corpus comprising 28 normophonic and 223 dysphonic speakers. One observes that the correlation between RI and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a popular cepstral cue which is the cepstral peak prominence (CPP).
引用
收藏
页码:948 / +
页数:2
相关论文
共 10 条