Human hearing modelling real-time spectrography for visual feedback in singing training

被引:3
作者
Howard, DM [1 ]
机构
[1] Univ York, Dept Elect, Media Engn Res Grp, York YO10 5DD, N Yorkshire, England
关键词
singing; hearing modelling spectrography; visual feedback; singing analysis;
D O I
10.1159/000087085
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
The traditional form of spectrography employed for speech analysis makes use of fixed bandwidth bandpass filters, which are usually set either to wide-band or narrow-band to optimise the time and frequency resolution of the resulting spectrogram respectively. The acoustic analysis carried out by the human ear can be modelled as a bank of bandpass filters whose bandwidth changes as a function of centre frequency, and therefore the time and frequency resolution varies with frequency. This paper describes a spectrographic analysis technique that is based on peripheral human hearing and considers its potential for application within a real-time visual display for singing training by comparing its output with traditional wide- and narrow-band spectrograms. The potential advantages and disadvantages of hearing modelling spectrography for this application are illustrated and discussed for a selection of sung material, some of which has been recorded during singing lessons where traditional spectrography is being employed.
引用
收藏
页码:328 / 341
页数:14
相关论文
共 29 条
[1]  
[Anonymous], 1966, VISIBLE SPEECH
[2]  
[Anonymous], 1999, AUSTR VOICE
[3]  
[Anonymous], 2000, Speech Processing and Synthesis Toolboxes
[4]  
Baken R.J., 1987, CLIN MEASUREMENT SPE
[5]  
Baken R.J., 1991, READINGS CLIN SPECTR
[6]  
Brookes T, 2000, Logoped Phoniatr Vocol, V25, P72
[7]  
Fry D.B., 1979, The Physics of Speech
[8]  
Howard D.M., 1997, ORGAN SOUND, V2, P65
[9]  
Howard D.M., 1995, FORENSIC LINGUIST, V2, P28
[10]  
Howard D. M., 1997, LOGOP PHONIATR VOCO, V22, P169