A computationally efficient cochlear filter bank for perceptual audio coding

被引:0
作者
Baumgarte, F [1 ]
机构
[1] Lucent Techno, Bell Labs, Multimedia Commun Res Lab, Murray Hill, NJ 07974 USA
来源
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM | 2001年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many applications in auditory modeling require analysis filters that approximate the frequency selectivity given by psychophysical data, e.g. from masking experiments using narrow-band maskers. This frequency selectivity is largely determined by the spectral decomposition process inside the human cochlea. Currently used spectral decomposition schemes for masking modeling in audio coding generally do not achieve the non-uniform time and frequency resolution provided by the cochlea. These applications rather take advantage of the computational efficiency of uniform filter banks or transforms at the expense of coding gain. This paper presents a suitable analysis filter-bank structure employing cascaded low-order IIR filters and appropriate downsampling to increase efficiency. In an application example, the filter responses were optimized to model auditory masking effects. The results show that the time and frequency resolution of the filter bank matches or exceeds the masking properties. Thus, the filter bank enables improved masking modeling for audio coding at low computational costs.
引用
收藏
页码:3265 / 3268
页数:4
相关论文
共 8 条
[1]  
BAUMGARTE F, 1998, 8 AES CONV SAN FRANC
[2]  
DENNIS JE, 1983, NUMERICAL METHODS NC
[3]  
EDLER B, 2000, P ICASSP 2001 IST JU, P1881
[4]   Asymmetry of masking revisited: Generalization of masker and probe bandwidth [J].
Hall, JL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (02) :1023-1033
[5]  
*ISO IEC, 1997, JTC1SC29WG11 ISOIEC
[6]  
Moore B.C.J., 1997, INTRO PSYCHOL HEARIN
[7]   Basilar-membrane responses to tones at the base of the chinchilla cochlea [J].
Ruggero, MA ;
Rich, NC ;
Recio, A ;
Narayan, SS ;
Robles, L .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 101 (04) :2151-2163
[8]  
Zwicker E., 1999, PSYCHOACOUSTICS, DOI [10.1007/978-3-662-09562-1, DOI 10.1007/978-3-662-09562-1]