Audio coding using a psychoacoustic pre- and post-filter

被引:0
作者
Edler, B [1 ]
Schuller, G [1 ]
机构
[1] Lucent Technol, Bell Labs, Multimedia Commun Res Lab, Murray Hill, NJ USA
来源
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI | 2000年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A novel concept for perceptual audio coding is presented which is based on the combination of a pre- and post-filter, controlled by a psychoacoustic model, with a transform coding scheme. This paradigm allows modeling of the temporal and spectral shape of the masked threshold with a resolution independent of the used transform. By using frequency warping techniques the maximum possible detail for a given filter order can be made frequency-dependent and thus better adapted to the human auditory system. The filter coefficients are represented efficiently by LSF parameters which can be adaptively interpolated over time. First experiments with a system obtained by extending an existing transform codec showed that this approach can significantly improve the performance for speech signals, while the performance for other signals remained the same.
引用
收藏
页码:881 / 884
页数:4
相关论文
共 10 条
[1]  
Brandenburg K, 1997, J AUDIO ENG SOC, V45, P4
[2]  
Chen JH, 1996, INT CONF ACOUST SPEE, P275, DOI 10.1109/ICASSP.1996.540411
[3]  
EDLER B, 1989, FREQUENZ, V43, P252, DOI 10.1515/FREQ.1989.43.9.252
[4]  
Hall JL, 1998, ELEC ENG HANDB SER, P391
[5]  
Kleijn W. B., 1995, Speech Coding and Synthesis
[6]  
LAINE UK, 1994, INT CONF ACOUST SPEE, P349
[7]  
Ramprashad S. A., 1999, IEEE WORKSH SPEECH C, P10
[8]   OPTIMIZING DIGITAL SPEECH CODERS BY EXPLOITING MASKING PROPERTIES OF THE HUMAN EAR [J].
SCHROEDER, MR ;
ATAL, BS ;
HALL, JL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 66 (06) :1647-1652
[9]  
SINHA D, 1997, DIGIT SIGNAL PROCESS, pCH42
[10]   LINEAR PREDICTION ON A WARPED FREQUENCY SCALE [J].
STRUBE, HW .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 68 (04) :1071-1076