Improved audio coding using a psychoacoustic model based on a cochlear filter bank

被引:16
作者
Baumgarte, F [1 ]
机构
[1] Agere Syst, Media Signal Proc Res Dept, Berkeley Hts, NJ 07922 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2002年 / 10卷 / 07期
关键词
audio coding; filter bank; masked threshold; model of masking; perceptual model;
D O I
10.1109/TSA.2002.804536
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Perceptual audio coders use an estimated masked threshold for the determination of the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model mimicking the properties of masking. Most psychoacoustic models for coding applications use a uniform (equal bandwidth) spectral decomposition as a first step to approximate the frequency selectivity of the human auditory system. However, the equal filter properties of the uniform subbands do not match the nonuniform characteristics of cochlear filters and reduce the precision of psychoacoustic modeling. Even so, uniform filter banks are applied because they are computationally efficient. This paper presents a psychoacoustic model based on an efficient nonuniform cochlear filter bank and a simple masked threshold estimation.. The novel filter-bank structure employs cascaded low-order HR filters and appropriate down-sampling to increase efficiency. The filter responses are. optimized for the modeling of auditory masking effects. Results of the new psychoacoustic model applied to audio coding show better performance in terms of bit rate and/or quality of the new model in comparison with other state-of-the-art models using a uniform spectral decomposition. The low delay of the new model is particularly suitable for low-delay coders.
引用
收藏
页码:495 / 503
页数:9
相关论文
共 41 条
  • [21] Model-based deadzone optimization for stack-run audio coding with uniform scalar quantization
    Oger, Marie
    Ragot, Stephane
    Antonini, Marc
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4761 - +
  • [22] Estimating Doppler shift using bat-inspired cochlear filter bank models: A comparison of methods for echoes from single and multiple reflectors
    Carmena, JM
    Hallam, JCT
    [J]. ADAPTIVE BEHAVIOR, 2001, 9 (3-4) : 241 - 261
  • [23] Blind Narrowband Interference Mitigation Using Filter Bank for Energy Detection Based UWB Receivers
    Xu, Zhimeng
    Nie, Hong
    Chen, Zhizhang
    Khani, Hassan
    Yang, Aidong
    [J]. 2013 IEEE RADIO AND WIRELESS SYMPOSIUM (RWS), 2013, : 139 - 141
  • [24] Face Recognition Using LDA Based Generalized Half band Polynomial Wavelet Filter Bank
    Muqeet, Mohd Abdul
    Holambe, Raghunath S.
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 4649 - 4653
  • [25] Classification of EEG Signals using Filter Bank Common Spatial Pattern Based on Fisher and Laplacian Criteria
    Wei, Qingguo
    Wan, Bin
    Lu, Zongwu
    [J]. MEASUREMENT TECHNOLOGY AND ITS APPLICATION, PTS 1 AND 2, 2013, 239-240 : 1033 - 1038
  • [26] A new source-filter model audio bandwidth extension using high frequency perception feature for IoT communications
    Jiang, Lin
    Yu, Shaoqian
    Wang, Xiaochen
    Wang, Chao
    Wang, Tonghan
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (13)
  • [27] EEG-Based Familiar and Unfamiliar Face Classification Using Filter-Bank Differential Entropy Features
    Liu, Guoyang
    Wen, Yiming
    Hsiao, Janet H.
    Zhang, Di
    Tian, Lan
    Zhou, Weidong
    [J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2024, 54 (01) : 44 - 55
  • [28] Attentional State Classification Using Amplitude and Phase Feature Extraction Method Based on Filter Bank and Riemannian Manifold
    Xu, Guiying
    Wang, Zhenyu
    Zhao, Xi
    Li, Ruxue
    Zhou, Ting
    Xu, Tianheng
    Hu, Honglin
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 4402 - 4412
  • [29] Identification of Inter-Area Oscillations Using Zolotarev Polynomial Based Filter Bank With Eigen Realization Algorithm
    Mandadi, Kalyani
    Kumar, B. Kalyan
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2016, 31 (06) : 4650 - 4659
  • [30] A New Filter Bank Multicarrier (FBMC) Based Cognitive Radio for 5G Networks Using Optimization Techniques
    Sheikh, Javaid A.
    Mir, Zaffer Iqbal
    Mufti, Mehboob ul Amin
    Parah, Shabir A.
    Bhat, G. Mohiuddin
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2020, 112 (02) : 1265 - 1280