High-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modeling

被引:56
|
作者
Srinivasan, P [1 ]
Jamieson, LH [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
关键词
audio compression; wavelet applications; wavelet packets; zero trees;
D O I
10.1109/78.668558
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a technique to incorporate psychoacoustic models into an adaptive wavelet packet scheme to achieve perceptually transparent compression of high-quality (44.1 kHz) audio signals at about 45 kb/s, The filter bank structure adapts according to psychoacoustic criteria and according to the computational complexity that is available at the decoder, This permits software implementations that can perform according to the computational power available in order to achieve real time coding/decoding. The bit allocation scheme is an adapted zero-tree algorithm that also takes input from the psychoacoustic model, The measure of performance is a quantity called subband perceptual rate, which the filter bank structure adapts to approach the perceptual entropy (PE) as closely as possible. In addition, this method is also amenable to progressive transmission, that is, it can achieve the best quality of reconstruction possible considering the size of the bit stream available at the encoder, The result is a variable-rate compression scheme for high-quality audio that takes into account the allowed computational complexity, the available bit-budget, and the psychoacoustic criteria for transparent coding, This paper thus provides a novel scheme to marry the results in wavelet packets and perceptual coding to construct an algorithm that is well suited to high-quality audio transfer for internet and storage applications.
引用
收藏
页码:1085 / 1093
页数:9
相关论文
共 50 条
  • [31] HIGH-QUALITY AUDIO AMPLIFIERS
    DINSDALE, J
    ELECTRONICS AND POWER, 1968, 14 (MAY): : 199 - &
  • [32] High-quality image compression using a wavelet transform and left-neighbour-parent dependencies
    Lesani, F
    Freeman, GH
    1997 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS I AND II: ENGINEERING INNOVATION: VOYAGE OF DISCOVERY, 1997, : 133 - 136
  • [33] High quality switched wavelet packet and discrete cosine transform audio coding technique
    Chang, PC
    Wu, TH
    Ho, KY
    ISPACS 2005: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, 2005, : 749 - 752
  • [34] Complexity scalable audio coding algorithm based on wavelet packet decomposition
    He, DM
    Gao, W
    Wu, JQ
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 659 - 665
  • [35] LOSSLESS COMPRESSION OF CFA SAMPLED IMAGE USING DECORRELATED MALLAT WAVELET PACKET DECOMPOSITION
    Lee, Yeejin
    Hirakawa, Keigo
    Nguyen, Truong Q.
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2721 - 2725
  • [36] A hybrid approach of wavelet packet and directional decomposition for image compression
    Zhang, CN
    Wu, XY
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2002, 12 (02) : 51 - 55
  • [37] Speech Enhancement Using the Combination of Adaptive Wavelet Threshold and Spectral Subtraction based on Wavelet Packet Decomposition
    Li Ruwei
    Bao Changchun
    Xia Bingyin
    Jia Maoshen
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 481 - 484
  • [38] Audio compression using wavelets and wavelet packets
    Devassia, V.P.
    Sandeep, G.
    Antony, N. Sajeev
    Thomas, Tessamma
    Advances in Modelling and Analysis B, 2003, 46 (3-4): : 61 - 72
  • [39] Improved low bit-rate audio compression using reduced rank ICA instead of psychoacoustic modeling
    Ben-Shalom, A
    Werman, M
    Dubnov, S
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 461 - 464
  • [40] HIGH-QUALITY DIGITAL AUDIO ENCODING WITH 3.0 BITS PER SAMPLE USING ADAPTIVE TRANSFORM CODING
    SCHRODER, EF
    VOESSING, W
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1986, 34 (05): : 380 - 380