A Dynamic Fine-Grain Scalable Compression Scheme With Application to Progressive Audio Coding

被引:1
作者
Strahl, Stefan [1 ]
Hansen, Heiko [1 ]
Mertins, Alfred [2 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, D-26111 Oldenburg, Germany
[2] Med Univ Lubeck, Inst Signal Proc, D-23538 Lubeck, Germany
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 01期
关键词
Audio coding; embedded coding; progressive compression; significance tree;
D O I
10.1109/TASL.2010.2042129
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper studies the fine-grain scalable compression problem with emphasis on 1-D signals such as audio signals. Like in the successful 2-D still image compression techniques embedded zerotree wavelet coder (EZW) and set partitioning in hierarchical trees (SPIHT), the desired fine-granular scalability and high coding efficiency are benefited from a tree-based significance mapping technique. A significance tree serves to quickly locate and efficiently encode the important coefficients in the transform domain. The aim of this paper is to find such suitable significance trees for compressing dynamically variant 1-D signals. The proposed solution is a novel dynamic significance tree (DST) where, unlike in existing solutions with a single type of tree, a significance tree is chosen dynamically out of a set of trees by taking into account the actual coefficients distribution. We show how a set of possible DSTs can be derived that is optimized for a given (training) dataset. The method outperforms the existing scheme for lossy audio compression based on a single-type tree (SPIHT) and the scalable audio coding schemes MPEG-4 BSAC and MPEG-4 SLS. For bitrates less than 32 kbps, it results in an improved perceived audio quality compared to the fixed-bitrate MPEG-2/4 AAC audio coding scheme while providing progressive transmission and finer scalability.
引用
收藏
页码:14 / 23
页数:10
相关论文
共 31 条
  • [1] MPEG-4 natural audio coding
    Brandenburg, K
    Kunz, O
    Sugiyama, A
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2000, 15 (4-5) : 423 - 444
  • [2] DUNN C, 2001, P AES 111 CONV NEW Y
  • [3] *EUR BROADC UN, 1988, 3253 EUR BROADC UN
  • [4] PEMO-Q - A new method for objective: Audio quality assessment using a model of auditory perception
    Huber, Rainer
    Kollmeier, Birger
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 1902 - 1911
  • [5] *ISO IEC, 1449631999AMD1 ISOIE
  • [6] *ISO IEC, 2010, 1449652001AMD102007
  • [7] *ISO IEC, 2010, 1449652001 ISOIEC MP
  • [8] *ISO IEC, 1449632005AMD32006 I
  • [9] *ITU, 1990, ITU PUBL
  • [10] *ITU, 1988, ITU PUBL