Parametric audio coding

被引:0
|
作者
Edler, B [1 ]
Purnhagen, H [1 ]
机构
[1] Univ Hannover, Informat Technol Lab, D-30167 Hannover, Germany
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For very low bit rate audio coding applications in mobile communications or on the internet, parametric audio coding has evolved as a technique complementing the more traditional approaches. These are transform codecs originally designed for achieving CD-like quality on one hand, and specialized speech codecs on the other hand. Both of these techniques usually represent the audio signal waveform in a way such that the decoder output signal gives an approximation of the encoder input signal, while taking into account perceptual criteria. Compared to this approach, in parametric audio coding the models of the signal source and of human perception are extended. The source model is now based on the assumption that the audio signal is the sum of "components," each of which can be approximated by a relatively simple signal model with a small number of parameters. The perception model is based on the assumption that the sound of the decoder output signal should be as similar as possible to that of the encoder input signal. Therefore, the approximation of waveforms is no longer necessary. This approach can lead to a very efficient representation. However, a suitable set of models for signal components, a good decomposition, and a good parameter estimation are all vital for achieving maximum audio quality. We will give an overview on the current status of parametric audio coding developments and demonstrate advantages and challenges of this approach. Finally, we will indicate possible directions of further improvements.
引用
收藏
页码:21 / 24
页数:4
相关论文
共 50 条
  • [41] Structured audio, Kolmogorov complexity, and generalized audio coding
    Scheirer, ED
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): : 914 - 931
  • [42] ATSC video and audio coding
    Davidson, GA
    Isnardi, MA
    Fielder, LD
    Goldman, MS
    Todd, CC
    PROCEEDINGS OF THE IEEE, 2006, 94 (01) : 60 - 76
  • [43] Audio coding via EMD
    Boudraa, Abdel-Ouahab
    Khaldi, Kais
    Chonavel, Thierry
    Hadj-Alouane, Mounia Turki
    Komaty, Ali
    DIGITAL SIGNAL PROCESSING, 2020, 104
  • [44] Audio coding for conversion to MIDI
    Sieger, NJ
    Tewfik, AH
    1997 IEEE FIRST WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1997, : 101 - 106
  • [45] DECORRELATION FOR AUDIO OBJECT CODING
    Villemoes, Lars
    Hirvonen, Toni
    Purnhagen, Heiko
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 706 - 710
  • [46] Immersive audio, objects, and coding
    Rumsey, Francis
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2015, 63 (05): : 394 - 398
  • [47] Multichannel audio decorrelation for coding
    Torres-Guijarro, S
    Ander, J
    Alava, B
    Casajús-Quirós, FJ
    Ortiz-Berenguer, LI
    DAFX-03: 6TH INTERNATIONAL CONFERENCE ON DIGITAL AUDIO EFFECTS, PROCEEDINGS, 2003, : 57 - 60
  • [48] DRA Audio Coding Standard
    Ma Wenhua
    Xu Jing
    Ma Yuanzhe
    You Yuli
    CHINESE JOURNAL OF ELECTRONICS, 2014, 23 (03) : 521 - 526
  • [49] Immersive audio, objects, and coding
    Rumsey, Francis
    AES: Journal of the Audio Engineering Society, 2015, 63 (05): : 394 - 398
  • [50] Lossless coding for audio discs
    Craven, Peter
    Gerzon, Michael
    AES: Journal of the Audio Engineering Society, 1996, 44 (09): : 706 - 720