Parametric audio coding

被引：0

作者：

Edler, B ^{[1
]}

Purnhagen, H ^{[1
]}

机构：

[1] Univ Hannover, Informat Technol Lab, D-30167 Hannover, Germany

来源：

2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III | 2000年

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

For very low bit rate audio coding applications in mobile communications or on the internet, parametric audio coding has evolved as a technique complementing the more traditional approaches. These are transform codecs originally designed for achieving CD-like quality on one hand, and specialized speech codecs on the other hand. Both of these techniques usually represent the audio signal waveform in a way such that the decoder output signal gives an approximation of the encoder input signal, while taking into account perceptual criteria. Compared to this approach, in parametric audio coding the models of the signal source and of human perception are extended. The source model is now based on the assumption that the audio signal is the sum of "components," each of which can be approximated by a relatively simple signal model with a small number of parameters. The perception model is based on the assumption that the sound of the decoder output signal should be as similar as possible to that of the encoder input signal. Therefore, the approximation of waveforms is no longer necessary. This approach can lead to a very efficient representation. However, a suitable set of models for signal components, a good decomposition, and a good parameter estimation are all vital for achieving maximum audio quality. We will give an overview on the current status of parametric audio coding developments and demonstrate advantages and challenges of this approach. Finally, we will indicate possible directions of further improvements.

引用

页码：21 / 24

页数：4

共 50 条

[41] Structured audio, Kolmogorov complexity, and generalized audio coding
Scheirer, ED
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): : 914 - 931
[42] ATSC video and audio coding
Davidson, GA
Isnardi, MA
Fielder, LD
Goldman, MS
Todd, CC
PROCEEDINGS OF THE IEEE, 2006, 94 (01) : 60 - 76
[43] Audio coding via EMD
Boudraa, Abdel-Ouahab
Khaldi, Kais
Chonavel, Thierry
Hadj-Alouane, Mounia Turki
Komaty, Ali
DIGITAL SIGNAL PROCESSING, 2020, 104
[44] Audio coding for conversion to MIDI
Sieger, NJ
Tewfik, AH
1997 IEEE FIRST WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1997, : 101 - 106
[45] DECORRELATION FOR AUDIO OBJECT CODING
Villemoes, Lars
Hirvonen, Toni
Purnhagen, Heiko
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 706 - 710
[46] Immersive audio, objects, and coding
Rumsey, Francis
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2015, 63 (05): : 394 - 398
[47] Multichannel audio decorrelation for coding
Torres-Guijarro, S
Ander, J
Alava, B
Casajús-Quirós, FJ
Ortiz-Berenguer, LI
DAFX-03: 6TH INTERNATIONAL CONFERENCE ON DIGITAL AUDIO EFFECTS, PROCEEDINGS, 2003, : 57 - 60
[48] DRA Audio Coding Standard
Ma Wenhua
Xu Jing
Ma Yuanzhe
You Yuli
CHINESE JOURNAL OF ELECTRONICS, 2014, 23 (03) : 521 - 526
[49] Immersive audio, objects, and coding
Rumsey, Francis
AES: Journal of the Audio Engineering Society, 2015, 63 (05): : 394 - 398
[50] Lossless coding for audio discs
Craven, Peter
Gerzon, Michael
AES: Journal of the Audio Engineering Society, 1996, 44 (09): : 706 - 720

← 1 2 3 4 5 →