Perceptual coding of digital audio

被引：441

作者：

Painter, T ^{[1
]}

Spanias, A ^{[1
]}

机构：

[1] Arizona State Univ, Ctr Telecommun Res, Dept Elect Engn, Tempe, AZ 85287 USA

来源：

PROCEEDINGS OF THE IEEE | 2000年 / 88卷 / 04期

关键词：

AC-2; AC-3; advanced audio coding (AAC); MPEG; ATRAC; audio coding; audio coding standards; audio signal processing; data compression; digital audio radio (DAR); digital broadcast audio (DBA); filter banks; high-definition TV (HDTV); linear predictive coding; lossy compression; modified discrete cosine transform (MDCT); MP3; MPEG-1; MPEG-2; MPEG-4; MPEG audio; multimedia signal processing; perceptual audio coding (PAC); perceptual coding; perceptual model; pseudoquadrature mirror filter (PQMF); psychoacoustic model; psychoacoustics; SDDS; signal compression; signal-processing applications; sinusoidal coding; subband coding; transform coding;

D O I：

10.1109/5.842996

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

During the last decade, CD-quality digital audio has essentially replaced analog audio. Emerging digital audio applications for network, wireless, and multimedia computing systems face a series of constraints such as reduced channel bandwidth, limited storage capacity, and low rest. These new applications have created a demand for high-quality digital audio delivery at low bit rates. In response to this need, consider-able research has been devoted to the development of algorithms for perceptually transparent coding of high,fidelity (CD-quality) digital audio. As a result, many algorithms have been proposed, and several have non become international and/or commercial product standards. This paper reviews algorithms for perceptually transparent coding of CD-quality digital audio, including both research and standardization activities. This paper is organized as follows. First, psychoacoustic principles are described, with the MPEG psychoacoustic signal analysis model I discussed in some detail. Next, filter bank design issues and algorithms are addressed, with a particular emphasis placed on the modified discrete cosine transform, a perfect reconstruction cosine-modulated filter bank that has become of central importance in perceptual audio coding. Then, we review methodologies that achieve perceptually transparent coding of FM- and CD-quality audio signals, including algorithms that manipulate transform components, subband signal decompositions, sinusoidal signal components, and linens prediction parameters, as well as hybrid algorithms that make use of more than one signal model. These discussions concentrate on architectures and applications of those techniques that utilize psychoacoustic models to exploit efficiently masking characteristics of the human receiver. Several algorithms that have become international and/or commercial standards receive in-depth treatment, including the ISO/IEC MPEG family (-1, -2, -4), the Lucent Technologies PAC/EPAC/MPAC, the Dolby(1) AC-2/AC-3, and the Sony ATRAC/SDDS algorithms. Then, Mle describe subjective evaluation methodologies in some detail, including the ITU-R BS.1116 recommendation on subjective measurements of small impairments. This paper concludes with a discussion of future research directions.

引用

页码：451 / 513

页数：63

共 50 条

[1] Perceptual Coding of High-Quality Digital Audio
Brandenburg, Karlheinz
Faller, Christof
Herre, Juergen
Johnston, James D.
Kleijn, W. Bastiaan
PROCEEDINGS OF THE IEEE, 2013, 101 (09) : 1905 - 1919
[2] A review of algorithms for perceptual coding of digital audio signals
Painter, T
Spanias, A
DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2: SPECIAL SESSIONS, 1997, : 179 - 208
[3] Prolog to -: Perceptual coding of digital audio -: An introduction to the paper by Painter and Spanias
Falk, H
PROCEEDINGS OF THE IEEE, 2000, 88 (04) : 449 - 450
[4] Joint speech/audio coding based scalable perceptual audio coding
Gao, Li
Hu, Ruimin
Yang, Yuhong
2014 IEEE/ACIS 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2014, : 419 - 424
[5] Perceptual Audio Coding - A history and timeline
Johnston, James D.
CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 2085 - 2087
[6] Compression artifacts in perceptual audio coding
Liu, Chi-Min
Hsu, Han-Wen
Lee, Wen-Chieh
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (04): : 681 - 695
[7] A progressive approach for perceptual audio coding
Shen, Y
Ai, HM
Kuo, CCJ
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 815 - 818
[8] On integer MDCT for perceptual audio coding
Li, Te
Rahardja, Susanto
Yu, Rongshan
Koh, Soo Ngee
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2236 - 2248
[9] CODING FOR DIGITAL AUDIO
RUDNICK, P
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1978, 26 (7-8): : 579 - 579
[10] Wideband speech and audio coding in the perceptual domain
Lin, L
Ambikairajah, E
Holmes, WH
ADVANCED SIGNAL PROCESSING FOR COMMUNICATION SYSTEMS, 2002, 703 : 15 - 30

← 1 2 3 4 5 →