Single-Channel and Multi-Channel Sinusoidal Audio Coding Using Compressed Sensing

被引:30
作者
Griffin, Anthony [1 ,2 ]
Hirvonen, Toni [1 ]
Tzagkarakis, Christos [1 ,2 ]
Mouchtaris, Athanasios [1 ,2 ]
Tsakalides, Panagiotis [1 ,2 ]
机构
[1] Fdn Res & Technol Hellas FORTH ICS, Inst Comp Sci, GR-70013 Iraklion, Crete, Greece
[2] Univ Crete, Dept Comp Sci, GR-70013 Iraklion, Crete, Greece
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 05期
关键词
Audio coding; compressed sensing (CS); signal reconstruction; signal sampling; sinusoidal model; SPARSE SIGNALS; QUANTIZATION; RECONSTRUCTION; MODEL;
D O I
10.1109/TASL.2010.2090656
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Compressed sensing (CS) samples signals at a much lower rate than the Nyquist rate if they are sparse in some basis. In this paper, the CS methodology is applied to sinusoidally modeled audio signals. As this model is sparse by definition in the frequency domain (being equal to the sum of a small number of sinusoids), we investigate whether CS can be used to encode audio signals at low bitrates. In contrast to encoding the sinusoidal parameters (amplitude, frequency, phase) as current state-of-the-art methods do, we propose encoding few randomly selected samples of the time-domain description of the sinusoidal component (per signal segment). The potential of applying compressed sensing both to single-channel and multi-channel audio coding is examined. The listening test results are encouraging, indicating that the proposed approach can achieve comparable performance to that of state-of-the-art methods. Given that CS can lead to novel coding systems where the sampling and compression operations are combined into one low-complexity step, the proposed methodology can be considered as an important step towards applying the CS framework to audio coding applications.
引用
收藏
页码:1382 / 1395
页数:14
相关论文
共 41 条
[1]  
[Anonymous], 2006, P IEEE DALL CAS WORK
[2]  
[Anonymous], 1997, METH SUBJ ASS SMALL
[3]  
[Anonymous], P EUR SIGN PROC C EU
[4]  
Baron D, 2006, TREE0612 RIC U DEP E
[5]  
Boufounos P., 2007, 0701 TREE RIC U EL C
[6]   Robust uncertainty principles:: Exact signal reconstruction from highly incomplete frequency information [J].
Candès, EJ ;
Romberg, J ;
Tao, T .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (02) :489-509
[7]  
Cappe O., 1995, 1995 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics (Cat. No.95TH8144), P213, DOI 10.1109/ASPAA.1995.482993
[8]   Exact reconstruction of sparse signals via nonconvex minimization [J].
Chartrand, Rick .
IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (10) :707-710
[9]  
den Brinker A. C., 2002, P 112 AUD ENG SOC AE, P5554
[10]   Compressed sensing [J].
Donoho, DL .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (04) :1289-1306