On perceptual distortion minimization and nonlinear least-squares frequency estimation

Cited by: 18
Authors
Christensen, MG [1]
Jensen, SH [1]
Affiliation
[1] Aalborg Univ, Dept Commun Technol, DK-9220 Aalborg, Denmark
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006, Vol. 14, No. 1
Keywords
audio coding; estimation; frequency estimation; matching pursuit; perceptual distortion measures; signal representations; sinusoidal modeling
DOI
10.1109/TSA.2005.860347
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we present a framework for perceptual error minimization and sinusoidal frequency estimation based on a new perceptual distortion measure, and we state its optimal solution. Using this framework, we relate a number of well-known practical methods for perceptual sinusoidal parameter estimation such as the prefiltering method, the weighted matching pursuit, and the perceptual matching pursuit. In particular, we derive and compare the sinusoidal estimation criteria used in these methods. We show that for the sinusoidal estimation problem, the prefiltering method and the weighted matching pursuit are equivalent to the perceptual matching pursuit under certain conditions.
Pages: 99-109
Number of pages: 11