Frequency-domain algorithms for audio signal enhancement based on transient modification

被引:0
作者
Goodwin, Michael M. [1 ]
Avendano, Carlos
机构
[1] Creat Adv Technol Ctr, Scotts Valley, CA 95066 USA
[2] Audience Inc, Mountain View, CA 94041 USA
来源
JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2006年 / 54卷 / 09期
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Two processing approaches that enable perceptually compelling modification of audio signals via accentuation or suppression of transients are described. The first algorithm uses a frequency-domain analysis and a soft-decision paradigm to characterize transients in the signal; the transient characterization is then used to drive a nonlinear frequency-domain modification. In the second algorithm the modulation spectrum of the audio signal is manipulated by modifying the time trajectories of spectral envelopes in different frequency bands scaling of higher modulation frequencies with shelving filters is used to modify rapidly changing signal events, thus altering transient components without the need for explicit detection. The algorithms are described in detail and it is demonstrated that they can achieve substantial modification of a signal's perceptual attributes without introducing significant artifacts.
引用
收藏
页码:827 / 840
页数:14
相关论文
共 52 条
  • [1] ABDULLAH S, 2003, P CAMBR MUS PROC COL
  • [2] UNIFIED APPROACH TO SHORT-TIME FOURIER-ANALYSIS AND SYNTHESIS
    ALLEN, JB
    RABINER, LR
    [J]. PROCEEDINGS OF THE IEEE, 1977, 65 (11) : 1558 - 1564
  • [3] [Anonymous], 1996, Proceedings of the International Computer Music Conference
  • [4] [Anonymous], 1997, ALGORITHMS IMAGE PRO
  • [5] [Anonymous], P IEEE INT C AC SPEE
  • [6] Syllable intelligibility for temporally filtered LPC cepstral trajectories
    Arai, T
    Pavel, M
    Hermansky, H
    Avendano, C
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (05) : 2783 - 2791
  • [7] Atlas L, 2004, INT CONF ACOUST SPEE, P761
  • [8] AVENANDANO C, 2005, J AUDIO ENG SOC, V53, P104
  • [9] AVENANDANO C, 2005, J AUDIO ENG SOC, V53, P103
  • [10] Avendano C, 2004, J AUDIO ENG SOC, V52, P740