Non-negative Matrix Factorization with Linear Constraints for Single-Channel Speech Enhancement

被引:0
作者
Lyubimov, Nikolay [1 ]
Kotov, Mikhail [2 ]
机构
[1] Moscow MV Lomonosov State Univ, Moscow, Russia
[2] STEL Comp Syst Ltd, Moscow, Russia
来源
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年
关键词
speech enhancement; sinusoidal model; non-negative matrix factorization; SUPPRESSION; NOISE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates a non-negative matrix factorization (NMF)-based approach to the semi-supervised single-channel speech enhancement problem where only non-stationary additive noise signals are given. The proposed method relies on sinusoidal model of speech production which is integrated inside NMF framework using linear constraints on dictionary atoms. This method is further developed to regularize harmonic amplitudes. Simple multiplicative algorithms are presented. The experimental evaluation was made on TIMIT corpus mixed with various types of noise. It has been shown that the proposed method outperforms some of the state-of-the-art noise suppression techniques in terms of signal-to-noise ratio.
引用
收藏
页码:446 / 450
页数:5
相关论文
共 14 条
  • [1] [Anonymous], 2011, PROC IEEE INT S INTE
  • [2] Bertin Nancy, 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), P29, DOI 10.1109/ASPAA.2009.5346531
  • [3] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    BOLL, SF
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
  • [4] Cauchi B., 2012, P 1 WORKSH SPEECH MU, P28
  • [5] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
  • [6] Fevotte Cedric., 2010, Neural Computation, V13, P1
  • [7] Gold B., 2000, SPEECH AUDIO SIGNAL
  • [8] Evaluation of objective quality measures for speech enhancement
    Hu, Yi
    Loizou, Philipos C.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01): : 229 - 238
  • [9] SPEECH ANALYSIS SYNTHESIS BASED ON A SINUSOIDAL REPRESENTATION
    MCAULAY, RJ
    QUATIERI, TF
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (04): : 744 - 754
  • [10] Mysore GJ, 2011, INT CONF ACOUST SPEE, P17