SPEECH ENHANCEMENT USING β-DIVERGENCE BASED NMF WITH UPDATE BASES

Cited by: 0
Authors
Sunnydayal, V. [1]
Kumar, T. Kishore [1]
Affiliations
[1] Natl Inst Technol Warangal, Dept Elect & Commun Engn, Warangal, Telangana, India
Source
2016 INTERNATIONAL CONFERENCE ON MICROELECTRONICS, COMPUTING AND COMMUNICATIONS (MICROCOM) | 2016
Keywords
Nonnegative matrix factorization (NMF); β-divergence; Speech enhancement; Speech presence probability (SPP); NONNEGATIVE MATRIX FACTORIZATION; ALGORITHMS; NOISE;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202 ;
Abstract
This paper proposes a speech enhancement method that combines a statistical model-based approach with a non-negative matrix factorization (NMF) based approach, with on-line update of the speech and noise bases. Template-based approaches are more robust and perform better in non-stationary noise than statistical model-based approaches. However, template-based approaches depend on a priori information. Combining the two approaches avoids the drawbacks of both. To improve performance further, the speech and noise bases are adapted simultaneously in the NMF approach with the help of the estimated speech presence probability (SPP). In non-stationary noise environments, the proposed approach yields better results than the statistical model-based approach, the NMF-based approach, and the combination of both without on-line update.
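The abstract does not spell out the factorization step, so the following is only a minimal sketch of the commonly used multiplicative updates for NMF under the β-divergence, assuming a nonnegative magnitude spectrogram `V` (frequency bins by frames). The names `beta_divergence`, `beta_nmf`, `rank`, and `EPS` are illustrative assumptions, not taken from the paper, and the sketch does not include the SPP-driven on-line adaptation of speech and noise bases that the paper proposes.

```python
import numpy as np

EPS = 1e-12  # assumed small floor that keeps divisions and logs finite


def beta_divergence(V, V_hat, beta):
    """Sum of element-wise beta-divergences d_beta(V | V_hat)."""
    V = np.maximum(V, EPS)
    V_hat = np.maximum(V_hat, EPS)
    if beta == 0:  # Itakura-Saito divergence
        ratio = V / V_hat
        return float(np.sum(ratio - np.log(ratio) - 1.0))
    if beta == 1:  # generalized Kullback-Leibler divergence
        return float(np.sum(V * np.log(V / V_hat) - V + V_hat))
    # General case; beta == 2 gives half the squared Euclidean distance.
    num = V**beta + (beta - 1.0) * V_hat**beta - beta * V * V_hat**(beta - 1.0)
    return float(np.sum(num) / (beta * (beta - 1.0)))


def beta_nmf(V, rank, beta=1.0, n_iter=100, seed=0):
    """Factorize V ~ W @ H with multiplicative updates (illustrative sketch).

    W holds `rank` spectral bases (columns); H holds their activations.
    Returns W, H and the divergence recorded after each iteration.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, rank)) + EPS
    H = rng.random((rank, n_frames)) + EPS
    history = []
    for _ in range(n_iter):
        # Update activations with the bases held fixed.
        V_hat = np.maximum(W @ H, EPS)
        H *= (W.T @ (V_hat**(beta - 2.0) * V)) / np.maximum(
            W.T @ V_hat**(beta - 1.0), EPS)
        # Update bases with the activations held fixed.
        V_hat = np.maximum(W @ H, EPS)
        W *= ((V_hat**(beta - 2.0) * V) @ H.T) / np.maximum(
            V_hat**(beta - 1.0) @ H.T, EPS)
        history.append(beta_divergence(V, W @ H, beta))
    return W, H, history
```

In an enhancement setting one would typically learn separate speech and noise bases this way and concatenate them before estimating activations on the noisy spectrogram; how the paper weights or adapts those bases using the SPP is not stated in the abstract and is left out here.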
Pages: 6
Related papers
23 in total
[1]   Algorithms and applications for approximate nonnegative matrix factorization [J].
Berry, Michael W. ;
Browne, Murray ;
Langville, Amy N. ;
Pauca, V. Paul ;
Plemmons, Robert J. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :155-173
[2]   Cabras G., 2010, P SOUND MUS COMP C, P314
[3]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445
[4]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[5]   Algorithms for Nonnegative Matrix Factorization with the β-Divergence [J].
Fevotte, Cedric ;
Idier, Jerome .
NEURAL COMPUTATION, 2011, 23 (09) :2421-2456
[6]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[7]   Garofolo J., 1988, Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database
[8]   Evaluation of objective quality measures for speech enhancement [J].
Hu, Yi ;
Loizou, Philipos C. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01) :229-238
[9]   Kisoo Kwon, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P7053, DOI 10.1109/ICASSP.2014.6854968
[10]   Learning the parts of objects by non-negative matrix factorization [J].
Lee, DD ;
Seung, HS .
NATURE, 1999, 401 (6755) :788-791