SPEECH ENHANCEMENT USING NONNEGATIVE MATRIX FACTORIZATION WITH TEMPORAL CONTINUITY

被引：0

作者：

Nam, Seung-Hyon ^{[1
]}

机构：

[1] Paichai Univ, Dept Elect Engn, 155-40,Baejae Ro, Daejeon 303735, South Korea

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | 2015年 / 34卷 / 03期

关键词：

Speech enhancement; Nonnegative matrix factorization; Variational Bayesian inference; Gamma-Makov chain; Temporal continuity;

D O I：

10.7776/ASK.2015.34.3.240

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, speech enhancement using nonnegative matrix factorization with temporal continuity has been addressed. Speech and noise signals are modeled as Possion distributions, and basis vectors and gain vectors of NMF are modeled as Gamma distributions. Temporal continuity of the gain vector is known to be critical to the quality of enhanced speech signals. In this paper, temporal continiuty is implemented by adopting Gamma-Markov chain priors for noise gain vectors during the separation phase. Simulation results show that the Gamma-Markov chain models temporal continuity of noise signals and track changes in noise effectively.

引用

页码：240 / 246

页数：7

共 10 条

[1]

Bishop C.M., 2006, PATTERN RECOGN, P462

[2]

Cemgil A. T., 2007, 7 INT C IND COMP AN, P697

[3]

Cemgil Ali Taylan, 2009, Comput Intell Neurosci, P785152, DOI 10.1155/2009/785152

[4] Learning the parts of objects by non-negative matrix factorization [J].

Lee, DD ;

Seung, HS .

NATURE, 1999, 401 (6755) :788-791

[5]

Loizou P. C, 2013, SPEECH ENHANCEMENT T, P1

[6] Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization [J].

Mohammadiha, Nasser ;

Smaragdis, Paris ;

Leijon, Arne .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10) :2140-2151

[7]

Quackenbush S. R., 1988, OBJECTIVE MEASURES S, P45

[8] Static and Dynamic Source Separation Using Nonnegative Factorizations [A unifed view] [J].

Smaragdis, Paris ;

Fevotte, Cedric ;

Mysore, Gautham J. ;

Mohammadiha, Nasser ;

Hoffman, Matthew .

IEEE SIGNAL PROCESSING MAGAZINE, 2014, 31 (03) :66-75

[9] Performance measurement in blind audio source separation [J].

Vincent, Emmanuel ;

Gribonval, Remi ;

Févotte, Cedric .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04) :1462-1469

[10] Bayesian extensions to non-negative matrix factorisation for audio signal modelling [J].

Virtanen, Tuomas ;

Cemgil, A. Taylan ;

Godsill, Simon .

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :1825-1828

← 1 →