Speech enhancement using posterior regularized NMF with bases update

被引：10

作者：

Sunnydayal, V. ^{[1
]}

Kumar, T. Kishore ^{[1
]}

机构：

[1] Natl Inst Technol Warangal, Elect & Commun Dept, Warangal 506004, Telangana, India

来源：

COMPUTERS & ELECTRICAL ENGINEERING | 2017年 / 62卷

关键词：

Non-negative matrix factorization; On-line bases update; Statistical model-based enhancement; Posterior regularization; NONNEGATIVE MATRIX FACTORIZATION; SPECTRAL AMPLITUDE ESTIMATOR; NOISE; DIVERGENCE; SPARSE;

D O I：

10.1016/j.compeleceng.2017.02.021

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, a combination of statistical model-based approach and Non-negative Matrix Factorization (NMF)-based approach with on-line update of speech and noise bases for speech enhancement is proposed. Template-based approaches are more robust and perform better than non-stationary noises compared to statistical model-based approaches but are dependent on a priori information. Combining the approaches avoids the drawbacks of both. To improve the performance further, speech and noise bases are adapted simultaneously in NMF approach with the help of the estimated speech presence probability (SPP). The proposed method outperforms other benchmark algorithms in terms of perceptual evaluation of speech quality (PESQ) and source-to-distortion ratio (SDR) in stationary and non-stationary noise environment conditions with matched and mismatched noise basis. (C) 2017 Elsevier Ltd. All rights reserved.

引用

页码：663 / 675

页数：13

共 25 条

[1] Algorithms and applications for approximate nonnegative matrix factorization [J].

Berry, Michael W. ;

Browne, Murray ;

Langville, Amy N. ;

Pauca, V. Paul ;

Plemmons, Robert J. .

COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :155-173

[2]

Cedric Fevotte, 2013, AC SPEECH SIGN PROC

[3]

DanielD Lee, 2001, ADV NEURAL INF PROCE

[4] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445

[5] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121

[6] Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].

Fevotte, Cedric ;

Bertin, Nancy ;

Durrieu, Jean-Louis .

NEURAL COMPUTATION, 2009, 21 (03) :793-830

[7]

Graca Joao V., 2007, EXPECTATION MAXIMIZA

[8]

Hanwook Chung, 2014, MACH LEARN SIGN PROC

[9] Evaluation of objective quality measures for speech enhancement [J].

Hu, Yi ;

Loizou, Philipos C. .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01) :229-238

[10]

JohnS Garofolo, 1988, GETTING STARTED DARP, P107

← 1 2 3 →