Oblique Projection and Cepstral Subtraction in Signal Subspace Speech Enhancement for Colored Noise Reduction

被引：4

作者：

Surendran, Sudeep ^{[1
]}

Kumar, T. Kishore ^{[1
]}

机构：

[1] Natl Inst Technol, Dept Elect & Commun Engn, Warangal 506004, Andhra Pradesh, India

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2018年 / 26卷 / 12期

关键词：

Masking property; oblique projection; speech enhancement; signal subspace approach; variance normalization; SUPPRESSION; VARIANCE; MASKING; SPARSE;

D O I：

10.1109/TASLP.2018.2864535

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, a subspace speech enhancement method handling the case of colored noise using oblique projection in the cepstral domain is proposed. Perceptual features and variance normalization are used to reduce the residual noise and improve the intelligibility of the output speech. Initially, the additive noise present in the noisy speech is removed by removing the orthogonal noise subspace from the noisy speech subspace to obtain the speech subspace. Then, the oblique projection of the noise subspace on the speech subspace along the additive noise subspace is used to determine the colored noise that remains. Colored noise removal is performed by power spectral subtraction in the cepstral domain. The spectral domain constrained estimator that incorporates the combined masking property of the human auditory system is employed to estimate the clean speech signal using the variance of the colored noise. To avoid the occurrence of any abrupt spikes in the output, variance normalization is performed by adaptively changing the control parameter of the estimator's gain matrix. The spectrograms, the objective measures and the subjective intelligibility test show the superior performance of the proposed method over the other existing speech enhancement methods.

引用

页码：2328 / 2340

页数：13

共 37 条

[1]

[Anonymous], 1969, IEEE T ACOUST SPEECH, VAU17, P225

[2]

[Anonymous], 1963, PROC S TIME SER ANAL

[3] SIGNAL-PROCESSING APPLICATIONS OF OBLIQUE PROJECTION OPERATORS [J].

BEHRENS, RT ;

SCHARF, LL .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1994, 42 (06) :1413-1424

[4] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].

BOLL, SF .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120

[5] Evaluating the intelligibility benefit of speech modifications in known noise conditions [J].

Cooke, Martin ;

Mayo, Catherine ;

Valentini-Botinhao, Cassia ;

Stylianou, Yannis ;

Sauert, Bastian ;

Tang, Yan .

SPEECH COMMUNICATION, 2013, 55 (04) :572-585

[6] Sparse Hidden Markov Models for Speech Enhancement in Non-Stationary Noise Environments [J].

Deng, Feng ;

Bao, Changchun ;

Kleijn, W. Bastiaan .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) :1973-1987

[7] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445

[8] A SIGNAL SUBSPACE APPROACH FOR SPEECH ENHANCEMENT [J].

EPHRAIM, Y ;

VANTREES, HL .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04) :251-266

[9] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121

[10] Robust speech recognition system using bidirectional Kalman filter [J].

Goh, Yeh Huann ;

Raveendran, Paramesran ;

Goh, Yann Ling .

IET SIGNAL PROCESSING, 2015, 9 (06) :491-497

← 1 2 3 4 →