Noisy speech enhancement with sparsity regularization

被引：7

作者：

Kammi, Sonay ^{[1
]}

Mollaei, Mohammad Reza Karami ^{[1
]}

机构：

[1] Babol Univ Technol, Fac Elect & Comp Engn, Babol Sar, Iran

来源：

SPEECH COMMUNICATION | 2017年 / 87卷

关键词：

Unsupervised speech enhancement; Regularization term; Sparsity property; Alternating direction method of multipliers; SUBSPACE APPROACH;

D O I：

10.1016/j.specom.2017.01.003

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, a novel unsupervised speech enhancement algorithm is proposed assuming that both speech spectrogram and its temporal gradient are sparse. This assumption is reliable due to quasi harmonic nature of speech signals. In the proposed method, speech enhancement is performed by minimizing an appropriate objective function composed of a data fidelity term and sparsity imposing regularization terms. Alternating direction method of multipliers (ADMM) is adapted to solve the proposed model, and an efficient iterative algorithm is developed for speech enhancement. Extensive experiments demonstrate that the proposed method outperforms other competing methods in terms of different performance evaluation metrics. (C) 2017 Elsevier B.V. All rights reserved.

引用

页码：58 / 69

页数：12

共 50 条

[1] Harmonic Structure Mask for Speech Enhancement Using Sparsity Regularization
Wang, Haonan
Iwai, Kenta
Nishiura, Takanobu
2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 662 - 666
[2] Processing Noisy Speech for Enhancement
Krishnamoorthy, P.
Prasanna, Mahadeva
IETE TECHNICAL REVIEW, 2007, 24 (05) : 351 - 357
[3] Noisy speech recognition based on speech enhancement
Wang, Xia
Tang, Hongmei
Zhao, Xiaoqun
SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 713 - +
[4] Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech
Valentini-Botinhao, Cassia
Yamagishi, Junichi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (08) : 1420 - 1433
[5] ENHANCEMENT AND BANDWIDTH COMPRESSION OF NOISY SPEECH
LIM, JS
OPPENHEIM, AV
PROCEEDINGS OF THE IEEE, 1979, 67 (12) : 1586 - 1604
[6] Speech enhancement applied to speech recognition in noisy environments
Xu, Y.F., 2001, Press of Tsinghua University (41):
[7] Speech Synthesis enhancement in noisy environments
Bonardo, Davide
Zovato, Enrico
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 789 - 792
[8] Multimodal speech enhancement in noisy environment
Xu, R
Ren, Z
Dai, W
Lao, D
Kwan, C
PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 21 - 24
[9] Robust recognition of noisy speech using speech enhancement
Xu, YF
Zhang, JJ
Yao, KS
Cao, ZG
Ma, ZX
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 734 - 737
[10] SPARSITY-BASED CONFIDENCE MEASURE FOR PITCH ESTIMATION IN NOISY SPEECH
Huang, Feng
Lee, Tan
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4601 - 4604

← 1 2 3 4 5 →