Noisy speech enhancement with sparsity regularization

被引:7
|
作者
Kammi, Sonay [1 ]
Mollaei, Mohammad Reza Karami [1 ]
机构
[1] Babol Univ Technol, Fac Elect & Comp Engn, Babol Sar, Iran
关键词
Unsupervised speech enhancement; Regularization term; Sparsity property; Alternating direction method of multipliers; SUBSPACE APPROACH;
D O I
10.1016/j.specom.2017.01.003
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a novel unsupervised speech enhancement algorithm is proposed assuming that both speech spectrogram and its temporal gradient are sparse. This assumption is reliable due to quasi harmonic nature of speech signals. In the proposed method, speech enhancement is performed by minimizing an appropriate objective function composed of a data fidelity term and sparsity imposing regularization terms. Alternating direction method of multipliers (ADMM) is adapted to solve the proposed model, and an efficient iterative algorithm is developed for speech enhancement. Extensive experiments demonstrate that the proposed method outperforms other competing methods in terms of different performance evaluation metrics. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:58 / 69
页数:12
相关论文
共 50 条
  • [1] Harmonic Structure Mask for Speech Enhancement Using Sparsity Regularization
    Wang, Haonan
    Iwai, Kenta
    Nishiura, Takanobu
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 662 - 666
  • [2] Processing Noisy Speech for Enhancement
    Krishnamoorthy, P.
    Prasanna, Mahadeva
    IETE TECHNICAL REVIEW, 2007, 24 (05) : 351 - 357
  • [3] Noisy speech recognition based on speech enhancement
    Wang, Xia
    Tang, Hongmei
    Zhao, Xiaoqun
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 713 - +
  • [4] Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech
    Valentini-Botinhao, Cassia
    Yamagishi, Junichi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (08) : 1420 - 1433
  • [5] ENHANCEMENT AND BANDWIDTH COMPRESSION OF NOISY SPEECH
    LIM, JS
    OPPENHEIM, AV
    PROCEEDINGS OF THE IEEE, 1979, 67 (12) : 1586 - 1604
  • [6] Speech enhancement applied to speech recognition in noisy environments
    Xu, Y.F., 2001, Press of Tsinghua University (41):
  • [7] Speech Synthesis enhancement in noisy environments
    Bonardo, Davide
    Zovato, Enrico
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 789 - 792
  • [8] Multimodal speech enhancement in noisy environment
    Xu, R
    Ren, Z
    Dai, W
    Lao, D
    Kwan, C
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 21 - 24
  • [9] Robust recognition of noisy speech using speech enhancement
    Xu, YF
    Zhang, JJ
    Yao, KS
    Cao, ZG
    Ma, ZX
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 734 - 737
  • [10] SPARSITY-BASED CONFIDENCE MEASURE FOR PITCH ESTIMATION IN NOISY SPEECH
    Huang, Feng
    Lee, Tan
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4601 - 4604