A Speech Enhancement Method Based on Multi-Task Bayesian Compressive Sensing

Cited by: 5
Authors
You, Hanxu [1]
Ma, Zhixian [1]
Li, Wei [1]
Zhu, Jie [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
Source
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2017 / Vol. E100D / No. 03
Funding
National Natural Science Foundation of China;
Keywords
speech enhancement; compressive sensing; overcomplete dictionary; sparse representation; SPARSE;
DOI
10.1587/transinf.2016EDP7350
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Traditional speech enhancement (SE) algorithms usually show fluctuating performance across different types of noisy speech signals. In this paper, we propose a multi-task Bayesian compressive sensing based speech enhancement (MT-BCS-SE) algorithm that achieves performance comparable to, and more stable than, that of traditional SE algorithms. The MT-BCS-SE algorithm exploits the dependence information among compressive sensing (CS) measurements and the sparsity of speech signals to perform SE. To obtain sufficient sparsity, we adopt an overcomplete dictionary to transform speech signals into sparse representations. The K-SVD algorithm is employed to learn various overcomplete dictionaries. The influence of the overcomplete dictionary on the MT-BCS-SE algorithm is evaluated through extensive experiments, so that the most suitable dictionary can be adopted to obtain the best performance. Experiments were conducted on the well-known NOIZEUS corpus to evaluate the proposed algorithm. On the NOIZEUS corpus, MT-BCS-SE is shown to be competitive with or even superior to traditional SE algorithms, such as optimally-modified log-spectral amplitude (OMLSA), multi-band spectral subtraction (SSMul), and minimum mean square error (MMSE), in terms of signal-to-noise ratio (SNR), speech enhancement gain (SEG), and perceptual evaluation of speech quality (PESQ), and to be more stable than these algorithms.
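As a rough illustration of the SNR metric named in the abstract, the sketch below computes SNR in dB for a clean signal and an enhanced (or noisy) estimate, using the common definition 10·log10(signal energy / residual energy). The function name `snr_db` and the toy samples are assumptions made for illustration; they are not taken from the paper, and the paper's exact evaluation procedure may differ.

```python
import math


def snr_db(clean, estimate):
    """Return SNR in dB, treating (clean - estimate) as the residual noise.

    Hypothetical helper for illustration; not the paper's evaluation code.
    """
    if len(clean) != len(estimate):
        raise ValueError("signals must have the same length")
    signal_energy = sum(s * s for s in clean)
    if signal_energy == 0:
        raise ValueError("clean signal has zero energy")
    noise_energy = sum((s - e) ** 2 for s, e in zip(clean, estimate))
    if noise_energy == 0:
        return math.inf  # estimate matches the clean signal exactly
    return 10.0 * math.log10(signal_energy / noise_energy)


# Toy example: a clean signal and a version with a small constant offset.
clean = [1.0, -1.0, 1.0, -1.0]
noisy = [1.1, -0.9, 1.1, -0.9]
print(round(snr_db(clean, noisy), 2))
```

Calling the same function on the noisy input and on the enhanced output gives one simple way to compare SNR before and after enhancement.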
Pages: 556-563
Page count: 8
Related papers
50 in total
  • [21] Multi-Static Passive SAR Imaging Based on Bayesian Compressive Sensing
    Wu, Qisong
    Zhang, Yimin D.
    Amin, Moeness G.
    Himed, Braham
    COMPRESSIVE SENSING III, 2014, 9109
  • [22] Speech Coding and Enhancement Using Quantized Compressive Sensing Measurements
    Ramdas, Vinitha
    Mishra, Deepak
    Gorthi, Sai Subrahmanyam
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [23] SAR ATR based on Bayesian compressive sensing
    Zhang, Xin-Zheng
    Huang, Pei-Kang
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2013, 35 (01): 40 - 44
  • [24] Multi-Task Learning U-Net for Single-Channel Speech Enhancement and Mask-Based Voice Activity Detection
    Lee, Geon Woo
    Kim, Hong Kook
    APPLIED SCIENCES-BASEL, 2020, 10 (09):
  • [25] Speech Recognition and Classification Using the Compressive Sensing Method
    Buakhlai, Sombat
    Udomsiri, Sakol
    2017 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF SIGNAL PROCESSING (ICFSP), 2017, : 8 - 14
  • [26] Speech Enhancement in Non-Stationary Noise Using Compressive Sensing
    Sulong, Amart
    Gunawan, Teddy Surya
    Khalifa, Othman O.
    Kartiwi, Mira
    PROCEEDINGS OF 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE 2016), 2016, : 489 - 493
  • [27] DIGITAL IMAGE WATERMARKING BASED ON BAYESIAN COMPRESSIVE SENSING
    Lv, Jun
    Li, Xiu-Mei
    2017 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION (ICWAPR), 2017, : 59 - 64
  • [28] Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks
    Chen, Zhuo
    Watanabe, Shinji
    Erdogan, Hakan
    Hershey, John R.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3274 - 3278
  • [29] A Speech Enhancement Method for Long-range Speech Acquisition Task
    Geng, Yanzhang
    Zhang, Tao
    Wang, Heng
    Zhao, Xin
    INTERSPEECH 2022, 2022, : 5453 - 5457
  • [30] Sparse Signal Recovery through Long Short-Term Memory Networks for Compressive Sensing-Based Speech Enhancement
    Shukla, Vasundhara
    Swami, Preety D.
    ELECTRONICS, 2023, 12 (14)