A Speech Enhancement Method Based on Multi-Task Bayesian Compressive Sensing

Cited by: 5
Authors
You, Hanxu [1]
Ma, Zhixian [1]
Li, Wei [1]
Zhu, Jie [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
Source
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2017 / Vol. E100D / No. 03
Funding
National Natural Science Foundation of China
Keywords
speech enhancement; compressive sensing; overcomplete dictionary; sparse representation; SPARSE
DOI
10.1587/transinf.2016EDP7350
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Traditional speech enhancement (SE) algorithms often perform inconsistently across different types of noisy speech signals. In this paper, we propose a multi-task Bayesian compressive sensing based speech enhancement (MT-BCS-SE) algorithm that achieves performance comparable to, and more stable than, that of traditional SE algorithms. The MT-BCS-SE algorithm exploits the dependence among compressive sensing (CS) measurements and the sparsity of speech signals to perform SE. To obtain sufficiently sparse speech signals, we adopt an overcomplete dictionary to transform speech signals into sparse representations. The K-SVD algorithm is employed to learn various overcomplete dictionaries. The influence of the overcomplete dictionary on the MT-BCS-SE algorithm is evaluated through extensive experiments, so that the most suitable dictionary can be adopted to obtain the best performance. Experiments were conducted on the well-known NOIZEUS corpus to evaluate the performance of the proposed algorithm. On the NOIZEUS cases tested, MT-BCS-SE is shown to be competitive with or even superior to traditional SE algorithms, such as optimally-modified log-spectral amplitude (OMLSA), multi-band spectral subtraction (SSMul), and minimum mean square error (MMSE), in terms of signal-to-noise ratio (SNR), speech enhancement gain (SEG), and perceptual evaluation of speech quality (PESQ), and to have better stability than traditional SE algorithms.
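The general idea of recovering a signal from compressive measurements through a sparse representation over an overcomplete dictionary can be sketched as follows. This is only an illustrative sketch, not the paper's MT-BCS-SE algorithm: orthogonal matching pursuit (OMP) stands in for the multi-task Bayesian solver, a random unit-norm dictionary stands in for a K-SVD-learned one, and every name and size here (`omp`, `snr_db`, `n`, `n_atoms`, `m`, `k`) is an assumption made for the example.

```python
import numpy as np

def omp(A, y, k):
    """Greedy orthogonal matching pursuit: find a k-sparse x with A @ x close to y."""
    residual = y.copy()
    support = []
    x = np.zeros(A.shape[1])
    for _ in range(k):
        # Pick the column most correlated with the current residual.
        idx = int(np.argmax(np.abs(A.T @ residual)))
        if idx not in support:
            support.append(idx)
        # Least-squares fit of y on the columns selected so far.
        coef, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        residual = y - A[:, support] @ coef
    x[support] = coef
    return x

def snr_db(clean, estimate):
    """Signal-to-noise ratio in dB of an estimate relative to the clean signal."""
    noise = clean - estimate
    return 10.0 * np.log10(np.sum(clean ** 2) / np.sum(noise ** 2))

# Hypothetical sizes: frame length, dictionary atoms, measurements, sparsity.
n, n_atoms, m, k = 64, 128, 32, 4
rng = np.random.default_rng(0)

# Random overcomplete dictionary with unit-norm atoms (stand-in for K-SVD).
D = rng.standard_normal((n, n_atoms))
D /= np.linalg.norm(D, axis=0)

# Synthetic "clean frame" that is exactly k-sparse in D.
s_true = np.zeros(n_atoms)
s_true[rng.choice(n_atoms, k, replace=False)] = rng.standard_normal(k)
clean = D @ s_true

# Noisy compressive measurements y = Phi @ clean + noise.
Phi = rng.standard_normal((m, n)) / np.sqrt(m)
y = Phi @ clean + 0.01 * rng.standard_normal(m)

# Recover sparse coefficients from the measurements, then rebuild the frame.
s_hat = omp(Phi @ D, y, k)
enhanced = D @ s_hat
print(f"Reconstruction SNR: {snr_db(clean, enhanced):.1f} dB")
```

The reconstruction is constrained to at most `k` dictionary atoms, so measurement noise that cannot be expressed sparsely in `D` tends to be rejected; the quality of the result depends on the dictionary, which is consistent with the abstract's emphasis on choosing a suitable one.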
Pages: 556-563
Page count: 8