Improved single channel phase-aware speech enhancement technique for low signal-to-noise ratio signal

被引:20
|
作者
Samui, Suman [1 ]
Chakrabarti, Indrajit [2 ]
Ghosh, Soumya Kanti [3 ]
机构
[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India
[2] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur, W Bengal, India
[3] Indian Inst Technol, Sch Informat Technol, Kharagpur, W Bengal, India
关键词
speech enhancement; signal denoising; spectral analysis; signal reconstruction; speech intelligibility; amplitude estimation; improved single channel phase-aware speech enhancement technique; low-signal-to-noise ratio signal; short-time spectral amplitude; phase corruption; additive noise contamination; phase-aware multiband spectral subtraction technique; spectral amplitude estimates; noise signal components; clean speech signal; composite quality measures; intelligibility assessment metrics; objective measure quality evaluation technique; SPECTRAL SUBTRACTION; SUPPRESSION; ALGORITHMS; INTELLIGIBILITY; DELAY;
D O I
10.1049/iet-spr.2015.0182
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the state-of-the-art single channel speech enhancement techniques, the short-time spectral amplitude is modified while the effect of the phase corruption due to the contamination of additive noise is neglected. This study introduces an improved speech enhancement algorithm based on a phase-aware multi-band spectral subtraction technique which estimates the spectral amplitude of the clean speech signal by considering the phase of the speech and noise signal components, and uses the estimated phase of the clean speech signal for signal reconstruction in the time domain. Experimental results show that the proposed algorithm yields better performance in terms of various objective and composite quality measures and other intelligibility assessment metrics while compared with other existing spectral subtraction methods. Using the composite objective measure quality evaluation technique, it is observed that the overall signal quality of the enhanced speech signal is improved on an average by 70% at 0 dB global input signal-to-noise ratio by using the proposed approach.
引用
收藏
页码:641 / 650
页数:10
相关论文
共 50 条
  • [1] Improved signal-to-noise ratio estimation for speech enhancement
    Plapous, Cyril
    Marro, Claude
    Scalart, Pascal
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 2098 - 2108
  • [2] A New Weighted Loss for Single Channel Speech Enhancement under Low Signal-to-Noise Ratio Environment
    Xiao, Jian
    Liu, Hongqing
    Zhou, Yi
    Luo, Zhen
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 15 - 19
  • [3] FPGA Implementation of a Phase-Aware Single-Channel Speech Enhancement System
    Samui, Suman
    Sahu, Pragya
    Chakrabarti, Indrajit
    Ghosh, Soumya K.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (11) : 4688 - 4715
  • [4] Phase-Aware Single-channel Speech Enhancement
    Mowlaee, Pejman
    Watanabe, Mario Kaoru
    Saeidi, Rahim
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1871 - 1873
  • [5] FPGA Implementation of a Phase-Aware Single-Channel Speech Enhancement System
    Suman Samui
    Pragya Sahu
    Indrajit Chakrabarti
    Soumya K. Ghosh
    Circuits, Systems, and Signal Processing, 2017, 36 : 4688 - 4715
  • [6] On Speech Intelligibility Estimation of Phase-Aware Single-Channel Speech Enhancement
    Gaich, Andreas
    Mowlaee, Pejman
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2553 - 2557
  • [7] Phase-Aware Signal Processing for Automatic Speech Recognition
    Fahringer, Johannes
    Schrank, Tobias
    Stahl, Johannes
    Mowlaee, Pejman
    Pernkopf, Franz
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3374 - 3378
  • [8] Single-Channel Speech Enhancement Algorithm Based on ME-MGCRN in Low Signal-to-Noise Scenario
    Lan, Chaofeng
    Zhao, Shilong
    Chen, Huan
    Zhang, Lei
    Yang, Yuchen
    Fan, Zixu
    Zhang, Meng
    IEEE ACCESS, 2024, 12 : 101342 - 101355
  • [9] An evaluation of the perceptual quality of phase-aware single-channel speech enhancement
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (04) : EL364 - EL369
  • [10] A robust and lightweight voice activity detection algorithm for speech enhancement at low signal-to-noise ratio
    Zhu, Zhehui
    Zhang, Lijun
    Pei, Kaikun
    Chen, Siqi
    DIGITAL SIGNAL PROCESSING, 2023, 141