Improved single channel phase-aware speech enhancement technique for low signal-to-noise ratio signal

Cited by: 20
Authors
Samui, Suman [1]
Chakrabarti, Indrajit [2]
Ghosh, Soumya Kanti [3]
Affiliations
[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India
[2] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur, W Bengal, India
[3] Indian Inst Technol, Sch Informat Technol, Kharagpur, W Bengal, India
Keywords
speech enhancement; signal denoising; spectral analysis; signal reconstruction; speech intelligibility; amplitude estimation; improved single channel phase-aware speech enhancement technique; low-signal-to-noise ratio signal; short-time spectral amplitude; phase corruption; additive noise contamination; phase-aware multiband spectral subtraction technique; spectral amplitude estimates; noise signal components; clean speech signal; composite quality measures; intelligibility assessment metrics; objective measure quality evaluation technique; SPECTRAL SUBTRACTION; SUPPRESSION; ALGORITHMS; INTELLIGIBILITY; DELAY;
DOI
10.1049/iet-spr.2015.0182
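Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract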
In state-of-the-art single-channel speech enhancement techniques, the short-time spectral amplitude is modified while the effect of phase corruption caused by additive noise is neglected. This study introduces an improved speech enhancement algorithm based on a phase-aware multi-band spectral subtraction technique, which estimates the spectral amplitude of the clean speech signal by considering the phase of the speech and noise signal components and uses the estimated phase of the clean speech signal for signal reconstruction in the time domain. Experimental results show that the proposed algorithm performs better on various objective and composite quality measures and other intelligibility assessment metrics when compared with existing spectral subtraction methods. Using the composite objective measure quality evaluation technique, the overall quality of the enhanced speech signal is observed to improve by 70% on average at 0 dB global input signal-to-noise ratio with the proposed approach.
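The role of phase described in the abstract can be illustrated for a single frequency bin. The sketch below is not the paper's estimator; it is a minimal example that assumes oracle knowledge of the noise phase, which is unavailable in practice and must be estimated. For Y = S + N, the noisy power contains a cross term, |Y|^2 = |S|^2 + |N|^2 + 2|S||N|cos(theta_S - theta_N), which phase-blind power subtraction sets to zero. The function names and the chosen magnitudes and phases are hypothetical.

```python
import cmath
import math


def noisy_power(s_mag, n_mag, phase_diff):
    """|Y|^2 for Y = S + N, given |S|, |N| and the S-to-N phase difference."""
    return s_mag**2 + n_mag**2 + 2.0 * s_mag * n_mag * math.cos(phase_diff)


def basic_subtraction(y_mag, n_mag):
    """Phase-blind power spectral subtraction (cross term assumed zero)."""
    return math.sqrt(max(y_mag**2 - n_mag**2, 0.0))


def phase_aware_subtraction(y_mag, n_mag, phase_diff_yn):
    """|S| from S = Y - N, using the Y-to-N phase difference.

    Assumption: the noise phase is known exactly (oracle), for illustration.
    """
    s_sq = y_mag**2 + n_mag**2 - 2.0 * y_mag * n_mag * math.cos(phase_diff_yn)
    return math.sqrt(max(s_sq, 0.0))


# One bin at 0 dB local SNR: equal speech and noise magnitudes.
s = cmath.rect(1.0, 0.3)   # clean speech coefficient (hypothetical values)
n = cmath.rect(1.0, 1.5)   # noise coefficient (hypothetical values)
y = s + n                  # observed noisy coefficient

basic_est = basic_subtraction(abs(y), abs(n))
aware_est = phase_aware_subtraction(
    abs(y), abs(n), cmath.phase(y) - cmath.phase(n)
)

print(f"true |S|        = {abs(s):.4f}")
print(f"phase-blind est = {basic_est:.4f}")  # overestimates, since cos(1.2) > 0
print(f"phase-aware est = {aware_est:.4f}")
```

With these values the phase-blind estimate is off by roughly 30%, while the oracle phase-aware estimate recovers |S| exactly. This suggests why neglecting the cross term matters most when speech and noise magnitudes are comparable, as at low SNR.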
Pages: 641-650
Number of pages: 10