Improved single channel phase-aware speech enhancement technique for low signal-to-noise ratio signal

Cited by: 20
Authors
Samui, Suman [1]
Chakrabarti, Indrajit [2]
Ghosh, Soumya Kanti [3]
Affiliations
[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India
[2] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur, W Bengal, India
[3] Indian Inst Technol, Sch Informat Technol, Kharagpur, W Bengal, India
Keywords
speech enhancement; signal denoising; spectral analysis; signal reconstruction; speech intelligibility; amplitude estimation; improved single channel phase-aware speech enhancement technique; low-signal-to-noise ratio signal; short-time spectral amplitude; phase corruption; additive noise contamination; phase-aware multiband spectral subtraction technique; spectral amplitude estimates; noise signal components; clean speech signal; composite quality measures; intelligibility assessment metrics; objective measure quality evaluation technique; SPECTRAL SUBTRACTION; SUPPRESSION; ALGORITHMS; INTELLIGIBILITY; DELAY;
DOI
10.1049/iet-spr.2015.0182
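Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract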
State-of-the-art single channel speech enhancement techniques modify the short-time spectral amplitude while neglecting the phase corruption caused by additive noise. This study introduces an improved speech enhancement algorithm based on a phase-aware multi-band spectral subtraction technique, which estimates the spectral amplitude of the clean speech signal by considering the phase of the speech and noise signal components, and uses the estimated phase of the clean speech signal for signal reconstruction in the time domain. Experimental results show that the proposed algorithm performs better than existing spectral subtraction methods in terms of various objective and composite quality measures and other intelligibility assessment metrics. Using the composite objective measure quality evaluation technique, the overall signal quality of the enhanced speech signal is observed to improve on average by 70% at 0 dB global input signal-to-noise ratio with the proposed approach.
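To illustrate why the phase of the speech and noise components matters for amplitude estimation, here is a minimal single-bin sketch in Python. It is not the authors' multi-band algorithm. It only uses the textbook relation |Y|^2 = |X|^2 + |D|^2 + 2|X||D|cos(phi) for a noisy coefficient Y = X + D, and it assumes the phase difference phi between speech and noise is known, which a real system would have to estimate. The function names are hypothetical.

```python
import numpy as np

def power_subtraction(noisy_mag, noise_mag):
    """Conventional power spectral subtraction.

    Assumes the speech/noise cross term is zero, i.e. |Y|^2 = |X|^2 + |D|^2.
    """
    return np.sqrt(np.maximum(noisy_mag**2 - noise_mag**2, 0.0))

def phase_aware_amplitude(noisy_mag, noise_mag, phase_diff):
    """Solve |Y|^2 = |X|^2 + |D|^2 + 2|X||D|cos(phase_diff) for |X|.

    phase_diff is the angle between the speech and noise components
    (assumed known here; this is an oracle quantity in this sketch).
    Only the '+' root of the quadratic is returned, which is the correct
    one whenever |X| + |D|cos(phase_diff) >= 0.
    """
    b = noise_mag * np.cos(phase_diff)
    disc = np.maximum(b**2 - (noise_mag**2 - noisy_mag**2), 0.0)
    return np.maximum(-b + np.sqrt(disc), 0.0)

# One time-frequency bin: speech amplitude 1.0, noise amplitude 0.8,
# with the noise lagging the speech by 60 degrees.
phi = np.pi / 3
X = 1.0 * np.exp(1j * 0.3)
D = 0.8 * np.exp(1j * (0.3 - phi))
Y = X + D

print("true |X|          :", abs(X))
print("power subtraction :", power_subtraction(abs(Y), abs(D)))
print("phase-aware       :", phase_aware_amplitude(abs(Y), abs(D), phi))
```

In this bin the cross term is positive, so the conventional estimate overshoots the true amplitude, while the phase-aware solution recovers it. In practice neither the noise magnitude nor the phase difference is known exactly, so this should be read as motivation rather than as a working enhancer.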
Pages: 641-650
Number of pages: 10