Improved single channel phase-aware speech enhancement technique for low signal-to-noise ratio signal

被引:20
|
作者
Samui, Suman [1 ]
Chakrabarti, Indrajit [2 ]
Ghosh, Soumya Kanti [3 ]
机构
[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India
[2] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur, W Bengal, India
[3] Indian Inst Technol, Sch Informat Technol, Kharagpur, W Bengal, India
关键词
speech enhancement; signal denoising; spectral analysis; signal reconstruction; speech intelligibility; amplitude estimation; improved single channel phase-aware speech enhancement technique; low-signal-to-noise ratio signal; short-time spectral amplitude; phase corruption; additive noise contamination; phase-aware multiband spectral subtraction technique; spectral amplitude estimates; noise signal components; clean speech signal; composite quality measures; intelligibility assessment metrics; objective measure quality evaluation technique; SPECTRAL SUBTRACTION; SUPPRESSION; ALGORITHMS; INTELLIGIBILITY; DELAY;
D O I
10.1049/iet-spr.2015.0182
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the state-of-the-art single channel speech enhancement techniques, the short-time spectral amplitude is modified while the effect of the phase corruption due to the contamination of additive noise is neglected. This study introduces an improved speech enhancement algorithm based on a phase-aware multi-band spectral subtraction technique which estimates the spectral amplitude of the clean speech signal by considering the phase of the speech and noise signal components, and uses the estimated phase of the clean speech signal for signal reconstruction in the time domain. Experimental results show that the proposed algorithm yields better performance in terms of various objective and composite quality measures and other intelligibility assessment metrics while compared with other existing spectral subtraction methods. Using the composite objective measure quality evaluation technique, it is observed that the overall signal quality of the enhanced speech signal is improved on an average by 70% at 0 dB global input signal-to-noise ratio by using the proposed approach.
引用
收藏
页码:641 / 650
页数:10
相关论文
共 50 条
  • [41] Maximum a posteriori estimation of noise from non-acoustic reference signals in very low signal-to-noise ratio environments
    Milner, Ben
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 364 - 367
  • [42] Toward a more comprehensive understanding of the impact of masker type and signal-to-noise ratio on the pupillary response while performing a speech-in-noise test
    Wendt, Dorothea
    Koelewijn, Thomas
    Ksiazek, Patrycja
    Kramer, Sophia E.
    Lunner, Thomas
    HEARING RESEARCH, 2018, 369 : 67 - 78
  • [43] Real-time photonic sampling with improved signal-to-noise and distortion ratio using polarization-dependent modulators
    Liang, Dong
    Zhang, Zhiyao
    Liu, Yong
    Li, Xiaojun
    Jiang, Wei
    Tan, Qinggui
    OPTICS COMMUNICATIONS, 2018, 413 : 200 - 206
  • [44] Verification of Estimated Output Signal-to-Noise Ratios From a Phase Inversion Technique Using a Simulated Hearing Aid
    Yun, Donghyeon
    Shen, Yi
    Lentz, Jennifer J.
    AMERICAN JOURNAL OF AUDIOLOGY, 2023, 32 (01) : 197 - 209
  • [45] Intelligent identification technology for high-order digital modulation signals under low signal-to-noise ratio conditions
    Zha, Yanping
    Wang, Hongjun
    Shen, Zhexian
    Shi, Yingchun
    Shu, Feng
    IET SIGNAL PROCESSING, 2023, 17 (02)
  • [46] LDnADMM-Net: A Denoising Unfolded Deep Neural Network for Direction-of-Arrival Estimations in A Low Signal-to-Noise Ratio
    Liang, Can
    Liu, Mingxuan
    Li, Yang
    Wang, Yanhua
    Hu, Xueyao
    REMOTE SENSING, 2024, 16 (03)
  • [47] SINGLE CHANNEL SPEECH ENHANCEMENT TECHNIQUE FOR LOW SNR QUASI-PERIODIC NOISE BASED ON REDUCED ORDER LINEAR PREDICTION1
    Reddy, Chandan K. A.
    Montazeri, Vahid
    Rao, Yu
    Panahi, Issa M. S.
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 712 - 716
  • [48] Watt-level ultrahigh-optical signal-to-noise ratio single-longitudinal-mode tunable Brillouin fiber laser
    Wang, Gaomeng
    Zhan, Li
    Liu, Jinmei
    Zhang, Tao
    Li, Jun
    Zhang, Liang
    Peng, Junsong
    Yi, Lilin
    OPTICS LETTERS, 2013, 38 (01) : 19 - 21
  • [49] DNN-Based Low-Musical-Noise Single-Channel Speech Enhancement Based on Higher-Order-Moments Matching
    Mizoguchi, Satoshi
    Saito, Yuki
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (11) : 1971 - 1980
  • [50] On the effects of the time gate position and width on the signal-to-noise ratio for detection of Raman spectrum in a time-gated CMOS single-photon avalanche diode based sensor
    Nissinen, Ilkka
    Nissinen, Jan
    Keranen, Pekka
    Kostamovaara, Juha
    SENSORS AND ACTUATORS B-CHEMICAL, 2017, 241 : 1145 - 1152