Reduction of noise in speech signals through image processing using the spectrogram

Cited by: 4
Authors
Graduate School of Science and Technology, Meijo University, 1-501 Shiogamaguchi, Tempaku-ku, Nagoya, 468-8502, Japan [1]
Unknown [2]
Affiliations
[1] Graduate School of Science and Technology, Meijo University, Tempaku-ku, Nagoya, 468-8502
Source
IEEJ Trans. Electron. Inf. Syst. | 2006 / Vol. 12 / Issue 1483-1489+10
Keywords
Image processing; Noise reduction; Spectrogram; Speech recognition; Speech signal
DOI
10.1541/ieejeiss.126.1483
Abstract
Speech recognition systems are now widely used. When a speech recognition system is disturbed by surrounding noise, a considerable drop in the recognition rate is inevitable. Noise reduction methods are therefore much needed so that speech recognition systems can be used in realistic environments. We propose a novel scheme that is especially effective at reducing noise generated inside vehicles. The reduction is achieved by applying image processing techniques to the spectrogram of the noisy speech. Experiments were conducted on speech sounds in vehicles, and performance was evaluated in terms of the output signal-to-noise ratio (SNR). Compared with the conventional spectral subtraction method, the proposed scheme was found to be promising, especially for speech corrupted by a large amount of car noise.
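The abstract does not state which image processing operations are applied to the spectrogram, so the following is only a rough illustration of the general idea, not the authors' scheme. It treats the magnitude spectrogram as a grayscale image, smooths it with a 3x3 median filter (a stand-in chosen for this sketch), resynthesizes the signal with the noisy phase, and computes the output SNR. All function names, the frame and hop sizes, and the synthetic test signal are assumptions made for the example.

```python
import numpy as np

def stft(x, frame=256, hop=128):
    """Short-time Fourier transform with a periodic Hann window (50% overlap)."""
    win = np.hanning(frame + 1)[:-1]  # periodic Hann: overlapped windows sum to 1
    n_frames = 1 + (len(x) - frame) // hop
    frames = np.stack([x[i * hop:i * hop + frame] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)  # shape: (time frames, frequency bins)

def istft(spec, frame=256, hop=128):
    """Overlap-add inverse; samples covered by two frames are recovered exactly."""
    frames = np.fft.irfft(spec, n=frame, axis=1)
    out = np.zeros(hop * (len(frames) - 1) + frame)
    for i, f in enumerate(frames):
        out[i * hop:i * hop + frame] += f
    return out

def median_filter_3x3(img):
    """3x3 median filter with edge replication, applied to a 2-D array."""
    padded = np.pad(img, 1, mode="edge")
    h, w = img.shape
    shifts = [padded[r:r + h, c:c + w] for r in range(3) for c in range(3)]
    return np.median(np.stack(shifts), axis=0)

def denoise(noisy, frame=256, hop=128):
    """Smooth the magnitude spectrogram as an image and keep the noisy phase."""
    spec = stft(noisy, frame, hop)
    magnitude = median_filter_3x3(np.abs(spec))  # illustrative filter choice
    return istft(magnitude * np.exp(1j * np.angle(spec)), frame, hop)

def snr_db(clean, estimate):
    """Output SNR in dB: clean-signal energy over residual-error energy."""
    return 10.0 * np.log10(np.sum(clean ** 2) / np.sum((clean - estimate) ** 2))

def demo():
    # Synthetic stand-in for noisy speech: a 440 Hz tone plus white noise.
    rng = np.random.default_rng(0)
    t = np.arange(8000) / 8000.0
    clean = np.sin(2 * np.pi * 440.0 * t)
    noisy = clean + 0.5 * rng.standard_normal(len(t))
    estimate = denoise(noisy)
    n, hop = len(estimate), 128
    inner = slice(hop, n - hop)  # skip edge samples covered by a single frame
    print("SNR before (dB):", snr_db(clean[:n][inner], noisy[:n][inner]))
    print("SNR after  (dB):", snr_db(clean[:n][inner], estimate[inner]))

if __name__ == "__main__":
    demo()
```

Whether a filter like this helps depends heavily on the signal and the noise; the median filter is used here only because it is a simple, well-known image operation. The `snr_db` function corresponds to the output SNR criterion named in the abstract, assuming the clean reference signal is available.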
Pages: 1483-1489+10
Related papers
50 in total
  • [21] Estimation of the Noise Variance in Image and Noise Reduction
    Kim, Yeong-Hwa
    Nam, Jiho
    KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (05) : 905 - 914
  • [22] Fake Speech Detection Using Modulation Spectrogram
    Magazine, Raghav
    Agarwal, Ayush
    Hedge, Anand
    Prasanna, S. R. Mahadeva
    SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 451 - 463
  • [23] Segmentation of a speech spectrogram using Mathematical Morphology
    Steinberg, Raphael
    O'Shaughnessy, Douglas
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1637 - +
  • [24] Noise Reduction and Speech Enhancement Using Wiener Filter
    Nuha, Hilal H.
    Absa, Ahmad Abo
    2022 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ITS APPLICATIONS (ICODSA), 2022, : 177 - 180
  • [25] BLIND SPEECH SEGMENTATION USING SPECTROGRAM IMAGE-BASED FEATURES AND MEL CEPSTRAL COEFFICIENTS
    Stan, Adriana
    Valentini-Botinhao, Cassia
    Orza, Bogdan
    Giurgiu, Mircea
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 597 - 602
  • [26] Contribution of noise reduction pre-processing and microphone directionality strategies in the speech recognition in noise in adult cochlear implant users
    Goffi-Gomez, Maria Valeria Schmidt
    Muniz, Lilian
    Wiemes, Gislaine
    Onuki, Lucia Cristina
    Calonga, Luciane
    Osterne, Francisco Jose
    Kos, Maria Isabel
    Caldas, Fernanda Ferreira
    Cardoso, Carolina
    Cagnacci, Byanka
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2021, 278 (08) : 2823 - 2828
  • [27] Analysis of Noise Reduction Techniques in Speech Recognition
    Zheng, Bo
    Hu, Jinsong
    Zhang, Ge
    Wu, Yuling
    Deng, Jianshuang
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 928 - 933
  • [29] Subspace-Based Noise Reduction for Speech Signals via Diagonal and Triangular Matrix Decompositions: Survey and Analysis
    Per Christian Hansen
    Søren Holdt Jensen
    EURASIP Journal on Advances in Signal Processing, 2007
  • [30] Detecting Human Emotion via Speech Recognition by Using Speech Spectrogram
    Prasomphan, Sathit
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 113 - 122