Reduction of noise in speech signals through image processing using the spectrogram

被引:4
作者
Graduate School of Science and Technology, Meijo University, 1-501 Shiogamaguchi, Tempaku-ku, Nagoya, 468-8502, Japan [1 ]
不详 [2 ]
机构
[1] Graduate School of Science and Technology, Meijo University, Tempaku-ku, Nagoya, 468-8502
来源
IEEJ Trans. Electron. Inf. Syst. | 2006年 / 12卷 / 1483-1489+10期
关键词
Image processing; Noise reduction; Spectrogram; Speech recognition; Speech signal;
D O I
10.1541/ieejeiss.126.1483
中图分类号
学科分类号
摘要
Speech recognition systems have come to be used widely. When any speech recognition system is disturbed by surrounding noises, considerable reduction in the recognition rate is inevitable. It is much desired to develop noise reduction methods so that any speech recognition system can be used in realistic environments. We propose a novel scheme especially effective for reducing the noise generated in the vehicles. The reduction is achieved through image processing techniques applied to the corresponding spectrograms. Experiments have been conducted on speech sounds in the vehicles. The performances have been evaluated in terms of the output signal-to-noise ratio (SNR). The proposed scheme has been compared with the conventional spectral subtraction method, and found to be promising especially for speeches corrupted with great amount of car noises.
引用
收藏
页码:1483 / 1489+10
相关论文
共 50 条
[31]   Adaptive EMG Noise Reduction in ECG Signals Using Noise Level Approximation [J].
Marouf, Mohamed ;
Saranovac, Lazar .
2017 INTERNATIONAL CONFERENCE ON ROBOTICS AND MACHINE VISION, 2017, 10613
[32]   Derivation of Criterion Suitable for Evaluation of Multichannel Noise Reduction Systems for Speech Processing [J].
Bolom, Vaclav ;
Ingerle, Jan ;
Sovka, Pavel .
RADIOENGINEERING, 2009, 18 (04) :651-664
[33]   Symbolic dynamics for processing chaotic signals - I: Noise reduction of chaotic sequences [J].
Schweizer, J ;
Schimming, T .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2001, 48 (11) :1269-1282
[34]   Speech Emotion Recognition Using Spectrogram & Phoneme Embedding [J].
Yenigalla, Promod ;
Kumar, Abhay ;
Tripathi, Suraj ;
Singh, Chirag ;
Kar, Sibsambhu ;
Vepa, Jithendra .
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, :3688-3692
[35]   Application of infrared image noise reduction by processing of wavelet transform threshold [J].
Liang, M ;
Zhao, FY ;
Li, QS .
INFRARED COMPONENTS AND THEIR APPLICATIONS, 2005, 5640 :293-296
[36]   Distribution of Operations in Heterogeneous Computing Systems for Processing Speech Signals [J].
Rakhimov, Mekhriddin ;
Ochilov, Manon .
2021 IEEE 15TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2021), 2021,
[37]   Estimation of the speech quality of noise reduced signals [J].
Gautier-Turbin, Valerie ;
Le Faucheur, Nicolas .
MEASUREMENT OF SPEECH, AUDIO AND VIDEO QUALITY IN NETWORKS, 2007, :53-58
[38]   A Novel Fuzzy Technique for Image Noise Reduction [J].
Nejad, Hamed Vahdat ;
Pourreza, Hameed Reza ;
Ebrahimi, Hasan .
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 14, 2006, 14 :390-395
[39]   Effect of Noise Reduction on Reaction Time to Speech in Noise [J].
Huckvale, Mark ;
Leak, Jayne .
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, :1327-1330
[40]   A new conjugate gradient algorithm for noise reduction in signal processing and image restoration [J].
Huang, Pan ;
Liu, Kaiping .
FRONTIERS IN PHYSICS, 2022, 10