Robust Automatic Speech Recognition System for the Recognition of Continuous Kannada Speech Sentences in the Presence of Noise

被引:0
作者
机构
[1] Visvesvaraya Technological University,Department of Electronics and Communication Engineering, Vidyavardhaka College of Engineering
来源
Wireless Personal Communications | 2023年 / 130卷
关键词
Approximation coefficients; Detail coefficients; Monophones; Tri-phones; Deep neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic Speech Recognition system is developed for recognizing the continuous and spontaneous Kannada speech sentences in clean and noisy environments. The language models and acoustic models are constructed using Kaldi toolkit. The speech corpus is developed with the native female and male Kannada speakers and is partioned into training set and testing set. The Performance of the proposed system is analysed and evaluated using the metric Word Error Rate (WER). The Wavelet Packets amalgamated with Mel filter banks are utilized to perform feature vector generation. The proposed hand crafted features perform better than the baseline features such as Perceptual Linear Prediction, Mel Frequency Cepstral Coefficients interms of WER under both clean and nosiy environmental conditions.
引用
收藏
页码:2039 / 2058
页数:19
相关论文
共 76 条
[11]  
Nandi D(2001)Mel filter-like admissible wavelet packet structure for speech recognition IEEE Signal Processing Letters 53 2489-399
[12]  
Pati D(2005)Fourier transform representation by frequency-time wavelets IEEE Transactions on Signal Processing 17 389-911
[13]  
Sreenivasa Rao K(2014)Feature extraction technique using ERB like wavelet sub-band periodic and aperiodic decomposition for TIMIT phoneme recognition International Journal of Speech Technology 10 902-3511
[14]  
Li J(2016)Admissible wavelet packet sub-band based harmonic energy features using ANOVA fusion techniques for Hindi phoneme recognition IET Signal Processing 41 3497-2232
[15]  
Deng L(1993)Theory of regular M-band wavelet bases IEEE Transactions on Signal Processing 40 2207-730
[16]  
Gong Y(1992)Wavelets and filter banks: Theory and design IEEE Transactions on Signal Processing 172 717-355
[17]  
Haeb-Umbach R(2006)An algebraic construction of orthonormal M-band wavelets with perfect reconstruction Applied Mathematics and Computation 07 341-362
[18]  
Mukherjee H(2009)Discrete wavelet transform applied on personal identity verification with ECG signal International Journal of Wavelets, Multiresolution and Information Processing 49 355-197
[19]  
Obaidullah SM(2002)An efficient coding algorithm for the compression of ECG signals using the wavelet transform IEEE Transactions on Biomedical Engineering 106 184-911
[20]  
Santosh KC(2015)Speech enhancement using a wavelet thresholding method based on symmetric Kullback–Leibler divergence Signal Processing 22 899-87