Robust Automatic Speech Recognition System for the Recognition of Continuous Kannada Speech Sentences in the Presence of Noise

被引：2

作者：

Mahadevaswamy ^{[1
]}

机构：

[1] Visvesvaraya Technol Univ, Vidyavardhaka Coll Engn, Dept Elect & Commun Engn, Mysuru, Karnataka, India

来源：

WIRELESS PERSONAL COMMUNICATIONS | 2023年 / 130卷 / 03期

关键词：

Approximation coefficients; Detail coefficients; Monophones; Tri-phones; Deep neural networks; DISCRETE WAVELET TRANSFORM; WORD RECOGNITION; FEATURES;

D O I：

10.1007/s11277-023-10371-x

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

Automatic Speech Recognition system is developed for recognizing the continuous and spontaneous Kannada speech sentences in clean and noisy environments. The language models and acoustic models are constructed using Kaldi toolkit. The speech corpus is developed with the native female and male Kannada speakers and is partioned into training set and testing set. The Performance of the proposed system is analysed and evaluated using the metric Word Error Rate (WER). The Wavelet Packets amalgamated with Mel filter banks are utilized to perform feature vector generation. The proposed hand crafted features perform better than the baseline features such as Perceptual Linear Prediction, Mel Frequency Cepstral Coefficients interms of WER under both clean and nosiy environmental conditions.

引用

页码：2039 / 2058

页数：20

共 53 条

[1] Analysis of EEG records in an epileptic patient using wavelet transform [J].