Dysarthric Speech Recognition Using Variational Mode Decomposition and Convolutional Neural Networks

被引:0
作者
R. Rajeswari
T. Devi
S. Shalini
机构
[1] Bharathiar University,Department of Computer Applications
来源
Wireless Personal Communications | 2022年 / 122卷
关键词
Automatic speech recognition; Dysarthric speech; Variational mode decomposition; Convolutional neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
Dysarthric speech recognition requires a learning technique that is able to capture dysarthric speech specific features. Dysarthric speech is considered as speech with source distortion or noisy speech. Hence, as a first step speech enhancement is performed using variational mode decomposition (VMD) and wavelet thresholding. The reconstructed signals are then fed as input to convolutional neural networks. These networks learn dysarthric speech specific features and generate a speech model that supports dysarthric speech recognition. The performance of the proposed method is evaluated using UA-Speech database. The average accuracy values obtained by the proposed method for speakers with different intelligibility levels with VMD based enhancement and without enhancement are 95.95 and 91.80% respectively. The proposed method also provides an increased accuracy value compared to existing methods that are based on generative models and artificial neural networks.
引用
收藏
页码:293 / 307
页数:14
相关论文
共 49 条
[1]  
Rampello L(2016)When the word doesnt come out: A synthetic overview of dysarthria Journal of the Neurological Sciences 369 354-360
[2]  
Rampello L(2018)Whistle-blowing ASRS: Evaluating the need for more inclusive automatic speech recognition systems Proceedings of the Annual conference of the International Speech Communication Association INTERSPEECH 2018 466-470
[3]  
Patti F(2017)Speech signal analysis and pattern recognition in diagnosis of dysarthria Annals of Indian Academy of Neurology 20 302-357
[4]  
Zappia M(2017)A relationship between processing speech in noise and dysarthric speech Journal of Acoustics Society of AMerica 141 4460-4467
[5]  
Moore M(2014)Variational mode decomposition IEEE Transactions on Signal Processing 62 531-544
[6]  
Demakethepalli VH(2017)Comparative analysis of EMD and VMD algorithm in speech enhancement International Journal of Natural Computing Research 6 17-35
[7]  
Panchanathan S(1991)On the use of hidden Markov modelling for recognition of dysarthric speech Computers Methods and Programs in Biomedicine 35 125-139
[8]  
Thoppil MG(2019)Assessment of dysarthria using one word speech recognition with hidden Markov models Journal of Korean Medical Science 34 e108-561
[9]  
Kumar CS(2016)Generative model-driven feature learning for dysarthric speech recognition Biocybernetics and Biomedical Engineering 36 553-110
[10]  
Kumar A(2014)Artificial networks as speech recognizers for dysarthric speech: Identifying the best performing set of MFCC parameters and studying a speaker independent approach Advanced Engineering Informatics 28 102-748