StutterNet: Stuttering Disfluencies Detection in Synthetic Speech Signals via Mel Frequency Cepstral Coefficients Features Using Deep Learning

Cited by: 1

Authors:

Abubakar, Muhammad [1]

Mujahid, Muhammad [2]

Kanwal, Khadija [3]

Iqbal, Sajid [4]

Asghar, Muhammad Nabeel [4]

Alaulamie, Abdullah [4]

Affiliations:

[1] Khwaja Fareed Univ Engn & Informat Technol, Dept Comp Sci, Rahim Yar Khan 64500, Pakistan

[2] Prince Sultan Univ, Artificial Intelligence & Data Analyt AIDA Lab, CCIS, Riyadh 11586, Saudi Arabia

[3] Women Univ, Inst Comp Sci & Informat Technol, Multan 60000, Pakistan

[4] King Faisal Univ, Coll Comp Sci & Informat Technol, Dept Informat Syst, Al Hasa 31982, Saudi Arabia

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Speech recognition; Hidden Markov models; Mathematical models; Feature extraction; Mel frequency cepstral coefficient; Neural networks; Information technology; BiLSTM; Recurrent neural networks; Long short term memory; Stuttering detection; machine learning; MFCC features; speech signals;

DOI:

10.1109/ACCESS.2024.3429343

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Stuttering is a speech disorder characterised by the repetition, prolongation, or blocking of sounds, syllables, or words, which can cause significant social and emotional difficulties for those who experience it. Early detection and diagnosis depend on clinicians and researchers being able to separate stuttered speech from normal speech accurately; doing so helps them understand the disorder, identify possible causes, and develop effective interventions and treatments. This study uses the UCLASS dataset and extracts 40 MFCC features to assess how well different machine learning classifiers distinguish normal from stuttered speech. The major problem is that the UCLASS dataset is imbalanced, which we address with the synthetic minority oversampling technique (SMOTE). After the machine learning experiments, we propose a novel hybrid model that performs better than the individual machine learning classifiers. In the hybrid model, the lightweight StutterNet model extracts the best features from the data and then makes the prediction. The results indicate that the evaluated classifiers showed varying levels of performance. Overall, the results suggest that hybrid classifiers have the potential to accurately classify normal and stuttered speech, which could have important implications for the early identification and diagnosis of stuttering, as well as for the development of assistive technologies and effective interventions and treatments.
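The oversampling step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `smote` and `balance`, the neighbour count `k`, and the seed are hypothetical choices made for illustration, and the sketch assumes a two-class problem with each sample given as a plain list of floats (for example, a 40-value MFCC vector). It shows the core idea of the technique, which is to create each synthetic minority sample by interpolating between a real minority sample and one of its nearest minority-class neighbours.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points, each interpolated between a minority
    sample and one of its k nearest minority-class neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [j for j in range(len(minority)) if j != i]
        others.sort(key=lambda j: math.dist(base, minority[j]))
        neighbour = minority[rng.choice(others[:k])]
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


def balance(features, labels, minority_label, k=5, seed=0):
    """Oversample minority_label until both classes have equal counts.

    Assumes exactly two classes; returns new lists and leaves inputs unchanged.
    """
    minority = [x for x, y in zip(features, labels) if y == minority_label]
    n_new = (len(labels) - len(minority)) - len(minority)
    new = smote(minority, max(n_new, 0), k=k, seed=seed)
    return features + new, labels + [minority_label] * len(new)
```

Calling `balance(features, labels, minority_label=1)` returns the original samples followed by the synthetic ones. In practice, oversampling is usually applied to the training split only, so that synthetic points do not leak into evaluation data.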


Pages: 99308-99320

Page count: 13

