StutterNet: Stuttering Disfluencies Detection in Synthetic Speech Signals via Mel Frequency Cepstral Coefficients Features Using Deep Learning

被引:1
作者
Abubakar, Muhammad [1 ]
Mujahid, Muhammad [2 ]
Kanwal, Khadija [3 ]
Iqbal, Sajid [4 ]
Asghar, Muhammad Nabeel [4 ]
Alaulamie, Abdullah [4 ]
机构
[1] Khwaja Fareed Univ Engn & Informat Technol, Dept Comp Sci, Rahim Yar Khan 64500, Pakistan
[2] Prince Sultan Univ, Artificial Intelligence & Data Analyt AIDA Lab, CCIS, Riyadh 11586, Saudi Arabia
[3] Women Univ, Inst Comp Sci & Informat Technol, Multan 60000, Pakistan
[4] King Faisal Univ, Coll Comp Sci & Informat Technol, Dept Informat Syst, Al Hasa 31982, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Speech recognition; Hidden Markov models; Mathematical models; Feature extraction; Mel frequency cepstral coefficient; Neural networks; Information technology; BiLSTM; Recurrent neural networks; Long short term memory; Stuttering detection; machine learning; MFCC features; speech signals;
D O I
10.1109/ACCESS.2024.3429343
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stuttering is a speech disorder characterised by the repetition, prolongation, or blocking of sounds, syllables, or words, which can cause significant social and emotional difficulties for those who experience it. To help find and diagnose stuttering early, it is important for clinicians and researchers to accurately separate stuttering from normal speech. This helps them understand the disorder, find possible causes, and come up with effective interventions and treatments. The UCLASS dataset was used in this study, and 40 MFCC features were taken out to see how well different machine learning classifiers could indicate the difference between normal and stuttering speech. The major problem is the UCLASS imbalanced dataset. The authors address it with the synthetic minority oversampling technique. After machine learning experimentation's, we propose a novel hybrid model that performs better than individual machine learning. In hybrid, the lightweight SutterNet model takes the best features from the data and then make prediction. The results indicate that the evaluated classifiers showed varying levels of performance. Overall, the results suggest that hybrid classifiers have the potential to accurately classify normal and stuttering speech, which could have important implications for the early identification and diagnosis of stuttering, as well as the development of assistive technologies and effective interventions and treatments.
引用
收藏
页码:99308 / 99320
页数:13
相关论文
共 29 条
  • [1] Anusuya MA, 2010, Arxiv, DOI arXiv:1001.2267
  • [2] A novel attention model across heterogeneous features for stuttering event detection
    Al-Banna, Abedal-Kareem
    Fang, Hui
    Edirisinghe, Eran
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 244
  • [3] Stuttering Detection Using Atrous Convolutional Neural Networks
    Al-Banna, Abedal-Kareem
    Edirisinghe, Eran
    Fang, Hui
    [J]. 2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, : 252 - 256
  • [4] Stuttering Disfluency Detection Using Machine Learning Approaches
    Al-Banna, Abedal-Kareem
    Edirisinghe, Eran
    Fang, Hui
    Hadi, Wael
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2022, 21 (02)
  • [5] Sequence labeling to detect stuttering events in read speech
    Alharbi, Sadeen
    Hasan, Madina
    Simons, Anthony J. H.
    Brumfitt, Shelagh
    Green, Phil
    [J]. COMPUTER SPEECH AND LANGUAGE, 2020, 62
  • [6] Arjun K. N., 2020, Procedia Computer Science, V171, P1363, DOI 10.1016/j.procs.2020.04.146
  • [7] TranStutter: A Convolution-Free Transformer-Based Deep Learning Method to Classify Stuttered Speech Using 2D Mel-Spectrogram Visualization and Attention-Based Feature Representation
    Basak, Krishna
    Mishra, Nilamadhab
    Chang, Hsien-Tsung
    [J]. SENSORS, 2023, 23 (19)
  • [8] What causes stuttering?
    Büchel, C
    Sommer, M
    [J]. PLOS BIOLOGY, 2004, 2 (02): : 159 - 163
  • [9] Chavan RupaliS., 2013, International Journal of Computer Science and Mobile Computing, V2, P233
  • [10] Chopra M., 2020, Tech. Rep. CS224s