Diagnosis of pathological speech with streamlined features for long short-term memory learning

被引:4
|
作者
Pham, Tuan D. [1 ]
Holmes, Simon B. [1 ]
Zou, Lifong [1 ]
Patel, Mangala [1 ]
Coulthard, Paul [1 ]
机构
[1] Queen Mary Univ London, Barts & London Fac Med & Dent, Turner St, London E1 2AD, England
关键词
Pathological voice; Diagnosis; Feature extraction; Deep learning; Artificial intelligence; PARKINSONS-DISEASE; WAVE-PROPAGATION; SAMPLING THEORY; CLASSIFICATION; SCATTERING;
D O I
10.1016/j.compbiomed.2024.107976
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Pathological speech diagnosis is crucial for identifying and treating various speech disorders. Accurate diagnosis aids in developing targeted intervention strategies, improving patients' communication abilities, and enhancing their overall quality of life. With the rising incidence of speech -related conditions globally, including oral health, the need for efficient and reliable diagnostic tools has become paramount, emphasizing the significance of advanced research in this field. Methods: This paper introduces novel features for deep learning in the analysis of short voice signals. It proposes the incorporation of time -space and time-frequency features to accurately discern between two distinct groups: Individuals exhibiting normal vocal patterns and those manifesting pathological voice conditions. These advancements aim to enhance the precision and reliability of diagnostic procedures, paving the way for more targeted treatment approaches. Results: Utilizing a publicly available voice database, this study carried out training and validation using long short-term memory (LSTM) networks learning on the combined features, along with a data balancing strategy. The proposed approach yielded promising performance metrics: 90% accuracy, 93% sensitivity, 87% specificity, 88% precision, an F1 score of 0.90, and an area under the receiver operating characteristic curve of 0.96. The results surpassed those obtained by the networks trained using wavelet -time scattering coefficients, as well as several algorithms trained with alternative feature types. Conclusions: The incorporation of time-frequency and time -space features extracted from short segments of voice signals for LSTM learning demonstrates significant promise as an AI tool for the diagnosis of speech pathology. The proposed approach has the potential to enhance the accuracy and allow for real-time pathological speech assessment, thereby facilitating more targeted and effective therapeutic interventions.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Articulatory-to-speech conversion using bi-directional long short-term memory
    Taguchi, Fumiaki
    Kaburagi, Tokihiko
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2499 - 2503
  • [32] Prediction on the Urban GNSS Measurement Uncertainty Based on Deep Learning Networks With Long Short-Term Memory
    Zhang, Guohao
    Xu, Penghui
    Xu, Haosheng
    Hsu, Li-Ta
    IEEE SENSORS JOURNAL, 2021, 21 (18) : 20563 - 20577
  • [33] Air Traffic Controller Fatigue Detection Based on Facial and Vocal Features Using Long Short-Term Memory
    Huang, Zhousheng
    Tang, Weizhen
    Tian, Qiqi
    Huang, Ting
    Li, Jinze
    IEEE ACCESS, 2024, 12 : 56663 - 56682
  • [34] Disease Classification Based on Synthesis of Multiple Long Short-Term Memory Classifiers Corresponding to Eye Movement Features
    Mao, Yuxing
    He, Yinghong
    Liu, Lumei
    Chen, Xueshuo
    IEEE ACCESS, 2020, 8 : 151624 - 151633
  • [35] Long Short-Term Memory based Electrocardiogram diagnosis for Premature Ventricular Contraction in children
    Feng, Fei
    Huang, Yujuan
    Wang, Jianyi
    Liu, Li
    Luo, Jiajia
    2018 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2018, 10836
  • [36] Using Long Short-Term Memory for Building Outdoor Agricultural Machinery
    Wu, Chien-Hung
    Lu, Chun-Yi
    Zhan, Jun-We
    Wu, Hsin-Te
    FRONTIERS IN NEUROROBOTICS, 2020, 14
  • [37] De novo Molecular Design with Generative Long Short-term Memory
    Grisoni, Francesca
    Schneider, Gisbert
    CHIMIA, 2019, 73 (12) : 1006 - 1011
  • [38] ANALYSIS AND COMPARISON OF LONG SHORT-TERM MEMORY NETWORKS SHORT-TERM TRAFFIC PREDICTION PERFORMANCE
    Dogan, Erdem
    SCIENTIFIC JOURNAL OF SILESIAN UNIVERSITY OF TECHNOLOGY-SERIES TRANSPORT, 2020, 107 : 19 - 32
  • [39] Machine Learning Applications in Supply Chains: Long Short-Term Memory for Demand Forecasting
    Bousqaoui, Halima
    Achchab, Said
    Tikito, Kawtar
    CLOUD COMPUTING AND BIG DATA: TECHNOLOGIES, APPLICATIONS AND SECURITY, 2019, 49 : 301 - 317
  • [40] Automatic image generation based on deep learning long short-term memory network
    Yao, Xu
    Cao, Weiran
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2025, 25 (01) : 17 - 27