Diagnosis of pathological speech with streamlined features for long short-term memory learning

被引:4
|
作者
Pham, Tuan D. [1 ]
Holmes, Simon B. [1 ]
Zou, Lifong [1 ]
Patel, Mangala [1 ]
Coulthard, Paul [1 ]
机构
[1] Queen Mary Univ London, Barts & London Fac Med & Dent, Turner St, London E1 2AD, England
关键词
Pathological voice; Diagnosis; Feature extraction; Deep learning; Artificial intelligence; PARKINSONS-DISEASE; WAVE-PROPAGATION; SAMPLING THEORY; CLASSIFICATION; SCATTERING;
D O I
10.1016/j.compbiomed.2024.107976
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Pathological speech diagnosis is crucial for identifying and treating various speech disorders. Accurate diagnosis aids in developing targeted intervention strategies, improving patients' communication abilities, and enhancing their overall quality of life. With the rising incidence of speech -related conditions globally, including oral health, the need for efficient and reliable diagnostic tools has become paramount, emphasizing the significance of advanced research in this field. Methods: This paper introduces novel features for deep learning in the analysis of short voice signals. It proposes the incorporation of time -space and time-frequency features to accurately discern between two distinct groups: Individuals exhibiting normal vocal patterns and those manifesting pathological voice conditions. These advancements aim to enhance the precision and reliability of diagnostic procedures, paving the way for more targeted treatment approaches. Results: Utilizing a publicly available voice database, this study carried out training and validation using long short-term memory (LSTM) networks learning on the combined features, along with a data balancing strategy. The proposed approach yielded promising performance metrics: 90% accuracy, 93% sensitivity, 87% specificity, 88% precision, an F1 score of 0.90, and an area under the receiver operating characteristic curve of 0.96. The results surpassed those obtained by the networks trained using wavelet -time scattering coefficients, as well as several algorithms trained with alternative feature types. Conclusions: The incorporation of time-frequency and time -space features extracted from short segments of voice signals for LSTM learning demonstrates significant promise as an AI tool for the diagnosis of speech pathology. The proposed approach has the potential to enhance the accuracy and allow for real-time pathological speech assessment, thereby facilitating more targeted and effective therapeutic interventions.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Classification of Power Quality Disturbances Using Convolutional Network and Long Short-Term Memory Network
    Rodrigues Junior, Wilson Leal
    Silva Borges, Fabbio Anderson
    Lira Rabelo, Ricardo de A.
    Alves de Lima, Bruno Vicente
    Almeida de Alencar, Jose Eduardo
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [42] Deep learning framework for stock price prediction using long short-term memory
    Chandar S.K.
    Soft Comput., 17-18 (10557-10567): : 10557 - 10567
  • [43] Deep Learning for Wave Energy Converter Modeling Using Long Short-Term Memory
    Mousavi, Seyed Milad
    Ghasemi, Majid
    Dehghan Manshadi, Mahsa
    Mosavi, Amir
    MATHEMATICS, 2021, 9 (08)
  • [44] Recognition of aggressive episodes of pigs based on convolutional neural network and long short-term memory
    Chen, Chen
    Zhu, Weixing
    Steibel, Juan
    Siegford, Janice
    Wurtz, Kaitlin
    Han, Junjie
    Norton, Tomas
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 169
  • [45] Application of deep learning to multivariate aviation weather forecasting by long short-term memory
    Chen, Chuen-Jyh
    Huang, Chieh-Ni
    Yang, Shih-Ming
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (03) : 4987 - 4997
  • [46] Prediction of Sorption Processes Using the Deep Learning Methods (Long Short-Term Memory)
    Skrobek, Dorian
    Krzywanski, Jaroslaw
    Sosnowski, Marcin
    Kulakowska, Anna
    Zylka, Anna
    Grabowska, Karolina
    Ciesielska, Katarzyna
    Nowak, Wojciech
    ENERGIES, 2020, 13 (24)
  • [47] Traffic Prediction Based on Random Connectivity in Deep Learning with Long Short-Term Memory
    Hua, Yuxiu
    Zhao, Zhifeng
    Liu, Zhiming
    Chen, Xianfu
    Li, Rongpeng
    Zhang, Honggang
    2018 IEEE 88TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-FALL), 2018,
  • [48] Computer vision and long short-term memory: Learning to predict unsafe behaviour in construction
    Kong, Ting
    Fang, Weili
    Love, Peter E. D.
    Luo, Hanbin
    Xu, Shuangjie
    Li, Heng
    ADVANCED ENGINEERING INFORMATICS, 2021, 50
  • [49] Brain tumor detection: a long short-term memory (LSTM)-based learning model
    Amin, Javaria
    Sharif, Muhammad
    Raza, Mudassar
    Saba, Tanzila
    Sial, Rafiq
    Shad, Shafqat Ali
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (20) : 15965 - 15973
  • [50] Remaining Useful Life Estimation Using Long Short-Term Memory Deep Learning
    Hsu, Che-Sheng
    Jiang, Jehn-Ruey
    PROCEEDINGS OF 4TH IEEE INTERNATIONAL CONFERENCE ON APPLIED SYSTEM INNOVATION 2018 ( IEEE ICASI 2018 ), 2018, : 58 - 61