Time-frequency analysis of speech signals using the Stockwell transform for the detection of upper respiratory tract infection

被引:1
作者
Warule, Pankaj [1 ]
Mishra, Siba Prasad [2 ]
Deb, Suman [2 ]
Krajewski, Jarek [3 ]
机构
[1] Pravara Rural Engn Coll, Loni, Maharashtra, India
[2] Sardar Vallabhbhai Natl Inst Technol, Surat, Gujarat, India
[3] Inst Expt Psychophysiol, Dusseldorf, Germany
关键词
Common cold; Ensemble of classifier; Stockwell transform; Support vector machines; Time-frequency analysis; Upper respiratory tract infections; CLASSIFICATION; FEATURES; COLD;
D O I
10.1016/j.apacoust.2024.110339
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The acoustic properties of speech demonstrate modifications in the presence of different health states. Biomedical engineering has great promise for creating non-invasive diagnostic processes that use speech as a biomarker. The use of speech indications to screen for upper respiratory tract infections (URTIs), such as the common cold, may have potential advantages in terms of limiting transmission. In this study, we have employed the Stockwell transform-based time-frequency (TF) analysis of speech signals for URTI detection. The Stockwell transform is applied on speech signals to derive their TF representation. Using a TF matrix, the various statistics of magnitude and phase are calculated and used as features for classifying speech of healthy speakers and speakers with URTI. The URTIC database is employed for evaluating the proposed features. The utilization of an ensemble of support vector machines (SVM) is proposed as a classification approach to address the issue of class imbalance. The results show that the proposed method produces comparable outcomes to state-of-the-art approaches. The proposed features obtain 66.53% and 64.65% UARs on the development and test partitions of the URTIC database.
引用
收藏
页数:7
相关论文
共 30 条
[1]   End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum [J].
Cai, Danwei ;
Ni, Zhidong ;
Liu, Wenbo ;
Cai, Weicheng ;
Li, Gang ;
Li, Ming .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :3452-3456
[2]   Affect Detection: An Interdisciplinary Review of Models, Methods, and Their Applications [J].
Calvo, Rafael A. ;
D'Mello, Sidney .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2010, 1 (01) :18-37
[3]   Speech analysis for health: Current state-of-the-art and the increasing impact of deep learning [J].
Cummins, Nicholas ;
Baird, Alice ;
Schuller, Bjoern W. .
METHODS, 2018, 151 :41-54
[4]   Synchrosqueezed wavelet transforms: An empirical mode decomposition-like tool [J].
Daubechies, Ingrid ;
Lu, Jianfeng ;
Wu, Hau-Tieng .
APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2011, 30 (02) :243-261
[5]   Detection of Common Cold from Speech Signals using Deep Neural Network [J].
Deb, Suman ;
Warule, Pankaj ;
Nair, Amrita ;
Sultan, Haider ;
Dash, Rahul ;
Krajewski, Jarek .
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 42 (3) :1707-1722
[6]   Analysis and Classification of Cold Speech Using Variational Mode Decomposition [J].
Deb, Suman ;
Dandapat, Samarendra ;
Krajewski, Jarek .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2020, 11 (02) :296-307
[7]   Frequency-based window width optimization for S-transform [J].
Djurovic, Igor ;
Sejdic, Ervin ;
Jiang, Jin .
AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2008, 62 (04) :245-250
[8]   Understanding the symptoms of the common cold and influenza [J].
Eccles, R .
LANCET INFECTIOUS DISEASES, 2005, 5 (11) :718-725
[9]   Using the Fisher Vector Approach for Cold Identification [J].
Egas-Lopez, Jose Vicente ;
Gosztolya, Gabor .
ACTA CYBERNETICA, 2021, 25 (02) :223-232
[10]   Survey on speech emotion recognition: Features, classification schemes, and databases [J].
El Ayadi, Moataz ;
Kamel, Mohamed S. ;
Karray, Fakhri .
PATTERN RECOGNITION, 2011, 44 (03) :572-587