Gradient Boosting Machine and Efficient Combination of Features for Speech-Based Detection of COVID-19

被引:33
作者
Dash, Tusar Kanti [1 ]
Chakraborty, Chinmay [2 ]
Mahapatra, Satyajit [3 ]
Panda, Ganapati [1 ]
机构
[1] CV Raman Global Univ, Elect & Commun Engn, Bhubaneswar 752054, India
[2] Birla Inst Technol, Elect & Commun Engn, Mesra 835215, India
[3] VIT Bhopal Univ, Sch Elect & Elect Engn, Bhopal 466114, India
关键词
COVID-19; Feature extraction; Boosting; Noise level; Speech recognition; Bioinformatics; Respiratory system; detection; LightGBM; speech classification; feature fusion; health informatics; CLASSIFICATION; RECOGNITION; SELECTION; NOISE;
D O I
10.1109/JBHI.2022.3197910
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent times, speech-based automatic disease detection systems have shown several promising results in biomedical and life science applications, especially in the case of respiratory diseases. It provides a quick, cost-effective, reliable, and non-invasive potential alternative detection option for COVID-19 in the ongoing pandemic scenario since the subject's voice can be remotely recorded and sent for further analysis. The existing COVID-19 detection methods including RT-PCR, and chest X-ray tests are not only costlier but also require the involvement of a trained technician. The present paper proposes a novel speech-based respiratory disease detection scheme for COVID-19 and Asthma using the Gradient Boosting Machine-based classifier. From the recorded speech samples, the spectral, cepstral, and periodicity features, as well as spectral descriptors, are computed and then homogeneously fused to obtain relevant statistical features. These features are subsequently used as inputs to the Gradient Boosting Machine. The various performance matrices of the proposed model have been obtained using thirteen sound categories' speech data collected from more than 50 countries using five standard datasets for accurate diagnosis of respiratory diseases including COVID-19. The overall average accuracy achieved by the proposed model using the stratified k-fold cross-validation test is above 97%. The analysis of various performance matrices demonstrates that under the current pandemic scenario, the proposed COVID-19 detection scheme can be gainfully employed by physicians.
引用
收藏
页码:5364 / 5371
页数:8
相关论文
共 46 条
[1]   Classification of speech dysfluencies with MFCC and LPCC features [J].
Ai, Ooi Chia ;
Hariharan, M. ;
Yaacob, Sazali ;
Chee, Lim Sin .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) :2157-2165
[2]  
[Anonymous], MATLAB AUD TOOLB
[3]   COVID-19 diagnosis system by deep learning approaches [J].
Bhuyan, Hemanta Kumar ;
Chakraborty, Chinmay ;
Shelke, Yogesh ;
Pani, Suvendu Kumar .
EXPERT SYSTEMS, 2022, 39 (03)
[4]   Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data [J].
Brown, Chloe ;
Chauhan, Jagmohan ;
Grammenos, Andreas ;
Han, Jing ;
Hasthanasombat, Apinan ;
Spathis, Dimitris ;
Xia, Tong ;
Cicuta, Pietro ;
Mascolo, Cecilia .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :3474-3484
[5]  
Chaudhari G, 2021, Arxiv, DOI [arXiv:2011.13320, DOI 10.48550/ARXIV.2011.13320]
[6]   LightGBM-PPI: Predicting protein-protein interactions through LightGBM with multi-information fusion [J].
Chen, Cheng ;
Zhang, Qingmei ;
Ma, Qin ;
Yu, Bin .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 191 :54-64
[7]   Online Monaural Speech Enhancement Based on Periodicity Analysis and A Priori SNR Estimation [J].
Chen, Zhangli ;
Hohmann, Volker .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) :1904-1916
[8]  
Chinmay C., 2021, EAI ENDORSED T PERVA, V21, P1
[9]   Detection of COVID-19 from speech signal using bio-inspired based cepstral features [J].
Dash, Tusar Kanti ;
Mishra, Soumya ;
Panda, Ganapati ;
Satapathy, Suresh Chandra .
PATTERN RECOGNITION, 2021, 117
[10]   Improved phase aware speech enhancement using bio-inspired and ANN techniques [J].
Dash, Tusar Kanti ;
Solanki, Sandeep Singh ;
Panda, Ganapati .
ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2020, 102 (03) :465-477