An AI-Enabled Bias-Free Respiratory Disease Diagnosis Model Using Cough Audio

被引:3
作者
Saeed, Tabish [1 ]
Ijaz, Aneeqa [1 ]
Sadiq, Ismail [2 ]
Qureshi, Haneya Naeem [1 ]
Rizwan, Ali [3 ]
Imran, Ali [1 ,2 ]
机构
[1] Univ Oklahoma, Res Ctr AI4Networks, Dept Elect & Comp Engn, Tulsa, OK 74135 USA
[2] Univ Glasgow, James Watt Sch Engn, Glasgow G12 8QQ, Scotland
[3] AI4lyf, Lahore 54000, Pakistan
来源
BIOENGINEERING-BASEL | 2024年 / 11卷 / 01期
关键词
cough; COVID-19; confounding variables; spectrograms; diagnosis; deep-learning; c-GAN; respiratory diseases;
D O I
10.3390/bioengineering11010055
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Cough-based diagnosis for respiratory diseases (RDs) using artificial intelligence (AI) has attracted considerable attention, yet many existing studies overlook confounding variables in their predictive models. These variables can distort the relationship between cough recordings (input data) and RD status (output variable), leading to biased associations and unrealistic model performance. To address this gap, we propose the Bias-Free Network (RBF-Net), an end-to-end solution that effectively mitigates the impact of confounders in the training data distribution. RBF-Net ensures accurate and unbiased RD diagnosis features, emphasizing its relevance by incorporating a COVID-19 dataset in this study. This approach aims to enhance the reliability of AI-based RD diagnosis models by navigating the challenges posed by confounding variables. A hybrid of a Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks is proposed for the feature encoder module of RBF-Net. An additional bias predictor is incorporated in the classification scheme to formulate a conditional Generative Adversarial Network (c-GAN) that helps in decorrelating the impact of confounding variables from RD prediction. The merit of RBF-Net is demonstrated by comparing classification performance with a State-of-The-Art (SoTA) Deep Learning (DL) model (CNN-LSTM) after training on different unbalanced COVID-19 data sets, created by using a large-scale proprietary cough data set. RBF-Net proved its robustness against extremely biased training scenarios by achieving test set accuracies of 84.1%, 84.6%, and 80.5% for the following confounding variables-gender, age, and smoking status, respectively. RBF-Net outperforms the CNN-LSTM model test set accuracies by 5.5%, 7.7%, and 8.2%, respectively.
引用
收藏
页数:17
相关论文
共 49 条
  • [1] A Generic Deep Learning Based Cough Analysis System From Clinically Validated Samples for Point-of-Need Covid-19 Test and Severity Levels
    Andreu-Perez, Javier
    Perez-Espinosa, Humberto
    Timonet, Eva
    Kiani, Mehrin
    Giron-Perez, Manuel, I
    Benitez-Trinidad, Alma B.
    Jarchi, Delaram
    Rosales-Perez, Alejandro
    Gkatzoulis, Nick
    Reyes-Galaviz, Orion F.
    Torres-Garcia, Alejandro
    Reyes-Garcia, Carlos A.
    Ali, Zulfiqar
    Rivas, Francisco
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (03) : 1220 - 1232
  • [2] [Anonymous], Respiratory Diseases in the World
  • [3] [Anonymous], 2021, AI4COVID-19: An Artificial Intelligence Powered App for Detecting COVID-19 from Cough Sound
  • [4] [Anonymous], CHRONIC OBSTRUCTIVE
  • [5] Audacity Free, Open Source Cross-Platform Audio Software
  • [6] AutoML.org, ABOUT US
  • [7] Aytekin I, 2022, Arxiv, DOI [arXiv:2207.09529, 10.1109/JBHI.2023.3339700, DOI 10.1109/JBHI.2023.3339700]
  • [8] Bales C, 2020, E-HEALTH BIOENG CONF
  • [9] Bansal Vipin, 2020, 2020 IEEE International Conference on Computing, Power and Communication Technologies (GUCON), P604, DOI 10.1109/GUCON48875.2020.9231094
  • [10] Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data
    Brown, Chloe
    Chauhan, Jagmohan
    Grammenos, Andreas
    Han, Jing
    Hasthanasombat, Apinan
    Spathis, Dimitris
    Xia, Tong
    Cicuta, Pietro
    Mascolo, Cecilia
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3474 - 3484