Detecting Cough Recordings in Crowdsourced Data Using CNN-RNN

被引:1
作者
Sharan, Roneel V. [1 ]
Xiong, Hao [1 ]
Berkovsky, Shlomo [1 ]
机构
[1] Macquarie Univ, Australian Inst Hlth Innovat, Sydney, NSW, Australia
来源
2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22) | 2022年
关键词
cough sound; crowdsourced; deep learning; melspectrogram; respiratory diseases;
D O I
10.1109/BHI56158.2022.9926896
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The sound of cough is an important indicator of the condition of the respiratory system. Automatic cough sound evaluation can aid the diagnosis of respiratory diseases. Large crow dsourced cough sound datasets have recently been used by several groups around the world to develop cough classification models. However, not all recordings in these datasets contain cough sounds. As such, it is important to screen the recordings for the presence of cough sounds before developing cough classification models. This work proposes a method to screen crowdsourced audio recordings for cough sounds using deep learning methods. The proposed approach divides the audio recording into overlapping frames and converts each frame into a mel-spectrogram representation. A pretrained convolutional neural network for audio classification is trained to learn the spectral characteristics of cough and non-cough frames from its mel-spectrogram representation. It is combined with a recurrent neural network to learn the dependencies between the sequence of frames. The proposed method is evaluated on 400 crowdsourced audio recordings, manually annotated as cough or non-cough. An accuracy of 0.9800 (AUC of 0.9973) is achieved in classifying cough and non-cough recordings using the proposed method. The trained network is used to analyze the remaining audio recordings in the dataset, identifying only about 67% of recordings as containing usable cough sounds. This shows the need to exercise caution when using crowdsourced cough data.
引用
收藏
页数:4
相关论文
共 16 条
  • [1] Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data
    Brown, Chloe
    Chauhan, Jagmohan
    Grammenos, Andreas
    Han, Jing
    Hasthanasombat, Apinan
    Spathis, Dimitris
    Xia, Tong
    Cicuta, Pietro
    Mascolo, Cecilia
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3474 - 3484
  • [2] End-to-end convolutional neural network enables COVID-19 detection from breath and cough audio: a pilot study
    Coppock, Harry
    Gaskell, Alex
    Tzirakis, Panagiotis
    Baird, Alice
    Jones, Lyn
    Schuller, Bjoern
    [J]. BMJ INNOVATIONS, 2021, 7 (02) : 356 - 362
  • [3] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES
    DAVIS, SB
    MERMELSTEIN, P
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04): : 357 - 366
  • [4] Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
  • [5] Gemmeke JF, 2017, INT CONF ACOUST SPEE, P776, DOI 10.1109/ICASSP.2017.7952261
  • [6] Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    Graves, A
    Schmidhuber, J
    [J]. NEURAL NETWORKS, 2005, 18 (5-6) : 602 - 610
  • [7] Hershey S, 2017, INT CONF ACOUST SPEE, P131, DOI 10.1109/ICASSP.2017.7952132
  • [8] COVID-19 Artificial Intelligence Diagnosis Using Only Cough Recordings
    Laguarta, Jordi
    Hueto, Ferran
    Subirana, Brian
    [J]. IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY, 2020, 1 (01): : 275 - 281
  • [9] Liu SY, 2015, PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, P730, DOI 10.1109/ACPR.2015.7486599
  • [10] Discrimination of productive and non-productive cough by sound analysis
    Murata, A
    Taniguchi, Y
    Hashimoto, Y
    Kaneko, Y
    Takasaki, Y
    Kudoh, S
    [J]. INTERNAL MEDICINE, 1998, 37 (09) : 732 - 735