Detecting Cough Recordings in Crowdsourced Data Using CNN-RNN

被引：1

作者：

Sharan, Roneel V. ^{[1
]}

Xiong, Hao ^{[1
]}

Berkovsky, Shlomo ^{[1
]}

机构：

[1] Macquarie Univ, Australian Inst Hlth Innovat, Sydney, NSW, Australia

来源：

2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22) | 2022年

关键词：

cough sound; crowdsourced; deep learning; melspectrogram; respiratory diseases;

D O I：

10.1109/BHI56158.2022.9926896

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The sound of cough is an important indicator of the condition of the respiratory system. Automatic cough sound evaluation can aid the diagnosis of respiratory diseases. Large crow dsourced cough sound datasets have recently been used by several groups around the world to develop cough classification models. However, not all recordings in these datasets contain cough sounds. As such, it is important to screen the recordings for the presence of cough sounds before developing cough classification models. This work proposes a method to screen crowdsourced audio recordings for cough sounds using deep learning methods. The proposed approach divides the audio recording into overlapping frames and converts each frame into a mel-spectrogram representation. A pretrained convolutional neural network for audio classification is trained to learn the spectral characteristics of cough and non-cough frames from its mel-spectrogram representation. It is combined with a recurrent neural network to learn the dependencies between the sequence of frames. The proposed method is evaluated on 400 crowdsourced audio recordings, manually annotated as cough or non-cough. An accuracy of 0.9800 (AUC of 0.9973) is achieved in classifying cough and non-cough recordings using the proposed method. The trained network is used to analyze the remaining audio recordings in the dataset, identifying only about 67% of recordings as containing usable cough sounds. This shows the need to exercise caution when using crowdsourced cough data.

引用

页数：4

共 16 条

[1] Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data
Brown, Chloe
Chauhan, Jagmohan
Grammenos, Andreas
Han, Jing
Hasthanasombat, Apinan
Spathis, Dimitris
Xia, Tong
Cicuta, Pietro
Mascolo, Cecilia
[J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3474 - 3484
[2] End-to-end convolutional neural network enables COVID-19 detection from breath and cough audio: a pilot study
Coppock, Harry
Gaskell, Alex
Tzirakis, Panagiotis
Baird, Alice
Jones, Lyn
Schuller, Bjoern
[J]. BMJ INNOVATIONS, 2021, 7 (02) : 356 - 362
[3] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES
DAVIS, SB
MERMELSTEIN, P
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04): : 357 - 366
[4] Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
[5] Gemmeke JF, 2017, INT CONF ACOUST SPEE, P776, DOI 10.1109/ICASSP.2017.7952261
[6] Framewise phoneme classification with bidirectional LSTM and other neural network architectures
Graves, A
Schmidhuber, J
[J]. NEURAL NETWORKS, 2005, 18 (5-6) : 602 - 610
[7] Hershey S, 2017, INT CONF ACOUST SPEE, P131, DOI 10.1109/ICASSP.2017.7952132
[8] COVID-19 Artificial Intelligence Diagnosis Using Only Cough Recordings
Laguarta, Jordi
Hueto, Ferran
Subirana, Brian
[J]. IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY, 2020, 1 (01): : 275 - 281
[9] Liu SY, 2015, PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, P730, DOI 10.1109/ACPR.2015.7486599
[10] Discrimination of productive and non-productive cough by sound analysis
Murata, A
Taniguchi, Y
Hashimoto, Y
Kaneko, Y
Takasaki, Y
Kudoh, S
[J]. INTERNAL MEDICINE, 1998, 37 (09) : 732 - 735

← 1 2 →