Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data

被引：208

作者：

Brown, Chloe ^{[1
]}

Chauhan, Jagmohan ^{[1
]}

Grammenos, Andreas ^{[1
,2
]}

Han, Jing ^{[1
]}

Hasthanasombat, Apinan ^{[1
]}

Spathis, Dimitris ^{[1
]}

Xia, Tong ^{[1
]}

Cicuta, Pietro ^{[1
]}

Mascolo, Cecilia ^{[1
]}

机构：

[1] Univ Cambridge, Cambridge, England

[2] Alan Turing Inst, London, England

来源：

KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2020年

关键词：

COVID-19; Crowdsourcing Platform; Audio Analysis; Coughing; Breathing;

D O I：

10.1145/3394486.3412865

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Audio signals generated by the human body (e.g., sighs, breathing, heart, digestion, vibration sounds) have routinely been used by clinicians as indicators to diagnose disease or assess disease progression. Until recently, such signals were usually collected through manual auscultation at scheduled visits. Research has now started to use digital technology to gather bodily sounds (e.g., from digital stethoscopes) for cardiovascular or respiratory examination, which could then be used for automatic analysis. Some initial work shows promise in detecting diagnostic signals of COVID-19 from voice and coughs. In this paper we describe our data analysis over a large-scale crowdsourced dataset of respiratory sounds collected to aid diagnosis of COVID-19. We use coughs and breathing to understand how discernible COVID-19 sounds are from those in asthma or healthy controls. Our results show that even a simple binary machine learning classifier is able to classify correctly healthy and COVID-19 sounds. We also show how we distinguish a user who tested positive for COVID-19 and has a cough from a healthy user with a cough, and users who tested positive for COVID-19 and have a cough from users with asthma and a cough. Our models achieve an AUC of above 80% across all tasks. These results are preliminary and only scratch the surface of the potential of this type of data and audio-based machine learning. This work opens the door to further investigation of how automatically analysed respiratory patterns could be used as pre-screening signals to aid COVID-19 diagnosis.

引用

页码：3474 / 3484

页数：11

共 30 条

[21]

Menni Cristina, 2020, NAT MED, V10, P1038

[22]

Oletic Dinko, 2016, IEEE SENS J, V16, P8295

[23]

Pramono Renard Xaviero Adhi, 2016, PLOS ONE, V11

[24]

Pramono Renard Xaviero Adhi, 2017, PLOS ONE, V12, DOI [10.1371/journal.pone. 0177926, DOI 10.1371/J0URNAL.P0NE.0177926]

[25]

Quatieri Thomas, 2020, IEEE OPEN J ENG MED, V2020

[26]

Rachuri KK, 2010, UBICOMP 2010: PROCEEDINGS OF THE 2010 ACM CONFERENCE ON UBIQUITOUS COMPUTING, P281

[27] Analyzing the effectiveness of vocal features in early telediagnosis of Parkinson's disease [J].

Sakar, Betul Erdogdu ;

Serbes, Gorkem ;

Sakar, C. Okan .

PLOS ONE, 2017, 12 (08)

[28]

Schluter Jan, 2015, INT SOC MUSIC INFORM, P121

[29]

Sharma Neeraj, 2020, ARXIV200510548EESSAS

[30]

Tobias Marc Weber, 2020, AI MED DIAGNOSTICS C

← 1 2 3 →