Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data

Cited by: 208

Authors:

Brown, Chloe [1]

Chauhan, Jagmohan [1]

Grammenos, Andreas [1,2]

Han, Jing [1]

Hasthanasombat, Apinan [1]

Spathis, Dimitris [1]

Xia, Tong [1]

Cicuta, Pietro [1]

Mascolo, Cecilia [1]

Affiliations:

[1] Univ Cambridge, Cambridge, England

[2] Alan Turing Inst, London, England

Source:

KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2020

Keywords:

COVID-19; Crowdsourcing Platform; Audio Analysis; Coughing; Breathing

DOI:

10.1145/3394486.3412865

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Audio signals generated by the human body (e.g., sighs, breathing, heart, digestion, vibration sounds) have routinely been used by clinicians as indicators to diagnose disease or assess disease progression. Until recently, such signals were usually collected through manual auscultation at scheduled visits. Research has now started to use digital technology to gather bodily sounds (e.g., from digital stethoscopes) for cardiovascular or respiratory examination, which could then be used for automatic analysis. Some initial work shows promise in detecting diagnostic signals of COVID-19 from voice and coughs. In this paper we describe our data analysis over a large-scale crowdsourced dataset of respiratory sounds collected to aid diagnosis of COVID-19. We use coughs and breathing to understand how discernible COVID-19 sounds are from those in asthma or healthy controls. Our results show that even a simple binary machine learning classifier is able to correctly classify healthy and COVID-19 sounds. We also show how we distinguish a user who tested positive for COVID-19 and has a cough from a healthy user with a cough, and users who tested positive for COVID-19 and have a cough from users with asthma and a cough. Our models achieve an AUC above 80% across all tasks. These results are preliminary and only scratch the surface of the potential of this type of data and audio-based machine learning. This work opens the door to further investigation of how automatically analysed respiratory patterns could be used as pre-screening signals to aid COVID-19 diagnosis.
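The abstract reports results as AUC (area under the ROC curve) for binary classification tasks. As a reading aid, the sketch below shows one common way to compute that metric from classifier scores, using the pairwise rank identity: AUC is the fraction of (positive, negative) pairs in which the positive example receives the higher score, with ties counted as half. The labels and scores here are made-up toy values, and the function name `roc_auc` is hypothetical; none of this is taken from the paper or represents the authors' pipeline.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: iterable of 0/1 ground-truth classes (1 = positive).
    scores: iterable of classifier scores, higher = more likely positive.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0      # positive ranked above negative
            elif p == n:
                wins += 0.5      # ties count as half
    return wins / (len(pos) * len(neg))


# Toy data (illustrative only, not from the paper).
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")  # AUC = 0.889
```

With these toy values, 8 of the 9 positive/negative pairs are ordered correctly, giving an AUC of about 0.889. An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation.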

Citation

Pages: 3474-3484

Number of pages: 11
