Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data

被引:202
作者
Brown, Chloe [1 ]
Chauhan, Jagmohan [1 ]
Grammenos, Andreas [1 ,2 ]
Han, Jing [1 ]
Hasthanasombat, Apinan [1 ]
Spathis, Dimitris [1 ]
Xia, Tong [1 ]
Cicuta, Pietro [1 ]
Mascolo, Cecilia [1 ]
机构
[1] Univ Cambridge, Cambridge, England
[2] Alan Turing Inst, London, England
来源
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2020年
关键词
COVID-19; Crowdsourcing Platform; Audio Analysis; Coughing; Breathing;
D O I
10.1145/3394486.3412865
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Audio signals generated by the human body (e.g., sighs, breathing, heart, digestion, vibration sounds) have routinely been used by clinicians as indicators to diagnose disease or assess disease progression. Until recently, such signals were usually collected through manual auscultation at scheduled visits. Research has now started to use digital technology to gather bodily sounds (e.g., from digital stethoscopes) for cardiovascular or respiratory examination, which could then be used for automatic analysis. Some initial work shows promise in detecting diagnostic signals of COVID-19 from voice and coughs. In this paper we describe our data analysis over a large-scale crowdsourced dataset of respiratory sounds collected to aid diagnosis of COVID-19. We use coughs and breathing to understand how discernible COVID-19 sounds are from those in asthma or healthy controls. Our results show that even a simple binary machine learning classifier is able to classify correctly healthy and COVID-19 sounds. We also show how we distinguish a user who tested positive for COVID-19 and has a cough from a healthy user with a cough, and users who tested positive for COVID-19 and have a cough from users with asthma and a cough. Our models achieve an AUC of above 80% across all tasks. These results are preliminary and only scratch the surface of the potential of this type of data and audio-based machine learning. This work opens the door to further investigation of how automatically analysed respiratory patterns could be used as pre-screening signals to aid COVID-19 diagnosis.
引用
收藏
页码:3474 / 3484
页数:11
相关论文
共 30 条
  • [1] [Anonymous], 2015, ACM MobiSys
  • [2] [Anonymous], 2020, COUGHVID
  • [3] [Anonymous], 2019, LIBR FEAT DELT
  • [4] [Anonymous], 2020, DETECT NOW
  • [5] Bales Charles, 2020, ARXIV200401495EESSAS
  • [6] A deep transfer learning approach for improved post-traumatic stress disorder diagnosis
    Banerjee, Debrup
    Islam, Kazi
    Xue, Keyi
    Mei, Gang
    Xiao, Lemin
    Zhang, Guangfan
    Xu, Roger
    Lei, Cai
    Ji, Shuiwang
    Li, Jiang
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (03) : 1693 - 1724
  • [7] Speech disorders in Parkinson's disease: early diagnostics and effects of medication and brain stimulation
    Brabenec, L.
    Mekyska, J.
    Galaz, Z.
    Rektorova, Irena
    [J]. JOURNAL OF NEURAL TRANSMISSION, 2017, 124 (03) : 303 - 334
  • [8] Cawley GC, 2010, J MACH LEARN RES, V11, P2079
  • [9] Chon Y, 2012, UBICOMP'12: PROCEEDINGS OF THE 2012 ACM INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING, P481
  • [10] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES
    DAVIS, SB
    MERMELSTEIN, P
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04): : 357 - 366