Transfer learning for the efficient detection of COVID-19 from smartphone audio data

被引:6
作者
Campana, Mattia Giovanni [1 ]
Delmastro, Franca [1 ]
Pagani, Elena [2 ]
机构
[1] Inst Informat & Telemat Natl Res Council Italy IIT, Pisa, Italy
[2] Univ Milan, Comp Sci Dept, Milan, Italy
关键词
Deep audio embeddings; Transfer learning; Deep learning; m-health; COVID-19;
D O I
10.1016/j.pmcj.2023.101754
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Disease detection from smartphone data represents an open research challenge in mobile health (m-health) systems. COVID-19 and its respiratory symptoms are an important case study in this area and their early detection is a potential real instrument to counteract the pandemic situation. The efficacy of this solution mainly depends on the performances of AI algorithms applied to the collected data and their possible implementation directly on the users' mobile devices. Considering these issues, and the limited amount of available data, in this paper we present the experimental evaluation of 3 different deep learning models, compared also with hand-crafted features, and of two main approaches of transfer learning in the considered scenario: both feature extraction and fine-tuning. Specifically, we considered VGGish, YAMNET, and L3-Net (including 12 different configurations) evaluated through user-independent experiments on 4 different datasets (13,447 samples in total). Results clearly show the advantages of L3-Net in all the experimental settings as it overcomes the other solutions by 12.3% in terms of Precision-Recall AUC as features extractor, and by 10% when the model is fine-tuned. Moreover, we note that to fine-tune only the fully-connected layers of the pre-trained models generally leads to worse performances, with an average drop of 6.6% with respect to feature extraction. Finally, we evaluate the memory footprints of the different models for their possible applications on commercial mobile devices.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 49 条
  • [1] Abu-El-Haija S., 2016, PREPRINT, DOI DOI 10.48550/ARXIV.1609.08675
  • [2] Look, Listen and Learn
    Arandjelovic, Relja
    Zisserman, Andrew
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 609 - 617
  • [3] Bengio Y., 2012, P ICML WORKSH UNS TR, V7, P19
  • [4] A Survey of Predictive Modeling on Im balanced Domains
    Branco, Paula
    Torgo, Luis
    Ribeiro, Rita P.
    [J]. ACM COMPUTING SURVEYS, 2016, 49 (02)
  • [5] Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data
    Brown, Chloe
    Chauhan, Jagmohan
    Grammenos, Andreas
    Han, Jing
    Hasthanasombat, Apinan
    Spathis, Dimitris
    Xia, Tong
    Cicuta, Pietro
    Mascolo, Cecilia
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3474 - 3484
  • [6] L3-Net Deep Audio Embeddings to Improve COVID-19 Detection from Smartphone Data
    Campana, Mattia Giovanni
    Rovati, Andrea
    Delmastro, Franca
    Pagani, Elena
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2022), 2022, : 100 - 107
  • [7] A Survey on Applications of Artificial Intelligence in Fighting Against COVID-19
    Chen, Jianguo
    Li, Kenli
    Zhang, Zhaolei
    Li, Keqin
    Yu, Philip S.
    [J]. ACM COMPUTING SURVEYS, 2021, 54 (08)
  • [8] Chen Z., 2022, P IEEE BIOM CIRC SYS, DOI [10.1109/biocas54905.2022.9948614, DOI 10.1109/BIOCAS54905.2022.9948614, 10.1109/BioCAS54905.2022.9948614]
  • [9] Cramer J, 2019, INT CONF ACOUST SPEE, P3852, DOI 10.1109/ICASSP.2019.8682475
  • [10] Convolutional neural networks based efficient approach for classification of lung diseases
    Demir, Fatih
    Sengur, Abdulkadir
    Bajaj, Varun
    [J]. HEALTH INFORMATION SCIENCE AND SYSTEMS, 2019, 8 (01)