Early Time Series Classification Using Reinforcement Learning for Pre-Symptomatic Covid-19 Screening From Imbalanced Health Tracker Data

被引:0
作者
Sarwar, Atifa [1 ]
Almadani, Abdulsalam [1 ]
Agu, Emmanuel O. [1 ]
机构
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
关键词
COVID-19; Accuracy; Feature extraction; Heart rate; Time series analysis; Physiology; Infectious diseases; Testing; Reinforcement learning; Pandemics; Infectious Diseases; Covid-19; Passive Screening; Physiological signs; Early Time Series Classification; Class Imbalance; Reinforcement Learning; Health Tracker;
D O I
10.1109/JBHI.2024.3509630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Early detection of infectious diseases such as Covid-19 can limit transmission and curb pandemics. This study proposes EarlyDetect, an end-to-end framework for early Covid-19 detection using heart rate and step data collected passively from consumer-grade health trackers. A key challenge in early Covid-19 detection is determining the optimal amount of historical data (e.g., past days) a machine learning model should analyze to achieve the earliest possible, yet accurate, infection detection. Leveraging Reinforcement Learning-based Early Time Series Classification, EarlyDetect extracts 45 digital biomarkers (daily steps, daytime/nighttime HR, mesor, sedentary time), and feeds them into a deep Multi-layer Perceptron neural network model trained using Double Deep Q-Network. At each iteration, EarlyDetect dynamically decides whether to wait for more data or proceed with classifying the window of data observed so far. A novel reward function ensures early yet accurate classification in imbalanced class distributions. Using heart rate and steps values over 72 hours lookback window, EarlyDetect achieves an accuracy of 0.8 (95% CI 0.71-0.89), AUC-ROC of 0.73 (95% CI: 0.6-0.86), an earliness of 0.07 (95% CI: 0.05-0.09), thus requiring up to 86% less data than existing methods while predicting Covid-19 status 50% earlier (smaller detection window). Performance on two Covid-19 datasets was encouraging, identifying 61% and 46% of Covid+ cases before the coronavirus reached peak transmissibility. EarlyDetect is a significant advancement in early infectious disease screening, and is the first method to dynamically determine an optimal lookback window size for Covid-19 detection from physiological signs on imbalanced datasets using Reinforcement Learning-based Early Time Series Classification.
引用
收藏
页码:2246 / 2256
页数:11
相关论文
共 43 条
  • [1] PCovNet plus : A CNN-VAE anomaly detection framework with LSTM embeddings for smartwatch-based COVID-19 detection
    Abir, Farhan Fuad
    Chowdhury, Muhammad E. H.
    Tapotee, Malisha Islam
    Mushtak, Adam
    Khandakar, Amith
    Mahmud, Sakib
    Hasan, Md Anwarul
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [2] PCovNet: A presymptomatic COVID-19 detection framework using deep learning model using wearables data
    Abir, Farhan Fuad
    Alyafei, Khalid
    Chowdhury, Muhammad E. H.
    Khandakar, Amith
    Ahmed, Rashid
    Hossain, Muhammad Maqsud
    Mahmud, Sakib
    Rahman, Ashiqur
    Abbas, Tareq O.
    Zughaier, Susu M.
    Naji, Khalid Kamal
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 147
  • [3] Real-time alerting system for COVID-19 and other stress events using wearable data
    Alavi, Arash
    Bogu, Gireesh K.
    Wang, Meng
    Rangan, Ekanath Srihari
    Brooks, Andrew W.
    Wang, Qiwen
    Higgs, Emily
    Celli, Alessandra
    Mishra, Tejaswini
    Metwally, Ahmed A.
    Cha, Kexin
    Knowles, Peter
    Alavi, Amir A.
    Bhasin, Rajat
    Panchamukhi, Shrinivas
    Celis, Diego
    Aditya, Tagore
    Honkala, Alexander
    Rolnik, Benjamin
    Hunting, Erika
    Dagan-Rosenfeld, Orit
    Chauhanl, Arshdeep
    Li, Jessi W.
    Bejikian, Caroline
    Krishnan, Vandhana
    McGuire, Lettie
    Li, Xiao
    Bahmani, Amir
    Snyder, Michael P.
    [J]. NATURE MEDICINE, 2022, 28 (01) : 175 - +
  • [4] [Anonymous], 2024, Target heart rates chart
  • [5] [Anonymous], 2023, Worldometer: US Covid coronavirus statistics
  • [6] CALIMERA: A new early time series classification method
    Bilski, Jakub Michal
    Jastrzebska, Agnieszka
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (05)
  • [7] Bogu G.K., 2021, medRxiv
  • [8] Bridle J.S., 1989, ADV NEURAL INFORMATI, P211, DOI DOI 10.5555/2969830
  • [9] Early Time Series Anomaly Prediction With Multi-Objective Optimization
    Chao, Ting-En
    Huang, Yu
    Dai, Hao
    Yen, Gary G.
    Tseng, Vincent S.
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 972 - 987
  • [10] Development and validation of a deep learning model to diagnose COVID-19 using time-series heart rate values before the onset of symptoms
    Chung, Heewon
    Ko, Hoon
    Lee, Hooseok
    Yon, Dong Keon
    Lee, Won Hee
    Kim, Tae-Seong
    Kim, Kyung Won
    Lee, Jinseok
    [J]. JOURNAL OF MEDICAL VIROLOGY, 2023, 95 (02)