Factors Affecting the Quality of Person-Generated Wearable Device Data and Associated Challenges: Rapid Systematic Review

被引:52
作者
Cho, Sylvia [1 ]
Ensari, Ipek [2 ]
Weng, Chunhua [1 ]
Kahn, Michael G. [3 ]
Natarajan, Karthik [1 ,2 ]
机构
[1] Columbia Univ, Dept Biomed Informat, 622 West 168th St PH20, New York, NY 10032 USA
[2] Columbia Univ, Data Sci Inst, New York, NY USA
[3] Univ Colorado, Dept Pediat, Anschutz Med Campus, Denver, CO 80202 USA
来源
JMIR MHEALTH AND UHEALTH | 2021年 / 9卷 / 03期
关键词
patient generated health data; data accuracy; data quality; wearable device; fitness trackers; mobile phone; PHYSICAL-ACTIVITY; HEALTH DATA; TRACKERS; INTERNET; DESIGN; THINGS;
D O I
10.2196/20738
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: There is increasing interest in reusing person-generated wearable device data for research purposes, which raises concerns about data quality. However, the amount of literature on data quality challenges, specifically those for person-generated wearable device data, is sparse. Objective: This study aims to systematically review the literature on factors affecting the quality of person-generated wearable device data and their associated intrinsic data quality challenges for research. Methods: The literature was searched in the PubMed, Association for Computing Machinery, Institute of Electrical and Electronics Engineers, and Google Scholar databases by using search terms related to wearable devices and data quality. By using PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, studies were reviewed to identify factors affecting the quality of wearable device data. Studies were eligible if they included content on the data quality of wearable devices, such as fitness trackers and sleep monitors. Both research-grade and consumer-grade wearable devices were included in the review. Relevant content was annotated and iteratively categorized into semantically similar factors until a consensus was reached. If any data quality challenges were mentioned in the study, those contents were extracted and categorized as well. Results: A total of 19 papers were included in this review. We identified three high-level factors that affect data quality-device-and technical-related factors, user-related factors, and data governance-related factors. Device- and technical-related factors include problems with hardware, software, and the connectivity of the device; user-related factors include device nonwear and user error; and data governance-related factors include a lack of standardization. The identified factors can potentially lead to intrinsic data quality challenges, such as incomplete, incorrect, and heterogeneous data. Although missing and incorrect data are widely known data quality challenges for wearable devices, the heterogeneity of data is another aspect of data quality that should be considered for wearable devices. Heterogeneity in wearable device data exists at three levels: heterogeneity in data generated by a single person using a single device (within-person heterogeneity); heterogeneity in data generated by multiple people who use the same brand, model, and version of a device (between-person heterogeneity); and heterogeneity in data generated from multiple people using different devices (between-person heterogeneity), which would apply especially to data collected under a bring-your-own-device policy. Conclusions: Our study identifies potential intrinsic data quality challenges that could occur when analyzing wearable device data for research and three major contributing factors for these challenges. As poor data quality can compromise the reliability and accuracy of research results, further investigation is needed on how to address the data quality challenges of wearable devices.
引用
收藏
页数:12
相关论文
共 54 条
[21]   Utility of Consumer Physical Activity Trackers as an Intervention Tool in Cardiovascular Disease Prevention and Treatment [J].
Hickey, Amanda M. ;
Freedson, Patty S. .
PROGRESS IN CARDIOVASCULAR DISEASES, 2016, 58 (06) :613-619
[22]   Best practices for analyzing large-scale health data from wearables and smartphone apps [J].
Hicks, Jennifer L. ;
Althoff, Tim ;
Sosic, Rok ;
Kuhar, Peter ;
Bostjancic, Bojan ;
King, Abby C. ;
Leskovec, Jure ;
Delp, Scott L. .
NPJ DIGITAL MEDICINE, 2019, 2 (1)
[23]   Wearable Devices in Clinical Trials: Hype and Hypothesis [J].
Izmailova, Elena S. ;
Wagner, John A. ;
Perakslis, Eric D. .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2018, 104 (01) :42-52
[24]  
Julicher T., 2018, Big Data in Context, P81
[25]  
Kahn Michael G, 2016, EGEMS (Wash DC), V4, P1244
[26]   Data quality in internet of things: A state-of-the-art survey [J].
Karkouch, Aimad ;
Mousannif, Hajar ;
Al Moatassime, Hassan ;
Noel, Thomas .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 73 :57-81
[27]  
Khare Ritu, 2018, AMIA Jt Summits Transl Sci Proc, V2017, P113
[28]   Real World Home Blood Pressure Variability in Over 56,000 Individuals With Nearly 17 Million Measurements [J].
Kim, Kwang-il ;
Nikzad, Nima ;
Quer, Giorgio ;
Wineinger, Nathan E. ;
Vegreville, Matthieu ;
Normand, Alexis ;
Schmidt, Nicolas ;
Topol, Eric J. ;
Steinhubl, Steven .
AMERICAN JOURNAL OF HYPERTENSION, 2018, 31 (05) :566-573
[29]  
Lai A M, 2017, Yearb Med Inform, V26, P152, DOI 10.15265/IY-2017-016
[30]  
Liang Zilu., 2017, Proceedings of the 11th EAI International Conference on Pervasive Computing Technologies for Healthcare, P476, DOI [DOI 10.1145/3154862, 10.1145/3154862]