Generating real-world data from health records: design of a patient-centric study in multiple sclerosis using a commercial health records platform

被引:4
作者
Hanson, Gillian [1 ]
Chitnis, Tanuja [2 ,3 ,4 ,5 ]
Williams, Mitzi J. [6 ]
Gan, Ryan William [7 ,9 ]
Julian, Laura [7 ]
Mace, Kieran [1 ]
Chia, Jenny [7 ]
Wormser, David [8 ,10 ]
Martinec, Michael [8 ]
Astorino, Troy [1 ]
Leviner, Noga [1 ]
Maung, Pye [1 ]
Jan, Asif [8 ,11 ]
Belendiuk, Katherine [7 ]
机构
[1] PicnicHealth, San Francisco, CA USA
[2] Harvard Univ, Harvard Med Sch, Boston, MA 02115 USA
[3] Massachusetts Gen Hosp, Mass Gen Brigham Pediat Multiple Sclerosis Ctr, Boston, MA 02114 USA
[4] Brigham & Womens Hosp, Translat Neuroimmunol Res Ctr, 75 Francis St, Boston, MA 02115 USA
[5] Brigham & Womens Hosp, Brigham Multiple Sclerosis Ctr, 75 Francis St, Boston, MA 02115 USA
[6] Joi Life Wellness Grp, MS Neurol Ctr, Smyrna, GA USA
[7] Genentech Inc, 460 Point San Bruno Blvd, San Francisco, CA 94080 USA
[8] F Hoffmann La Roche Ltd, Basel, Switzerland
[9] Johnson & Johnson, Janssen Pharmaceut Co, San Francisco, CA USA
[10] Novartis Int AG, Basel, Switzerland
[11] Owkin Inc, Basel, Switzerland
关键词
real-world data; health records; data abstraction; machine learning; multiple sclerosis; CLINICAL-TRIAL; EXTRACTION;
D O I
10.1093/jamiaopen/ooab110
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective The FlywheelMS study will explore the use of a real-world health record data set generated by PicnicHealth, a patient-centric health records platform, to improve understanding of disease course and patterns of care for patients with multiple sclerosis (MS). Materials and Methods The FlywheelMS study aims to enroll 5000 adults with MS in the United States to create a large, deidentified, longitudinal data set for clinical research. PicnicHealth obtains health records, including paper charts, electronic health records, and radiology imaging files from any healthcare site. Using a large-scale health record processing pipeline, PicnicHealth abstracts standard and condition-specific data elements from structured (eg, laboratory test results) and unstructured (eg, narrative) text and maps these to standardized medical vocabularies. Researchers can use the resulting data set to answer empirical questions and study participants can access and share their harmonized health records using PicnicHealth's web application. Results As of November 24, 2020, more than 4176 participants from 49 of 50 US states have enrolled in the FlywheelMS study. A median of 200 pages of records have been collected from 14 different doctors over 8 years per participant. Abstraction precision, established through inter-abstractor agreement, is as high as 97.8% when identifying and mapping data elements to a standard ontology. Conclusion Using a commercial health records platform, the FlywheelMS study is generating a real-world, multimodal data set that could provide valuable insights about patients with MS. This approach to data collection and abstraction is disease-agnostic and could be used to address other clinical research questions in the future. Lay Summary Health records contain valuable information about patients and the care they receive in routine clinical practice; however, use of this data source in research is hindered by the difficulty of obtaining complete analyzable data sets. In the United States, health records for each patient are stored in paper and electronic formats across multiple healthcare providers. Furthermore, data must be extracted from health records before they can be analyzed, which is technically difficult for images and free text. In the first part of this paper, we describe how PicnicHealth, a commercial health records platform, collects health records on behalf of patients in any format and from all healthcare sites. Facilitated by tailored software tools and task-specific machine-learning models, data are extracted from health records by human experts in an efficient and precise manner. This enables patients to access and manage their health record data via a web application. The second part of this paper describes the design, rationale, and recruitment metrics for the ongoing FlywheelMS study, which is exploring whether an anonymized data set generated by PicnicHealth can improve our understanding of the disease course and patterns of care for patients with multiple sclerosis.
引用
收藏
页数:10
相关论文
共 27 条
[1]  
[Anonymous], 2006, P 23 INT C MACH LEAR, DOI 10.1145/1143844.1143891
[2]   Opportunities and challenges in leveraging electronic health record data in oncology [J].
Berger, Marc L. ;
Curtis, Melissa D. ;
Smith, Gregory ;
Harnett, James ;
Abernethy, Amy P. .
FUTURE ONCOLOGY, 2016, 12 (10) :1261-1274
[3]  
Blacketer C, 2020, COMMON DATA MODEL
[4]   Using Electronic Health Records for Population Health Research: A Review of Methods and Applications [J].
Casey, Joan A. ;
Schwartz, Brian S. ;
Stewart, Walter F. ;
Adler, Nancy E. .
ANNUAL REVIEW OF PUBLIC HEALTH, VOL 37, 2016, 37 :61-81
[5]   Electronic health records to facilitate clinical research [J].
Cowie, Martin R. ;
Blomster, Juuso I. ;
Curtis, Lesley H. ;
Duclaux, Sylvie ;
Ford, Ian ;
Fritz, Fleur ;
Goldman, Samantha ;
Janmohamed, Salim ;
Kreuzer, Joerg ;
Leenay, Mark ;
Michel, Alexander ;
Ong, Seleen ;
Pell, Jill P. ;
Southworth, Mary Ross ;
Stough, Wendy Gattis ;
Thoenes, Martin ;
Zannad, Faiez ;
Zalewski, Andrew .
CLINICAL RESEARCH IN CARDIOLOGY, 2017, 106 (01) :1-9
[6]  
Daniel G., 2018, Characterizing RWD quality and relevancy for regulatory purposes
[7]   Electronic Health Records vs Medicaid Claims: Completeness of Diabetes Preventive Care Data in Community Health Centers [J].
DeVoe, Jennifer E. ;
Gold, Rachel ;
McIntire, Patti ;
Puro, Jon ;
Chauvie, Susan ;
Gallia, Charles A. .
ANNALS OF FAMILY MEDICINE, 2011, 9 (04) :351-358
[8]   Radiomics: Images Are More than Pictures, They Are Data [J].
Gillies, Robert J. ;
Kinahan, Paul E. ;
Hricak, Hedvig .
RADIOLOGY, 2016, 278 (02) :563-577
[9]   Characteristics, burden of illness, and physical functioning of patients with relapsing-remitting and secondary progressive multiple sclerosis: a cross-sectional US survey [J].
Gross, Hillary J. ;
Watson, Crystal .
NEUROPSYCHIATRIC DISEASE AND TREATMENT, 2017, 13 :1349-1357
[10]  
Guzman B., 2020, ARXIV200200481