Toward Rigorous Data Harmonization in Cancer Epidemiology Research: One Approach

Cited by: 44
Authors
Rolland, Betsy [1 ,2 ]
Reid, Suzanna [2 ]
Stelling, Deanna [2 ]
Warnick, Greg [2 ]
Thornquist, Mark [2 ]
Feng, Ziding [3 ]
Potter, John D. [2 ,4 ,5 ]
Affiliations
[1] NCI, Canc Prevent Fellowship Program, Bethesda, MD 20892 USA
[2] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98104 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[4] Massey Univ, Ctr Publ Hlth Res, Wellington, New Zealand
[5] Univ Washington, Sch Publ Hlth, Dept Epidemiol, Seattle, WA 98195 USA
Funding
National Institutes of Health (United States);
Keywords
cancer epidemiology; data harmonization; data pooling; ASIA COHORT CONSORTIUM; BODY-MASS INDEX; DATASHAPER APPROACH; POOLED ANALYSIS; RISK; ASSOCIATION; DEATH;
DOI
10.1093/aje/kwv133
Chinese Library Classification
R1 [Preventive Medicine, Hygiene];
Discipline classification codes
1004 ; 120402 ;
Abstract
Cancer epidemiologists have a long history of combining data sets in pooled analyses, often harmonizing heterogeneous data from multiple studies into 1 large data set. Although there are useful websites on data harmonization with recommendations and support, there is little research on best practices in data harmonization; each project conducts harmonization according to its own internal standards. The field would be greatly served by charting the process of data harmonization to enhance the quality of the harmonized data. Here, we describe the data harmonization process utilized at the Fred Hutchinson Cancer Research Center (Seattle, Washington) by the coordinating centers of several research projects. We describe a 6-step harmonization process, including: 1) identification of questions the harmonized data set is required to answer; 2) identification of high-level data concepts to answer those questions; 3) assessment of data availability for data concepts; 4) development of common data elements for each data concept; 5) mapping and transformation of individual data points to common data elements; and 6) quality-control procedures. Our aim here is not to claim a "correct" way of doing data harmonization but to encourage others to describe their processes in order that we can begin to create rigorous approaches. We also propose a research agenda around this issue.
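Steps 5 and 6 of the process described above (mapping individual data points to a common data element, then running quality control) might be sketched as follows. This is only an illustration under stated assumptions, not the procedure used at the Fred Hutchinson Cancer Research Center: the study names (`study_a`, `study_b`), the source field names, the choice of body mass index as the common data element, and the plausibility range of 10 to 80 kg/m^2 are all hypothetical.

```python
from typing import Callable, Dict, Optional

# Hypothetical common data element (CDE): body mass index in kg/m^2.
# Each contributing study records height and weight differently, so each
# study gets its own transformation into the shared element.

Record = Dict[str, object]


def bmi_from_metric(rec: Record) -> Optional[float]:
    """Hypothetical study_a stores height in cm and weight in kg."""
    h, w = rec.get("height_cm"), rec.get("weight_kg")
    if h is None or w is None or h <= 0:
        return None
    return w / (h / 100.0) ** 2


def bmi_from_imperial(rec: Record) -> Optional[float]:
    """Hypothetical study_b stores height in inches and weight in pounds."""
    h, w = rec.get("height_in"), rec.get("weight_lb")
    if h is None or w is None or h <= 0:
        return None
    return (w * 0.45359237) / (h * 0.0254) ** 2


# Step 5: one documented mapping per study for this CDE.
BMI_MAPPINGS: Dict[str, Callable[[Record], Optional[float]]] = {
    "study_a": bmi_from_metric,
    "study_b": bmi_from_imperial,
}

# Assumed plausibility limits for the QC check; not taken from the source.
BMI_MIN, BMI_MAX = 10.0, 80.0


def harmonize(study: str, rec: Record) -> Record:
    """Map one source record to the CDE and attach a QC flag (step 6)."""
    bmi = BMI_MAPPINGS[study](rec)
    flagged = bmi is None or not (BMI_MIN <= bmi <= BMI_MAX)
    return {
        "study": study,
        "id": rec["id"],
        "bmi": None if bmi is None else round(bmi, 1),
        "qc_flag": flagged,  # True means the record needs manual review
    }


if __name__ == "__main__":
    pooled = [
        harmonize("study_a", {"id": 1, "height_cm": 170, "weight_kg": 70}),
        harmonize("study_b", {"id": 2, "height_in": 67, "weight_lb": 154}),
        harmonize("study_a", {"id": 3, "height_cm": None, "weight_kg": 70}),
    ]
    for row in pooled:
        print(row)
```

Keeping one explicit transformation function per study, rather than a single function with branching, is a design choice made here so that each mapping decision can be reviewed and documented on its own.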
Pages: 1033-1038
Page count: 6