Toward Rigorous Data Harmonization in Cancer Epidemiology Research: One Approach

被引:44
作者
Rolland, Betsy [1 ,2 ]
Reid, Suzanna [2 ]
Stelling, Deanna [2 ]
Warnick, Greg [2 ]
Thornquist, Mark [2 ]
Feng, Ziding [3 ]
Potter, John D. [2 ,4 ,5 ]
机构
[1] NCI, Canc Prevent Fellowship Program, Bethesda, MD 20892 USA
[2] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98104 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[4] Massey Univ, Ctr Publ Hlth Res, Wellington, New Zealand
[5] Univ Washington, Sch Publ Hlth, Dept Epidemiol, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
cancer epidemiology; data harmonization; data pooling; ASIA COHORT CONSORTIUM; BODY-MASS INDEX; DATASHAPER APPROACH; POOLED ANALYSIS; RISK; ASSOCIATION; DEATH;
D O I
10.1093/aje/kwv133
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Cancer epidemiologists have a long history of combining data sets in pooled analyses, often harmonizing heterogeneous data from multiple studies into 1 large data set. Although there are useful websites on data harmonization with recommendations and support, there is little research on best practices in data harmonization; each project conducts harmonization according to its own internal standards. The field would be greatly served by charting the process of data harmonization to enhance the quality of the harmonized data. Here, we describe the data harmonization process utilized at the Fred Hutchinson Cancer Research Center (Seattle, Washington) by the coordinating centers of several research projects. We describe a 6-step harmonization process, including: 1) identification of questions the harmonized data set is required to answer; 2) identification of high-level data concepts to answer those questions; 3) assessment of data availability for data concepts; 4) development of common data elements for each data concept; 5) mapping and transformation of individual data points to common data elements; and 6) quality-control procedures. Our aim here is not to claim a "correct" way of doing data harmonization but to encourage others to describe their processes in order that we can begin to create rigorous approaches. We also propose a research agenda around this issue.
引用
收藏
页码:1033 / 1038
页数:6
相关论文
共 50 条
  • [41] The Cancer Epidemiology Descriptive Cohort Database: A Tool to Support Population-Based Interdisciplinary Research
    Kennedy, Amy E.
    Khoury, Muin J.
    Ioannidis, John P. A.
    Brotzman, Michelle
    Miller, Amy
    Lane, Crystal
    Lai, Gabriel Y.
    Rogers, Scott D.
    Harvey, Chinonye
    Elena, Joanne W.
    Seminara, Daniela
    CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2016, 25 (10) : 1392 - 1401
  • [42] A comprehensive approach of the gender bias in occupational cancer epidemiology: A systematic review of lung cancer studies (2003-2014)
    Betansedi, Charles-Olivier
    Vasquez, Patricia Vaca
    Counil, Emilie
    AMERICAN JOURNAL OF INDUSTRIAL MEDICINE, 2018, 61 (05) : 372 - 382
  • [43] Survival of epithelial ovarian cancer in Black women: a society to cell approach in the African American cancer epidemiology study (AACES)
    Schildkraut, Joellen M. M.
    Johnson, Courtney
    Dempsey, Lauren F. F.
    Qin, Bo
    Terry, Paul
    Akonde, Maxwell
    Peters, Edward S. S.
    Mandle, Hannah
    Cote, Michele L. L.
    Peres, Lauren
    Moorman, Patricia
    Schwartz, Ann G. G.
    Epstein, Michael
    Marks, Jeffrey
    Bondy, Melissa
    Lawson, Andrew B. B.
    Alberg, Anthony J. J.
    Bandera, Elisa V. V.
    CANCER CAUSES & CONTROL, 2023, 34 (03) : 251 - 265
  • [44] Applying an Exposome-Wide (ExWAS) Approach to Cancer Research
    Juarez, Paul D.
    Matthews-Juarez, Patricia
    FRONTIERS IN ONCOLOGY, 2018, 8
  • [45] piRNAs in Gastric Cancer: A New Approach Towards Translational Research
    Cabral, Gleyce Fonseca
    dos Santos Pinheiro, Jhully Azevedo
    Vidal, Amanda Ferreira
    Santos, Sidney
    Ribeiro-dos-Santos, Andrea
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2020, 21 (06)
  • [46] Toward Data Sense-Making in Digital Health Communication Research: Why Theory Matters in the Age of Big Data
    Lee, Edmund W. J.
    Yee, Andrew Z. H.
    FRONTIERS IN COMMUNICATION, 2020, 5
  • [47] Approaches to integrating germline and tumor genomic data in cancer research
    Feigelson, Heather Spencer
    Goddard, Katrina A. B.
    Hollombe, Celine
    Tingle, Sharna R.
    Gillanders, Elizabeth M.
    Mechanic, Leah E.
    Nelson, Stefanie A.
    CARCINOGENESIS, 2014, 35 (10) : 2157 - 2163
  • [48] Time trends in the epidemiology of food allergy in England: an observational analysis of Clinical Practice Research Datalink data
    Turner, Paul J.
    Conrado, Alessia Baseggio
    Kallis, Constantinos
    O'Rourke, Eimear
    Haider, Sadia
    Ullah, Anhar
    Custovic, Darije
    Custovic, Adnan
    Quint, Jennifer K.
    LANCET PUBLIC HEALTH, 2024, 9 (09) : e664 - e673
  • [49] Suicide in breast cancer patients: An individual-centered approach provides insight beyond epidemiology
    Gueth, Uwe
    Myrick, Mary Elizabeth
    Reisch, Thomas
    Bosshard, Georg
    Schmid, Seraina Margaretha
    ACTA ONCOLOGICA, 2011, 50 (07) : 1037 - 1044
  • [50] Epidemiology of 40 blood biomarkers of one-carbon metabolism, vitamin status, inflammation, and renal and endothelial function among cancer-free older adults
    Zahed, Hana
    Johansson, Mattias
    Ueland, Per M.
    Midttun, Oivind
    Milne, Roger L.
    Giles, Graham G.
    Manjer, Jonas
    Sandsveden, Malte
    Langhammer, Arnulf
    Sorgjerd, Elin Pettersen
    Grankvist, Kjell
    Johansson, Mikael
    Freedman, Neal D.
    Huang, Wen-Yi
    Chen, Chu
    Prentice, Ross
    Stevens, Victoria L.
    Wang, Ying
    Le Marchand, Loic
    Wilkens, Lynne R.
    Weinstein, Stephanie J.
    Albanes, Demetrius
    Cai, Qiuyin
    Blot, William J.
    Arslan, Alan A.
    Zeleniuch-Jacquotte, Anne
    Shu, Xiao-Ou
    Zheng, Wei
    Yuan, Jian-Min
    Koh, Woon-Puay
    Visvanathan, Kala
    Sesso, Howard D.
    Zhang, Xuehong
    Gaziano, J. Michael
    Fanidi, Anouar
    Muller, David
    Brennan, Paul
    Guida, Florence
    Robbins, Hilary A.
    SCIENTIFIC REPORTS, 2021, 11 (01)