Toward Rigorous Data Harmonization in Cancer Epidemiology Research: One Approach

被引:44
|
作者
Rolland, Betsy [1 ,2 ]
Reid, Suzanna [2 ]
Stelling, Deanna [2 ]
Warnick, Greg [2 ]
Thornquist, Mark [2 ]
Feng, Ziding [3 ]
Potter, John D. [2 ,4 ,5 ]
机构
[1] NCI, Canc Prevent Fellowship Program, Bethesda, MD 20892 USA
[2] Fred Hutchinson Canc Res Ctr, Div Publ Hlth Sci, Seattle, WA 98104 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[4] Massey Univ, Ctr Publ Hlth Res, Wellington, New Zealand
[5] Univ Washington, Sch Publ Hlth, Dept Epidemiol, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
cancer epidemiology; data harmonization; data pooling; ASIA COHORT CONSORTIUM; BODY-MASS INDEX; DATASHAPER APPROACH; POOLED ANALYSIS; RISK; ASSOCIATION; DEATH;
D O I
10.1093/aje/kwv133
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Cancer epidemiologists have a long history of combining data sets in pooled analyses, often harmonizing heterogeneous data from multiple studies into 1 large data set. Although there are useful websites on data harmonization with recommendations and support, there is little research on best practices in data harmonization; each project conducts harmonization according to its own internal standards. The field would be greatly served by charting the process of data harmonization to enhance the quality of the harmonized data. Here, we describe the data harmonization process utilized at the Fred Hutchinson Cancer Research Center (Seattle, Washington) by the coordinating centers of several research projects. We describe a 6-step harmonization process, including: 1) identification of questions the harmonized data set is required to answer; 2) identification of high-level data concepts to answer those questions; 3) assessment of data availability for data concepts; 4) development of common data elements for each data concept; 5) mapping and transformation of individual data points to common data elements; and 6) quality-control procedures. Our aim here is not to claim a "correct" way of doing data harmonization but to encourage others to describe their processes in order that we can begin to create rigorous approaches. We also propose a research agenda around this issue.
引用
收藏
页码:1033 / 1038
页数:6
相关论文
共 50 条
  • [1] Maelstrom Research guidelines for rigorous retrospective data harmonization
    Fortier, Isabel
    Raina, Parminder
    Van den Heuvel, Edwin R.
    Griffith, Lauren E.
    Craig, Camille
    Saliba, Matilda
    Doiron, Dany
    Stolk, Ronald P.
    Knoppers, Bartha M.
    Ferretti, Vincent
    Granda, Peter
    Burton, Paul
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2017, 46 (01) : 103 - 115
  • [2] RESEARCH ON SPOUSAL VIOLENCE: TOWARD A BALANCED AND RIGOROUS APPROACH RESPONSE
    Ismayilova, Leyla
    AMERICAN JOURNAL OF PUBLIC HEALTH, 2016, 106 (05) : E20 - E21
  • [3] HARMONIZATION OF EPIDEMIOLOGIC AND CLINICAL DATA WITHIN THE GENETICS AND EPIDEMIOLOGY OF COLORECTAL CANCER CONSORTIUM
    Stelling, D. L.
    Smith, B. R.
    Warnick, G. S.
    Reid, S. L.
    Chang-Claude, J. C.
    Slattery, M. L.
    Hayes, R. B.
    Hazra, A.
    Ma, J.
    Figueiredo, J. C.
    Hoffmeister, M.
    Brenner, H.
    Bezieau, S.
    Hudson, T. J.
    Gallinger, S.
    Zanke, B. W.
    Goodman, G. E.
    Potter, J. D.
    White, E.
    Casey, G.
    LeMarchand, L.
    Thornquist, M. D.
    Chan, A. T.
    Peters, U.
    Hutter, C. M.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2010, 171 : S65 - S65
  • [4] Data harmonization for COVID-19 and cancer research registries.
    Belenkaya, Rimma
    Watson, Adam
    Bethusamy, Shantha
    Patel, Meera
    Sandler, Tatyana
    Schwartz, Julian
    Park, James
    Dobbins, Maggie
    Maloy, Molly
    Lam, Michael
    Bahadur, Nadia
    Philip, John
    CLINICAL CANCER RESEARCH, 2020, 26 (18)
  • [5] Promoting Reproducibility and Integrity in Observational Research: One Approach of an Epidemiology Research Community
    Stopsack, Konrad H.
    Mucci, Lorelei A.
    Tworoger, Shelley S.
    Kang, Jae H.
    Eliassen, A. Heather
    Willett, Walter C.
    Stampfer, Meir J.
    EPIDEMIOLOGY, 2023, 34 (03) : 389 - 395
  • [6] A relational data harmonization approach to XML
    Niemi, Timo
    Nappila, Turkka
    Jarvelin, Kalervo
    JOURNAL OF INFORMATION SCIENCE, 2009, 35 (05) : 571 - 601
  • [7] A Novel Approach for Clinical Data Harmonization
    Chondrogiannis, Efthymios
    Andronikou, Vassiliki
    Karanastasis, Efstathios
    Varvarigou, Theodora
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 563 - 570
  • [8] Unstructured data research in business: Toward a structured approach
    de Haan, Evert
    Padigar, Manjunath
    El Kihal, Siham
    Kubler, Raoul
    Wieringa, Jaap E.
    JOURNAL OF BUSINESS RESEARCH, 2024, 177
  • [9] Toward rigorous use of expert knowledge in ecological research
    Drescher, M.
    Perera, A. H.
    Johnson, C. J.
    Buse, L. J.
    Drew, C. A.
    Burgman, M. A.
    ECOSPHERE, 2013, 4 (07):
  • [10] Is rigorous retrospective harmonization possible? Application of the DataSHaPER approach across 53 large studies
    Fortier, Isabel
    Doiron, Dany
    Little, Julian
    Ferretti, Vincent
    L'Heureux, Francois
    Stolk, Ronald P.
    Knoppers, Bartha M.
    Hudson, Thomas J.
    Burton, Paul R.
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2011, 40 (05) : 1314 - 1328