Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: review and recommendations

被引:567
作者
Tennant, Peter W. G. [1 ,2 ,3 ]
Murray, Eleanor J. [4 ]
Arnold, Kellyn F. [1 ,2 ]
Berrie, Laurie [1 ,5 ,6 ]
Fox, Matthew P. [4 ,7 ]
Gadd, Sarah C. [1 ,5 ]
Harrison, Wendy J. [1 ,2 ]
Keeble, Claire [1 ]
Ranker, Lynsie R. [4 ]
Textor, Johannes [8 ]
Tomova, Georgia D. [1 ,2 ,3 ]
Gilthorpe, Mark S. [1 ,2 ,3 ]
Ellison, George T. H. [1 ,2 ,9 ]
机构
[1] Univ Leeds, Leeds Inst Data Analyt, Level 11 Worsley Bldg,Clarendon Way, Leeds LS2 9NL, W Yorkshire, England
[2] Univ Leeds, Fac Med & Hlth, Leeds, W Yorkshire, England
[3] British Lib, Alan Turing Inst, London, England
[4] Boston Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA USA
[5] Univ Leeds, Sch Geog, Leeds, W Yorkshire, England
[6] Univ Edinburgh, Sch GeoSci, Edinburgh, Midlothian, Scotland
[7] Boston Univ, Dept Global Hlth, Boston, MA 02215 USA
[8] Radboud Univ Nijmegen, Med Ctr, Dept Tumour Immunol, Nijmegen, Netherlands
[9] Univ Cent Lancashire, Fac Sci & Technol, Ctr Data Innovat, Preston, Lancs, England
基金
英国医学研究理事会; 英国经济与社会研究理事会;
关键词
Directed acyclic graphs; graphical model theory; causal diagrams; causal inference; observational studies; confounding; covariate adjustment; reporting practices; CAUSAL INFERENCE; RISK;
D O I
10.1093/ije/dyaa213
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Directed acyclic graphs (DAGs) are an increasingly popular approach for identifying confounding variables that require conditioning when estimating causal effects. This review examined the use of DAGs in applied health research to inform recommendations for improving their transparency and utility in future research. Methods: Original health research articles published during 1999-2017 mentioning 'directed acyclic graphs' (or similar) or citing DAGitty were identified from Scopus, Web of Science, Medline and Embase. Data were extracted on the reporting of: estimands, DAGs and adjustment sets, alongside the characteristics of each article's largest DAG. Results: A total of 234 articles were identified that reported using DAGs. A fifth (n= 48, 21%) reported their target estimand(s) and half (n =115, 48%) reported the adjustment set(s) implied by their DAG(s). Two-thirds of the articles (n = 144,62%) made at least one DAG available. DAGs varied in size but averaged 12 nodes [interquartile range (IQR): 9-16, range: 3-28] and 29 arcs (IQR: 19-42, range: 3-99). The median saturation (i.e. percentage of total possible arcs) was 46% (IQR: 31-67, range: 12-100). 37% (n =53) of the DAGs included unobserved variables, 17% (n = 25) included 'super-nodes' (i.e. nodes containing more than one variable) and 34% (n = 49) were visually arranged so that the constituent arcs flowed in the same direction (e.g. top-to-bottom). Conclusion: There is substantial variation in the use and reporting of DAGs in applied health research. Although this partly reflects their flexibility, it also highlights some potential areas for improvement. This review hence offers several recommendations to improve the reporting and use of DAGs in future research.
引用
收藏
页码:620 / 632
页数:13
相关论文
共 35 条
[1]  
[Anonymous], 2014, Handbook of Epidemiology
[2]  
[Anonymous], 2013, DEV PROTOCOL OBSERVA
[3]   Reflection on modern methods: generalized linear models for prognosis and intervention-theory, practice and implications for machine learning [J].
Arnold, Kellyn F. ;
Davies, Vinny ;
de Kamps, Marc ;
Tennant, Peter W. G. ;
Mbotwa, John ;
Gilthorpe, Mark S. .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2020, 49 (06) :2074-2082
[4]   Reflection on modern methods: when worlds collide-prediction, machine learning and causal inference [J].
Blakely, Tony ;
Lynch, John ;
Simons, Koen ;
Bentley, Rebecca ;
Rose, Sherri .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2020, 49 (06) :2058-2064
[5]   Association of Pre-Pregnancy Body Mass Index, Pregnancy-Related Weight Changes, and Parity With the Risk of Developing Degenerative Musculoskeletal Conditions [J].
Bliddal, Mette ;
Pottegard, Anton ;
Kirkegaard, Helene ;
Olsen, Jorn ;
Jorgensen, Jan S. ;
Sorensen, Thorkild I. A. ;
Dreyer, Lene ;
Nohr, Ellen A. .
ARTHRITIS & RHEUMATOLOGY, 2016, 68 (05) :1156-1164
[6]   A Suite of R Functions for Directed Acyclic Graphs [J].
Breitling, Lutz Philipp .
EPIDEMIOLOGY, 2010, 21 (04) :586-587
[7]   Cooking and Season as Risk Factors for Acute Lower Respiratory Infections in African Children: A Cross-Sectional Multi-Country Analysis [J].
Buchner, Hannes ;
Rehfuess, Eva A. .
PLOS ONE, 2015, 10 (06)
[8]   Understanding and misunderstanding randomized controlled trials [J].
Deaton, Angus ;
Cartwright, Nancy .
SOCIAL SCIENCE & MEDICINE, 2018, 210 :2-21
[9]   Road traffic noise and registry based use of sleep medication [J].
Evandt, Jorunn ;
Oftedal, Bente ;
Krog, Norun Hjertager ;
Skurtveit, Svetlana ;
Nafstad, Per ;
Schwarze, Per E. ;
Skovlund, Eva ;
Houthuijs, Danny ;
Aasvang, Gunn Marit .
ENVIRONMENTAL HEALTH, 2017, 16
[10]   Evidence synthesis for constructing directed acyclic graphs (ESC-DAGs): a novel and systematic method for building directed acyclic graphs [J].
Ferguson, Karl D. ;
McCann, Mark ;
Katikireddi, Srinivasa Vittal ;
Thomson, Hilary ;
Green, Michael J. ;
Smith, Daniel J. ;
Lewsey, James D. .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2020, 49 (01) :322-329