Challenges Raised by Mediation Analysis in a High-Dimension Setting

Cited by: 23
Authors
Blum, Michael G. B. [1 ,2 ,3 ]
Valeri, Linda [4 ]
Francois, Olivier [1 ,2 ]
Cadiou, Solene [5 ]
Siroux, Valerie [5 ]
Lepeule, Johanna [5 ]
Slama, Remy [5 ]
Affiliations
[1] Univ Grenoble Alpes, IMAG, TIMC, Lab Tech Imagerie Med & Complexite, La Tronche, France
[2] Univ Grenoble Alpes, CNRS, French Natl Ctr Sci Res, La Tronche, France
[3] OWKIN, Paris, France
[4] Columbia Univ, Mailman Sch Publ Hlth, Dept Biostat, New York, NY USA
[5] Univ Grenoble Alpes, INSERM, Inst Adv Biosci IAB Joint Res Ctr, CNRS,Team Environm Epidemiol Appl Reprod & Resp H, Grenoble, France
Keywords
DNA METHYLATION; NULL; EXPOSURE; SMOKING; MODEL;
DOI
10.1289/EHP6240
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Subject classification code
08 ; 0830 ;
Abstract
BACKGROUND: Mediation analysis is used in epidemiology to identify pathways through which exposures influence health. The advent of high-throughput (omics) technologies gives opportunities to perform mediation analysis with a high-dimension pool of covariates. OBJECTIVE: We aimed to highlight some biostatistical issues of this expanding field of high-dimension mediation. DISCUSSION: The mediation techniques used for a single mediator cannot be generalized in a straightforward manner to high-dimension mediation. Causal knowledge on the relation between covariates is required for mediation analysis, and it is expected to be more limited as dimension and system complexity increase. The methods developed in high dimension can be distinguished according to whether mediators are considered separately or as a whole. Methods considering each potential mediator separately do not allow efficient identification of the indirect effects when mutual influences exist among the mediators, which is expected for many biological (e.g., epigenetic) parameters. In this context, methods considering all potential mediators simultaneously, based, for example, on data reduction techniques, are more adapted to the causal inference framework. Their cost is a possible lack of ability to single out the causal mediators. Moreover, the ability of the mediators to predict the outcome can be overestimated, in particular because many machine-learning algorithms are optimized to increase predictive ability rather than their aptitude to make causal inference. Given the lack of overarching validated framework and the generally complex causal structure of high-dimension data, analysis of high-dimension mediation currently requires great caution and effort to incorporate a priori biological knowledge.
Pages: 8
Related papers
55 in total
[1]   Pregnancy exposure to atmospheric pollution and meteorological conditions and placental DNA methylation [J].
Abraham, Emilie ;
Rousseaux, Sophie ;
Agier, Lydiane ;
Giorgis-Allemand, Lise ;
Tost, Jorg ;
Galineau, Julien ;
Hulin, Agnes ;
Siroux, Valerie ;
Vaiman, Daniel ;
Charles, Marie-Aline ;
Heude, Barbara ;
Forhan, Anne ;
Schwartz, Joel ;
Chuffart, Florent ;
Bourova-Flin, Ekaterina ;
Khochbin, Saadi ;
Slama, Remy ;
Lepeule, Johanna .
ENVIRONMENT INTERNATIONAL, 2018, 118 :334-347
[2]   A Systematic Comparison of Linear Regression-Based Statistical Methods to Assess Exposome-Health Associations [J].
Agier, Lydiane ;
Portengen, Lutzen ;
Chadeau-Hyam, Marc ;
Basagana, Xavier ;
Giorgis-Allemand, Lise ;
Siroux, Valerie ;
Robinson, Oliver ;
Vlaanderen, Jelle ;
Gonzalez, Juan R. ;
Nieuwenhuijsen, Mark J. ;
Vineis, Paolo ;
Vrijheid, Martine ;
Slama, Remy ;
Vermeulen, Roel .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2016, 124 (12) :1848-1856
[3]   [Anonymous], 2015, methods for mediation and interaction
[4]   Testing for the indirect effect under the null for genome-wide mediation analyses [J].
Barfield, Richard ;
Shen, Jincheng ;
Just, Allan C. ;
Vokonas, Pantel S. ;
Schwartz, Joel ;
Baccarelli, Andrea A. ;
VanderWeele, Tyler J. ;
Lin, Xihong .
GENETIC EPIDEMIOLOGY, 2017, 41 (08) :824-833
[5]   THE MODERATOR MEDIATOR VARIABLE DISTINCTION IN SOCIAL PSYCHOLOGICAL-RESEARCH - CONCEPTUAL, STRATEGIC, AND STATISTICAL CONSIDERATIONS [J].
BARON, RM ;
KENNY, DA .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1986, 51 (06) :1173-1182
[6]   Approaches for incorporating environmental mixtures as mediators in mediation analysis [J].
Bellavia, Andrea ;
James-Todd, Tamarra ;
Williams, Paige L. .
ENVIRONMENT INTERNATIONAL, 2019, 123 :368-374
[7]   Air pollution and gene-specific methylation in the Normative Aging Study [J].
Bind, Marie-Abele ;
Lepeule, Johanna ;
Zanobetti, Antonella ;
Gasparrini, Antonio ;
Baccarelli, Andrea ;
Coull, Brent A. ;
Tarantini, Letizia ;
Vokonas, Pantel S. ;
Koutrakis, Petros ;
Schwartz, Joel .
EPIGENETICS, 2014, 9 (03) :448-458
[8]   Testing multiple biological mediators simultaneously [J].
Boca, Simina M. ;
Sinha, Rashmi ;
Cross, Amanda J. ;
Moore, Steven C. ;
Sampson, Joshua N. .
BIOINFORMATICS, 2014, 30 (02) :214-220
[9]   Deciphering the complex: Methodological overview of statistical models to derive OMICS-based biomarkers [J].
Chadeau-Hyam, Marc ;
Campanella, Gianluca ;
Jombart, Thibaut ;
Bottolo, Leonardo ;
Portengen, Lutzen ;
Vineis, Paolo ;
Liquet, Benoit ;
Vermeulen, Roel C. H. .
ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2013, 54 (07) :542-557
[10]   High-dimensional multivariate mediation with application to neuroimaging data [J].
Chen, Oliver Y. ;
Crainiceanu, Ciprian ;
Ogburn, Elizabeth L. ;
Caffo, Brian S. ;
Wager, Tor D. ;
Lindquist, Martin A. .
BIOSTATISTICS, 2018, 19 (02) :121-136