Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models
被引:3
作者:
Malina, Stephen
论文数: 0引用数: 0
h-index: 0
机构:
Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
Dyno Therapeut, Watertown, MA 02472 USAColumbia Univ, Dept Comp Sci, New York, NY 10027 USA
Malina, Stephen
[1
,2
]
Cizin, Daniel
论文数: 0引用数: 0
h-index: 0
机构:
Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
Weill Cornell Med, Triinst PhD Program Computat Biol & Med, New York, NY USAColumbia Univ, Dept Comp Sci, New York, NY 10027 USA
Cizin, Daniel
[1
,3
]
Knowles, David A.
论文数: 0引用数: 0
h-index: 0
机构:
Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
New York Genome Ctr, New York, NY USA
Columbia Univ, Dept Syst Biol, New York, NY USA
Columbia Univ, Data Sci Inst, New York, NY USAColumbia Univ, Dept Comp Sci, New York, NY 10027 USA
Knowles, David A.
[1
,4
,5
,6
]
机构:
[1] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
[2] Dyno Therapeut, Watertown, MA 02472 USA
[3] Weill Cornell Med, Triinst PhD Program Computat Biol & Med, New York, NY USA
[4] New York Genome Ctr, New York, NY USA
[5] Columbia Univ, Dept Syst Biol, New York, NY USA
[6] Columbia Univ, Data Sci Inst, New York, NY USA
Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between genomic marks learned by genomic DL models. By combining Mendelian randomization with in silico mutagenesis, DeepMR obtains local (locus specific) and global estimates of (an assumed) linear causal relationship between marks. In a simulation designed to test recovery of pairwise causal relations between transcription factors (TFs), DeepMR gives accurate and unbiased estimates of the `true' global causal effect, but its coverage decays in the presence of sequence-dependent confounding. We then apply DeepMR to examine the global relationships learned by a state-of-the-art DL model, BPNet, between TFs involved in reprogramming. DeepMR's causal effect estimates validate previously hypothesized relationships between TFs and suggest new relationships for future investigation. Author summary Chromatin marks such as transcription factor (TF) binding, accessibility, and histone modifications play a critical role in controlling cell behavior and identity. In recent years, multi-task deep learning (DL) models have achieved remarkable success at predicting these and other chromatin marks. However, it is unclear to what extent these models learn meaningful mechanistic, even causal, relationships between these variables. Our work aims to fill this gap by combining in silico mutagenesis, deep learning uncertainty estimation and causal inference (specifically Mendelian randomization, MR), into a framework we call DeepMR. We describe DeepMR, apply it to a simulation intended to test its ability to recover causal relationships between features from a learned model, and then use it to examine the relationships learned by a state-of-the-art DL model, BPNet. Our results suggest that DeepMR can estimate causal relationships under its stated assumptions and provide further evidence for previously hypothesized relationships between TFs identified by BPNet.
机构:
Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, CanadaUniv Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
Delong, Andrew
Weirauch, Matthew T.
论文数: 0引用数: 0
h-index: 0
机构:
Canadian Inst Adv Res, Program Genet Networks, Toronto, ON, Canada
Canadian Inst Adv Res, Program Neural Computat, Toronto, ON, Canada
Cincinnati Childrens Hosp Med Ctr, Ctr Autoimmune Genom & Etiol, Cincinnati, OH 45229 USA
Cincinnati Childrens Hosp Med Ctr, Div Biomed Informat, Cincinnati, OH 45229 USA
Cincinnati Childrens Hosp Med Ctr, Div Dev Biol, Cincinnati, OH 45229 USAUniv Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
Weirauch, Matthew T.
Frey, Brendan J.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
Univ Toronto, Donnelly Ctr Cellular & Biomol Res, Toronto, ON, Canada
Canadian Inst Adv Res, Program Genet Networks, Toronto, ON, Canada
Canadian Inst Adv Res, Program Neural Computat, Toronto, ON, CanadaUniv Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
机构:
Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, CanadaUniv Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
Delong, Andrew
Weirauch, Matthew T.
论文数: 0引用数: 0
h-index: 0
机构:
Canadian Inst Adv Res, Program Genet Networks, Toronto, ON, Canada
Canadian Inst Adv Res, Program Neural Computat, Toronto, ON, Canada
Cincinnati Childrens Hosp Med Ctr, Ctr Autoimmune Genom & Etiol, Cincinnati, OH 45229 USA
Cincinnati Childrens Hosp Med Ctr, Div Biomed Informat, Cincinnati, OH 45229 USA
Cincinnati Childrens Hosp Med Ctr, Div Dev Biol, Cincinnati, OH 45229 USAUniv Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
Weirauch, Matthew T.
Frey, Brendan J.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada
Univ Toronto, Donnelly Ctr Cellular & Biomol Res, Toronto, ON, Canada
Canadian Inst Adv Res, Program Genet Networks, Toronto, ON, Canada
Canadian Inst Adv Res, Program Neural Computat, Toronto, ON, CanadaUniv Toronto, Dept Elect & Comp Engn, Toronto, ON, Canada