Causal discovery of gene regulation with incomplete data

被引:14
作者
Foraita, Ronja [1 ]
Friemel, Juliane [1 ,2 ]
Guenther, Kathrin [1 ]
Behrens, Thomas [3 ]
Bullerdiek, Joern [4 ]
Nimzyk, Rolf [4 ]
Ahrens, Wolfgang [1 ,5 ]
Didelez, Vanessa [1 ,5 ]
机构
[1] Leibniz Inst Prevent Res & Epidemiol BIPS, Achterstr 30, D-28359 Bremen, Germany
[2] Univ & Univ Hosp Zurich, Zurich, Switzerland
[3] Ruhr Univ Bochum, Bochum, Germany
[4] Ctr Human Genet, Bremen, Germany
[5] Univ Bremen, Bremen, Germany
关键词
Gene expression; Graphical models; Head-and-neck squamous cell carcinoma; HMGA2; gene; Human papilloma-virus; PC algorithm; Protein 53 signalling pathway; HMGA2 PROTEIN EXPRESSION; SQUAMOUS-CELL CARCINOMAS; DIRECTED ACYCLIC GRAPHS; EQUIVALENCE CLASSES; MAXIMUM-LIKELIHOOD; SEROUS CARCINOMA; HEAD; P53; INFERENCE; CANCER;
D O I
10.1111/rssa.12565
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Causal discovery algorithms aim to identify causal relations from observational data and have become a popular tool for analysing genetic regulatory systems. In this work, we applied causal discovery to obtain novel insights into the genetic regulation underlying head-and-neck squamous cell carcinoma. Some methodological challenges needed to be resolved first. The available data contained missing values, but most approaches to causal discovery require complete data. Hence, we propose a new procedure combining constraint-based causal discovery with multiple imputation. This is based on using Rubin's rules for pooling tests of conditional independence. A second challenge was that causal discovery relies on strong assumptions and can be rather unstable. To assess the robustness of our results, we supplemented our investigation with sensitivity analyses, including a non-parametric bootstrap to quantify the variability of the estimated causal structures. We applied these methods to investigate how the high mobility group AT-Hook 2 (HMGA2) gene is incorporated in the protein 53 signalling pathway playing an important role in head-and-neck squamous cell carcinoma. Our results were quite stable and found direct associations between HMGA2 and other relevant proteins, but they did not provide clear support for the claim that HMGA2 itself is a key regulator gene.
引用
收藏
页码:1747 / 1775
页数:29
相关论文
共 50 条
[31]   A constraint optimization approach to causal discovery from subsampled time series data [J].
Hyttinen, Antti ;
Plis, Sergey ;
Jarvisalo, Matti ;
Eberhardt, Frederick ;
Danks, David .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 90 :208-225
[32]   Review of Causal Discovery Methods Based on Graphical Models [J].
Glymour, Clark ;
Zhang, Kun ;
Spirtes, Peter .
FRONTIERS IN GENETICS, 2019, 10
[33]   Power analysis for causal discovery [J].
Kummerfeld, Erich ;
Williams, Leland ;
Ma, Sisi .
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, 17 (03) :289-304
[34]   Nonlinear Causal Discovery with Confounders [J].
Li, Chunlin ;
Shen, Xiaotong ;
Pan, Wei .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (546) :1205-1214
[35]   Introduction to the foundations of causal discovery [J].
Eberhardt F. .
International Journal of Data Science and Analytics, 2017, 3 (02) :81-91
[36]   Power analysis for causal discovery [J].
Erich Kummerfeld ;
Leland Williams ;
Sisi Ma .
International Journal of Data Science and Analytics, 2024, 17 :289-304
[37]   Constraint-based Causal Discovery from Multiple Interventions over Overlapping Variable Sets [J].
Triantafillou, Sofia ;
Tsamardinos, Ioannis .
JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 :2147-2205
[38]   Gene regulatory network inference based on causal discovery integrating with graph neural network [J].
Feng, Ke ;
Jiang, Hongyang ;
Yin, Chaoyi ;
Sun, Huiyan .
QUANTITATIVE BIOLOGY, 2023, 11 (04) :434-450
[39]   Inferring cell cycle feedback regulation from gene expression data [J].
Ferrazzi, Fulvia ;
Engel, Felix B. ;
Wu, Erxi ;
Moseman, Annie P. ;
Kohane, Isaac S. ;
Bellazzi, Riccardo ;
Ramoni, Marco F. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (04) :565-575
[40]   Genetic Regulation of SMC Gene Expression and Splicing Predict Causal CAD Genes [J].
Aherrahrou, Redouane ;
Lue, Dillon ;
Perry, R. Noah ;
Aberra, Yonathan Tamrat ;
Khan, Mohammad Daud ;
Soh, Joon Yuhl ;
Ord, Tiit ;
Singha, Prosanta ;
Yang, Qianyi ;
Gilani, Huda ;
Benavente, Ernest Diez ;
Wong, Doris ;
Hinkle, Jameson ;
Ma, Lijiang ;
Sheynkman, Gloria M. ;
den Ruijter, Hester M. ;
Miller, Clint L. ;
Bjoerkegren, Johan L. M. ;
Kaikkonen, Minna U. ;
Civelek, Mete .
CIRCULATION RESEARCH, 2023, 132 (03) :323-338