Exploring chromatin conformation and gene co-expression through graph embedding

被引:3
作者
Varrone, Marco [1 ]
Nanni, Luca [1 ]
Ciriello, Giovanni [2 ,3 ]
Ceri, Stefano [1 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, Milan, Italy
[2] Univ Lausanne, Dept Computat Biol, CH-1015 Lausanne, Switzerland
[3] Swiss Inst Bioinformat, CH-1015 Lausanne, Switzerland
关键词
NETWORK; PRINCIPLES; DISCOVERY; GENOME;
D O I
10.1093/bioinformatics/btaa803
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The relationship between gene co-expression and chromatin conformation is of great biological interest. Thanks to high-throughput chromosome conformation capture technologies (Hi-C), researchers are gaining insights on the tri-dimensional organization of the genome. Given the high complexity of Hi-C data and the difficult definition of gene co-expression networks, the development of proper computational tools to investigate such relationship is rapidly gaining the interest of researchers. One of the most fascinating questions in this context is how chromatin topology correlates with gene co-expression and which physical interaction patterns are most predictive of co-expression relationships. Results: To address these questions, we developed a computational framework for the prediction of co-expression networks from chromatin conformation data. We first define a gene chromatin interaction network where each gene is associated to its physical interaction profile; then, we apply two graph embedding techniques to extract a low-dimensional vector representation of each gene from the interaction network; finally, we train a classifier on gene embedding pairs to predict if they are co-expressed. Both graph embedding techniques outperform previous methods based on manually designed topological features, highlighting the need for more advanced strategies to encode chromatin information. We also establish that the most recent technique, based on random walks, is superior. Overall, our results demonstrate that chromatin conformation and gene regulation share a non-linear relationship and that gene topological embeddings encode relevant information, which could be used also for downstream analysis.
引用
收藏
页码:I700 / I708
页数:9
相关论文
共 51 条
[1]   Prediction of human disease genes by human-mouse conserved coexpression analysis [J].
Ala, Ugo ;
Piro, Rosario Michael ;
Grassi, Elena ;
Damasco, Christian ;
Silengo, Lorenzo ;
Oti, Martin ;
Provero, Paolo ;
Di Cunto, Ferdinando .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (03)
[2]  
[Anonymous], 2013, P WORKSHOP ICLR
[3]  
Arsov N., 2019, ARXIV191111726
[4]   Hi-C Chromatin Interaction Networks Predict Co-expression in the Mouse Cortex [J].
Babaei, Sepideh ;
Mahfouz, Ahmed ;
Hulsman, Marc ;
Lelieveldt, Boudewijn P. F. ;
de Ridder, Jeroen ;
Reinders, Marcel .
PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (05)
[5]   Chromatin interaction analysis reveals changes in small chromosome and telomere clustering between epithelial and breast cancer cells [J].
Barutcu, A. Rasim ;
Lajoie, Bryan R. ;
McCord, Rachel P. ;
Tye, Coralee E. ;
Hong, Deli ;
Messier, Terri L. ;
Browne, Gillian ;
van Wijnen, Andre J. ;
Lian, Jane B. ;
Stein, Janet L. ;
Dekker, Job ;
Imbalzano, Anthony N. ;
Stein, Gary S. .
GENOME BIOLOGY, 2015, 16
[6]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[7]  
Breiman L., 2001, IEEE Trans. Broadcast., V45, P5
[8]  
Butte A J, 2000, Pac Symp Biocomput, P418
[9]   Visual gene-network analysis reveals the cancer gene co-expression in human endometrial cancer [J].
Chou, Wei-Chun ;
Cheng, An-Lin ;
Brotto, Marco ;
Chuang, Chun-Yu .
BMC GENOMICS, 2014, 15
[10]   Finishing the euchromatic sequence of the human genome [J].
Collins, FS ;
Lander, ES ;
Rogers, J ;
Waterston, RH .
NATURE, 2004, 431 (7011) :931-945