CapsNetYY1: identifying YY1-mediated chromatin loops based on a capsule network architecture

被引:1
作者
Zhang, Zhimin [1 ]
Li, Fenglin [1 ]
Zhao, Jianping [1 ]
Zheng, Chunhou [2 ]
机构
[1] Xinjiang Univ, Coll Math & Syst Sci, Urumqi, Peoples R China
[2] Anhui Univ, Sch Artificial Intelligence, Key Lab Intelligent Comp & Signal Proc, Minist Educ,Informat Mat & Intelligent Sensing Lab, Hefei, Peoples R China
关键词
YY1-mediated chromatin loops; Capsule network; Enhancer-promoter interaction; SITES; ORGANIZATION; DIVERSITY; TOPOLOGY; COHESIN; UNITS; CTCF; YY1;
D O I
10.1186/s12864-023-09217-4
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background Previous studies have identified that chromosome structure plays a very important role in gene control. The transcription factor Yin Yang 1 (YY1), a multifunctional DNA binding protein, could form a dimer to mediate chromatin loops and active enhancer-promoter interactions. The deletion of YY1 or point mutations at the YY1 binding sites significantly inhibit the enhancer-promoter interactions and affect gene expression. To date, only a few computational methods are available for identifying YY1-mediated chromatin loops.Results We proposed a novel model named CapsNetYY1, which was based on capsule network architecture to identify whether a pair of YY1 motifs can form a chromatin loop. Firstly, we encode the DNA sequence using one-hot encoding method. Secondly, multi-scale convolution layer is used to extract local features of the sequence, and bidirectional gated recurrent unit is used to learn the features across time steps. Finally, capsule networks (convolution capsule layer and digital capsule layer) used to extract higher level features and recognize YY1-mediated chromatin loops. Compared with DeepYY1, the only prediction for YY1-mediated chromatin loops, our model CapsNetYY1 achieved the better performance on the independent datasets (AUC >0.99).Conclusion The results indicate that CapsNetYY1 is an excellent method for identifying YY1-mediated chromatin loops. We believe that the CapsNetYY1 method will be used for predictive classification of other DNA sequences.
引用
收藏
页数:9
相关论文
共 39 条
[1]  
Ali SD, 2020, IEEE ACM T COMP BIOL, P99
[2]   Choose your partners: dimerization in eukaryotic transcription factors [J].
Amoutzias, Grigoris D. ;
Robertson, David L. ;
Van de Peer, Yves ;
Oliver, Stephen G. .
TRENDS IN BIOCHEMICAL SCIENCES, 2008, 33 (05) :220-229
[3]   The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation [J].
Chicco, Davide ;
Jurman, Giuseppe .
BMC GENOMICS, 2020, 21 (01)
[4]  
Chung J., 2014, arXiv
[5]   Condensin-driven remodelling of X chromosome topology during dosage compensation [J].
Crane, Emily ;
Bian, Qian ;
McCord, Rachel Patton ;
Lajoie, Bryan R. ;
Wheeler, Bayly S. ;
Ralston, Edward J. ;
Uzawa, Satoru ;
Dekker, Job ;
Meyer, Barbara J. .
NATURE, 2015, 523 (7559) :240-U299
[6]   Global analysis of the insulator binding protein CTCF in chromatin barrier regions reveals demarcation of active and repressive domains [J].
Cuddapah, Suresh ;
Jothi, Raja ;
Schones, Dustin E. ;
Roh, Tae-Young ;
Cui, Kairong ;
Zhao, Keji .
GENOME RESEARCH, 2009, 19 (01) :24-32
[7]   DeepYY1: a deep learning approach to identify YY1-mediated chromatin loops [J].
Dao, Fu-Ying ;
Lv, Hao ;
Zhang, Dan ;
Zhang, Zi-Mei ;
Liu, Li ;
Lin, Hao .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
[8]  
Davis J., 2006, P 23 INT C MACH LEAR, P233, DOI [10.1145/1143844.1143874, DOI 10.1145/1143844.1143874]
[9]   Structural and functional diversity of Topologically Associating Domains [J].
Dekker, Job ;
Heard, Edith .
FEBS LETTERS, 2015, 589 (20) :2877-2884
[10]   CTCF: making the right connections [J].
Ghirlando, Rodolfo ;
Felsenfeld, Gary .
GENES & DEVELOPMENT, 2016, 30 (08) :881-891