Predicting enhancer-promoter interaction based on epigenomic signals

被引:3
作者
Zheng, Leqiong [1 ,2 ,3 ]
Liu, Li [2 ]
Zhu, Wen [1 ,3 ]
Ding, Yijie [3 ]
Wu, Fangxiang [1 ]
机构
[1] Hainan Normal Univ, Sch Math & Stat, Haikou, Peoples R China
[2] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou, Peoples R China
[3] Key Lab Computat Sci & Applicat Hainan Prov, Haikou, Peoples R China
关键词
enhancer-promoter interaction; machine learning; ChIA-PET; random forest; epigenomic signals; CURVE;
D O I
10.3389/fgene.2023.1133775
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Introduction: The physical interactions between enhancers and promoters are often involved in gene transcriptional regulation. High tissue-specific enhancer-promoter interactions (EPIs) are responsible for the differential expression of genes. Experimental methods are time-consuming and labor-intensive in measuring EPIs. An alternative approach, machine learning, has been widely used to predict EPIs. However, most existing machine learning methods require a large number of functional genomic and epigenomic features as input, which limits the application to different cell lines.Methods: In this paper, we developed a random forest model, HARD (H3K27ac, ATAC-seq, RAD21, and Distance), to predict EPI using only four types of features.Results: Independent tests on a benchmark dataset showed that HARD outperforms other models with the fewest features.Discussion: Our results revealed that chromatin accessibility and the binding of cohesin are important for cell-line-specific EPIs. Furthermore, we trained the HARD model in the GM12878 cell line and performed testing in the HeLa cell line. The cross-cell-lines prediction also performs well, suggesting it has the potential to be applied to other cell lines.
引用
收藏
页数:8
相关论文
共 42 条
  • [1] Machine learning polymer models of three-dimensional chromatin organization in human lymphoblastoid cells
    Al Bkhetan, Ziad
    Kadlof, Michal
    Kraft, Agnieszka
    Plewczynski, Dariusz
    [J]. METHODS, 2019, 166 : 83 - 90
  • [2] Ao C., 2022, RESEARCH-CHINA, V24, P1198, DOI [10.1093/bioinformatics/btn089, DOI 10.1093/BIOINFORMATICS/BTN089]
  • [3] NmRF: identification of multispecies RNA 2′-O-methylation modification sites from RNA sequences
    Ao, Chunyan
    Zou, Quan
    Yu, Liang
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [4] Polymer physics predicts the effects of structural variants on chromatin architecture
    Bianco, Simona
    Lupianez, Dario G.
    Chiariello, Andrea M.
    Annunziatella, Carlo
    Kraft, Katerina
    Schoepflin, Robert
    Wittler, Lars
    Andrey, Guillaume
    Vingron, Martin
    Pombo, Ana
    Mundlos, Stefan
    Nicodemi, Mario
    [J]. NATURE GENETICS, 2018, 50 (05) : 662 - +
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Potent antibiotic design via guided search from antibacterial activity evaluations
    Chen, Lu
    Yu, Liang
    Gao, Lin
    [J]. BIOINFORMATICS, 2023, 39 (02)
  • [7] Combinatorial effects of multiple enhancer variants in linkage disequilibrium dictate levels of gene expression to confer susceptibility to common traits
    Corradin, Olivia
    Saiakhova, Alina
    Akhtar-Zaidi, Batool
    Myeroff, Lois
    Willis, Joseph
    Iari, Richard Cowper-Sal
    Lupien, Mathieu
    Markowitz, Sanford
    Scacheri, Peter C.
    [J]. GENOME RESEARCH, 2014, 24 (01) : 1 - 13
  • [8] Topology of mammalian developmental enhancers and their regulatory landscapes
    de Laat, Wouter
    Duboule, Denis
    [J]. NATURE, 2013, 502 (7472) : 499 - 506
  • [9] Capturing chromosome conformation
    Dekker, J
    Rippe, K
    Dekker, M
    Kleckner, N
    [J]. SCIENCE, 2002, 295 (5558) : 1306 - 1311
  • [10] FORUM: Genomics ENCODE explained
    Ecker, Joseph R.
    [J]. NATURE, 2012, 489 (7414) : 52 - 53