Enhancer-DSNet: A Supervisedly Prepared Enriched Sequence Representation for the Identification of Enhancers and Their Strength

被引:12
作者
Asim, Muhammad Nabeel [1 ,2 ]
Ibrahim, Muhammad Ali [1 ,2 ]
Malik, Muhammad Imran [3 ]
Dengel, Andreas [1 ]
Ahmed, Sheraz [1 ]
机构
[1] German Res Ctr Artificial Intelligence DFKI, D-67663 Kaiserslautern, Germany
[2] Univ Kaiserslautern TU Kaiserslautern, D-67663 Kaiserslautern, Germany
[3] Natl Univ Sci & Technol, Natl Ctr Artificial Intelligence NCAI, Islamabad, Pakistan
来源
NEURAL INFORMATION PROCESSING, ICONIP 2020, PT III | 2020年 / 12534卷
关键词
Enhancer identification; Strong enhancer; Weak enhancers; Enhancer classification; Deep enhancer predictor; Enhancer strength identification; Enriched k-mers; PREDICTING ENHANCERS;
D O I
10.1007/978-3-030-63836-8_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identification of enhancers and their strength prediction plays an important role in gene expression regulation and currently an active area of research. However, its identification specifically through experimental approaches is extremely time consuming and labor-intensive task. Several machine learning methodologies have been proposed to accurately discriminate enhancers from regulatory elements and to estimate their strength. Existing approaches utilise different statistical measures for feature encoding which mainly capture residue specific physico-chemical properties upto certain extent but ignore semantic and positional information of residues. This paper presents "Enhancer-DSNet", a two-layer precisely deep neural network which makes use of a novel k-mer based sequence representation scheme prepared by fusing associations between k-mer positions and sequence type. Proposed Enhancer-DSNet methodology is evaluated on a publicly available benchmark dataset and independent test set. Experimental results over benchmark independent test set indicate that proposed Enhancer-DSNet methodology outshines the performance of most recent predictor by the figure of 2%, 1%, 2%, and 5% in terms of accuracy, specificity, sensitivity and matthews correlation coefficient for enhancer identification task and by the figure of 15%, 21%, and 39% in terms of accuracy, specificity, and matthews correlation coefficient for strong/weak enhancer prediction task.
引用
收藏
页码:38 / 48
页数:11
相关论文
共 24 条
[1]  
Almeida F, 2023, Arxiv, DOI [arXiv:1901.09069, DOI 10.48550/ARXIV.1901.09069]
[2]  
Bepler T, 2019, Arxiv, DOI [arXiv:1902.08661, 10.48550/arXiv.1902.08661, DOI 10.48550/ARXIV.1902.08661]
[3]   High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells [J].
Boyle, Alan P. ;
Song, Lingyun ;
Lee, Bum-Kyu ;
London, Darin ;
Keefe, Damian ;
Birney, Ewan ;
Iyer, Vishwanath R. ;
Crawford, Gregory E. ;
Furey, Terrence S. .
GENOME RESEARCH, 2011, 21 (03) :456-464
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   A new method for enhancer prediction based on deep belief network [J].
Bu, Hongda ;
Gan, Yanglan ;
Wang, Yang ;
Zhou, Shuigeng ;
Guan, Jihong .
BMC BIOINFORMATICS, 2017, 18
[6]   Prediction of linear B-cell epitopes using amino acid pair antigenicity scale [J].
Chen, J. ;
Liu, H. ;
Yang, J. ;
Chou, K.-C. .
AMINO ACIDS, 2007, 33 (03) :423-428
[7]   Enhancer RNAs: Insights Into Their Biological Role [J].
Cortes-Fernandez de Lara, Josue ;
Arzate-Mejia, Rodrigo G. ;
Recillas-Targa, Felix .
EPIGENETICS INSIGHTS, 2019, 12
[8]   Mapping and analysis of chromatin state dynamics in nine human cell types [J].
Ernst, Jason ;
Kheradpour, Pouya ;
Mikkelsen, Tarjei S. ;
Shoresh, Noam ;
Ward, Lucas D. ;
Epstein, Charles B. ;
Zhang, Xiaolan ;
Wang, Li ;
Issner, Robbyn ;
Coyne, Michael ;
Ku, Manching ;
Durham, Timothy ;
Kellis, Manolis ;
Bernstein, Bradley E. .
NATURE, 2011, 473 (7345) :43-U52
[9]   Integrating Diverse Datasets Improves Developmental Enhancer Prediction [J].
Erwin, Genevieve D. ;
Oksenberg, Nir ;
Truty, Rebecca M. ;
Kostka, Dennis ;
Murphy, Karl K. ;
Ahituv, Nadav ;
Pollard, Katherine S. ;
Capra, John A. .
PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (06)
[10]   Discover regulatory DNA elements using chromatin signatures and artificial neural network [J].
Firpi, Hiram A. ;
Ucar, Duygu ;
Tan, Kai .
BIOINFORMATICS, 2010, 26 (13) :1579-1586