Epigenomic annotation-based interpretation of genomic data: from enrichment analysis to machine learning

被引:20
作者
Dozmorov, Mikhail G. [1 ]
机构
[1] Virginia Commonwealth Univ, Dept Biostat, Richmond, VA 23298 USA
关键词
CHIP-SEQ DATA; R/BIOCONDUCTOR PACKAGE; HISTONE MODIFICATIONS; STATISTICAL-ANALYSIS; COMMON DISEASE; HI-C; FEATURES; DNA; ORGANIZATION; ASSOCIATION;
D O I
10.1093/bioinformatics/btx414
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: One of the goals of functional genomics is to understand the regulatory implications of experimentally obtained genomic regions of interest (ROIs). Most sequencing technologies now generate ROIs distributed across the whole genome. The interpretation of these genome-wide ROIs represents a challenge as the majority of them lie outside of functionally well-defined protein coding regions. Recent efforts by the members of the International Human Epigenome Consortium have generated volumes of functional/regulatory data (reference epigenomic datasets), effectively annotating the genome with epigenomic properties. Consequently, a wide variety of computational tools has been developed utilizing these epigenomic datasets for the interpretation of genomic data. Results: The purpose of this review is to provide a structured overview of practical solutions for the interpretation of ROIs with the help of epigenomic data. Starting with epigenomic enrichment analysis, we discuss leading tools and machine learning methods utilizing epigenomic and 3D genome structure data. The hierarchy of tools and methods reviewed here presents a practical guide for the interpretation of genome-wide ROIs within an epigenomic context.
引用
收藏
页码:3323 / 3330
页数:8
相关论文
共 38 条
[1]   An operational definition of epigenetics [J].
Berger, Shelley L. ;
Kouzarides, Tony ;
Shiekhattar, Ramin ;
Shilatifard, Ali .
GENES & DEVELOPMENT, 2009, 23 (07) :781-783
[2]   SUBSAMPLING METHODS FOR GENOMIC INFERENCE [J].
Bickel, Peter J. ;
Boley, Nathan ;
Brown, James B. ;
Huang, Haiyan ;
Zhang, Nancy R. .
ANNALS OF APPLIED STATISTICS, 2010, 4 (04) :1660-1697
[3]   CPG-RICH ISLANDS AND THE FUNCTION OF DNA METHYLATION [J].
BIRD, AP .
NATURE, 1986, 321 (6067) :209-213
[4]   eFORGE: A Tool for Identifying Cell Type-Specific Signal in Epigenomic Data [J].
Breeze, Charles E. ;
Paul, Dirk S. ;
van Dongen, Jenny ;
Butcher, Lee M. ;
Ambrose, John C. ;
Barrett, James E. ;
Lowe, Robert ;
Rakyan, Vardhman K. ;
Iotchkova, Valentina ;
Frontini, Mattia ;
Downes, Kate ;
Ouwehand, Willem H. ;
Laperle, Jonathan ;
Jacques, Pierre-ETienne ;
Bourque, Guillaume ;
Bergmann, Anke K. ;
Siebert, Reiner ;
Vellenga, Edo ;
Saeed, Sadia ;
Matarese, Filomena ;
Martens, Joost H. A. ;
Stunnenberg, Hendrik G. ;
Teschendorff, Andrew E. ;
Herrero, Javier ;
Birney, Ewan ;
Dunham, Ian ;
Beck, Stephan .
CELL REPORTS, 2016, 17 (08) :2137-2150
[5]   Hi-C-constrained physical models of human chromosomes recover functionally-related properties of genome organization [J].
Di Stefano, Marco ;
Paulsen, Jonas ;
Lien, Tonje G. ;
Hovig, Eivind ;
Micheletti, Cristian .
SCIENTIFIC REPORTS, 2016, 6
[6]   GenomeRunner web server: regulatory similarity and differences define the functional impact of SNP sets [J].
Dozmorov, Mikhail G. ;
Cara, Lukas R. ;
Giles, Cory B. ;
Wren, Jonathan D. .
BIOINFORMATICS, 2016, 32 (15) :2256-2263
[7]   An integrated encyclopedia of DNA elements in the human genome [J].
Dunham, Ian ;
Kundaje, Anshul ;
Aldred, Shelley F. ;
Collins, Patrick J. ;
Davis, CarrieA. ;
Doyle, Francis ;
Epstein, Charles B. ;
Frietze, Seth ;
Harrow, Jennifer ;
Kaul, Rajinder ;
Khatun, Jainab ;
Lajoie, Bryan R. ;
Landt, Stephen G. ;
Lee, Bum-Kyu ;
Pauli, Florencia ;
Rosenbloom, Kate R. ;
Sabo, Peter ;
Safi, Alexias ;
Sanyal, Amartya ;
Shoresh, Noam ;
Simon, Jeremy M. ;
Song, Lingyun ;
Trinklein, Nathan D. ;
Altshuler, Robert C. ;
Birney, Ewan ;
Brown, James B. ;
Cheng, Chao ;
Djebali, Sarah ;
Dong, Xianjun ;
Dunham, Ian ;
Ernst, Jason ;
Furey, Terrence S. ;
Gerstein, Mark ;
Giardine, Belinda ;
Greven, Melissa ;
Hardison, Ross C. ;
Harris, Robert S. ;
Herrero, Javier ;
Hoffman, Michael M. ;
Iyer, Sowmya ;
Kellis, Manolis ;
Khatun, Jainab ;
Kheradpour, Pouya ;
Kundaje, Anshul ;
Lassmann, Timo ;
Li, Qunhua ;
Lin, Xinying ;
Marinov, Georgi K. ;
Merkel, Angelika ;
Mortazavi, Ali .
NATURE, 2012, 489 (7414) :57-74
[8]   Large-scale imputation of epigenomic datasets for systematic annotation of diverse human tissues [J].
Ernst, Jason ;
Kellis, Manolis .
NATURE BIOTECHNOLOGY, 2015, 33 (04) :364-U74
[9]   Exploring Massive, Genome Scale Datasets with the GenometriCorr Package [J].
Favorov, Alexander ;
Mularoni, Loris ;
Cope, Leslie M. ;
Medvedeva, Yulia ;
Mironov, Andrey A. ;
Makeev, Vsevolod J. ;
Wheelan, Sarah J. .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (05)
[10]   regioneR: an R/Bioconductor package for the association analysis of genomic regions based on permutation tests [J].
Gel, Bernat ;
Diez-Villanueva, Anna ;
Serra, Eduard ;
Buschbeck, Marcus ;
Peinado, Miguel A. ;
Malinverni, Roberto .
BIOINFORMATICS, 2016, 32 (02) :289-291