MaxHiC: A robust background correction model to identify biologically relevant chromatin interactions in Hi-C and capture Hi-C experiments

被引:8
|
作者
Alinejad-Rokny, Hamid [1 ,2 ,3 ,4 ]
Modegh, Rassa Ghavami [5 ]
Rabiee, Hamid R. R. [5 ]
Sarbandi, Ehsan Ramezani
Rezaie, Narges [6 ]
Tam, Kin Tung [1 ,2 ]
Forrest, Alistair R. R. [1 ,2 ]
机构
[1] Univ Western Australia, Harry Perkins Inst Med Res, QEII Med Ctr, Perth, Australia
[2] Univ Western Australia, Ctr Med Res, Perth, Australia
[3] UNSW Sydney, Grad Sch Biomed Engn, Bio Med Machine Learning Lab BML, Sydney, Australia
[4] Macquarie Univ, Alenabled Proc AIP Res Ctr, Hlth Data Analyt Program, Sydney, Australia
[5] Sharif Univ Technol, Dept Comp Engn, Bioinformat & Computat Biol Lab, Tehran, Iran
[6] Univ Calif Irvine, Ctr Complex Biol Syst, Irvine, CA USA
基金
澳大利亚研究理事会; 英国医学研究理事会;
关键词
EXPRESSION; REVEALS; ORGANIZATION; ANNOTATION; PRINCIPLES;
D O I
10.1371/journal.pcbi.1010241
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Author summaryMaxHiC is a robust machine learning based tool for identifying significant interacting regions from both Hi-C and capture Hi-C data. All the current existing models are designed for either Hi-C or capture Hi-C data, however we developed MaxHiC to be applicable for both Hi-C and capture Hi-C libraries (two different models have been used for Hi-C and capture Hi-C data). MaxHiC is also able to analyse very deep Hi-C libraries (e.g., Micro-C) without any computational issues. MaxHiC significantly outperforms current existing Hi-C significant interaction callers and even Hi-C loop callers in terms of enrichment of interactions between known regulatory regions as well as biologically relevant interactions. Hi-C is a genome-wide chromosome conformation capture technology that detects interactions between pairs of genomic regions and exploits higher order chromatin structures. Conceptually Hi-C data counts interaction frequencies between every position in the genome and every other position. Biologically functional interactions are expected to occur more frequently than transient background and artefactual interactions. To identify biologically relevant interactions, several background models that take biases such as distance, GC content and mappability into account have been proposed. Here we introduce MaxHiC, a background correction tool that deals with these complex biases and robustly identifies statistically significant interactions in both Hi-C and capture Hi-C experiments. MaxHiC uses a negative binomial distribution model and a maximum likelihood technique to correct biases in both Hi-C and capture Hi-C libraries. We systematically benchmark MaxHiC against major Hi-C background correction tools including Hi-C significant interaction callers (SIC) and Hi-C loop callers using published Hi-C, capture Hi-C, and Micro-C datasets. Our results demonstrate that 1) Interacting regions identified by MaxHiC have significantly greater levels of overlap with known regulatory features (e.g. active chromatin histone marks, CTCF binding sites, DNase sensitivity) and also disease-associated genome-wide association SNPs than those identified by currently existing models, 2) the pairs of interacting regions are more likely to be linked by eQTL pairs and 3) more likely to link known regulatory features including known functional enhancer-promoter pairs validated by CRISPRi than any of the existing methods. We also demonstrate that interactions between different genomic region types have distinct distance distributions only revealed by MaxHiC. MaxHiC is publicly available as a python package for the analysis of Hi-C, capture Hi-C and Micro-C data.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] MaxHiC: A robust background correction model to identify biologically relevant chromatin interactions in Hi-C and capture Hi-C experiments (vol 18, e1010241, 2022)
    Alinejad-Rokny, Hamid
    Modegh, Rassa Ghavami
    Rabiee, Hamid R.
    Sarbandi, Ehsan Ramezani
    Rezaie, Narges
    Tam, Kin Tung
    Forrest, Alistair R. R.
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (09)
  • [2] Erratum: MaxHiC: A robust background correction model to identify biologically relevant chromatin interactions in Hi-C and capture Hi-C experiments (PLoS Comput Biol (2022) 18:6 (e1010241) DOI: 10.1371/journal.pcbi.1010241)
    Alinejad-Rokny, Hamid
    Modegh, Rassa Ghavami
    Rabiee, Hamid R.
    Sarbandi, Ehsan Ramezani
    Rezaie, Narges
    Tam, Kin Tung
    Forrest, Alistair R.R.
    PLoS Computational Biology, 2022, 18 (9 September):
  • [3] y Computational Processing and Quality Control of Hi-C, Capture Hi-C and Capture-C Data
    Hansen, Peter
    Gargano, Michael
    Hecht, Jochen
    Ibn-Salem, Jonas
    Karlebach, Guy
    Roehr, Johannes T.
    Robinson, Peter N.
    GENES, 2019, 10 (07):
  • [4] covNorm: An R package for coverage based normalization of Hi-C and capture Hi-C data
    Kim, Kyukwang
    Jung, Inkyung
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 3149 - 3159
  • [5] Fine mapping chromatin contacts in capture Hi-C data
    Eijsbouts, Christiaan Q.
    Burren, Oliver S.
    Newcombe, Paul J.
    Wallace, Chris
    BMC GENOMICS, 2019, 20 (1)
  • [6] Fine mapping chromatin contacts in capture Hi-C data
    Christiaan Q Eijsbouts
    Oliver S Burren
    Paul J Newcombe
    Chris Wallace
    BMC Genomics, 20
  • [7] CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data
    Cairns, Jonathan
    Freire-Pritchett, Paula
    Wingett, Steven W.
    Varnai, Csilla
    Dimond, Andrew
    Plagnol, Vincent
    Zerbino, Daniel
    Schoenfelder, Stefan
    Javierre, Biola-Maria
    Osborne, Cameron
    Fraser, Peter
    Spivakov, Mikhail
    GENOME BIOLOGY, 2016, 17
  • [8] CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data
    Jonathan Cairns
    Paula Freire-Pritchett
    Steven W. Wingett
    Csilla Várnai
    Andrew Dimond
    Vincent Plagnol
    Daniel Zerbino
    Stefan Schoenfelder
    Biola-Maria Javierre
    Cameron Osborne
    Peter Fraser
    Mikhail Spivakov
    Genome Biology, 17
  • [9] ChiCMaxima: a robust and simple pipeline for detection and visualization of chromatin looping in Capture Hi-C
    Ben Zouari, Yousra
    Molitor, Anne M.
    Sikorska, Natalia
    Pancaldi, Vera
    Sexton, Tom
    GENOME BIOLOGY, 2019, 20 (1)
  • [10] ChiCMaxima: a robust and simple pipeline for detection and visualization of chromatin looping in Capture Hi-C
    Yousra Ben Zouari
    Anne M. Molitor
    Natalia Sikorska
    Vera Pancaldi
    Tom Sexton
    Genome Biology, 20