Assessing next-generation sequencing-based computational methods for predicting transcriptional regulators with query gene sets

被引:0
作者
Lu, Zeyu [1 ]
Xiao, Xue [2 ]
Zheng, Qiang [3 ]
Wang, Xinlei [3 ,4 ]
Xu, Lin [2 ,5 ]
机构
[1] Southern Methodist Univ, Moody Sch Grad & Adv Studies, Dept Stat & Data Sci, POB 750332,3225 Daniel Ave, Dallas, TX USA
[2] Univ Texas Southwestern Med Ctr, Quantitat Biomed Res Ctr, Peter ODonnell Jr Sch Publ Hlth, 5323 Harry Hines Blvd, Dallas, TX USA
[3] Univ Texas Arlington, Coll Sci, Div Data Sci, 501 S Nedderman Dr, Arlington, TX 76019 USA
[4] Univ Texas Arlington, Dept Math, 411 S Nedderman Dr, Arlington, TX 76019 USA
[5] Univ Texas Southwestern Med Ctr, Dept Pediat, Div Hematol Oncol, 5323 Harry Hines Blvd, Dallas, TX 75390 USA
基金
美国国家卫生研究院;
关键词
transcriptional regulator; benchmarking; query gene set; next-generation sequencing; prediction; CHIP-SEQ; DNA; ENHANCERS; GENOME; IDENTIFICATION; METHODOLOGIES; SPECIFICITIES; COLLECTION; MECHANISMS; EXPRESSION;
D O I
10.1093/bib/bbae366
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This article provides an in-depth review of computational methods for predicting transcriptional regulators (TRs) with query gene sets. Identification of TRs is of utmost importance in many biological applications, including but not limited to elucidating biological development mechanisms, identifying key disease genes, and predicting therapeutic targets. Various computational methods based on next-generation sequencing (NGS) data have been developed in the past decade, yet no systematic evaluation of NGS-based methods has been offered. We classified these methods into two categories based on shared characteristics, namely library-based and region-based methods. We further conducted benchmark studies to evaluate the accuracy, sensitivity, coverage, and usability of NGS-based methods with molecular experimental datasets. Results show that BART, ChIP-Atlas, and Lisa have relatively better performance. Besides, we point out the limitations of NGS-based methods and explore potential directions for further improvement.
引用
收藏
页数:18
相关论文
共 97 条
  • [1] Aibar S, 2017, NAT METHODS, V14, P1083, DOI [10.1038/nmeth.4463, 10.1038/NMETH.4463]
  • [2] Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries
    Aird, Daniel
    Ross, Michael G.
    Chen, Wei-Sheng
    Danielsson, Maxwell
    Fennell, Timothy
    Russ, Carsten
    Jaffe, David B.
    Nusbaum, Chad
    Gnirke, Andreas
    [J]. GENOME BIOLOGY, 2011, 12 (02)
  • [3] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [4] Relating genes to function: identifying enriched transcription factors using the ENCODE ChIP-Seq significance tool
    Auerbach, Raymond K.
    Chen, Bin
    Butte, Atul J.
    [J]. BIOINFORMATICS, 2013, 29 (15) : 1922 - 1924
  • [5] Gene regulatory network inference in the era of single-cell multi-omics
    Badia-i-Mompel, Pau
    Wessels, Lorna
    Mueller-Dott, Sophia
    Trimbour, Remi
    Flores, Ricardo Ramirez O.
    Argelaguet, Ricard
    Saez-Rodriguez, Julio
    [J]. NATURE REVIEWS GENETICS, 2023, 24 (11) : 739 - 754
  • [6] Bivariate Genomic Footprinting Detects Changes in Transcription Factor Activity
    Baek, Songjoon
    Goldstein, Ido
    Hager, Gordon L.
    [J]. CELL REPORTS, 2017, 19 (08): : 1710 - 1722
  • [7] Hi-C: A comprehensive technique to capture the conformation of genomes
    Belton, Jon-Matthew
    McCord, Rachel Patton
    Gibcus, Johan Harmen
    Naumova, Natalia
    Zhan, Ye
    Dekker, Job
    [J]. METHODS, 2012, 58 (03) : 268 - 276
  • [8] Summarizing and correcting the GC content bias in high-throughput sequencing
    Benjamini, Yuval
    Speed, Terence P.
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (10) : e72
  • [9] Targeting Transcription Factors in Cancer
    Bhagwat, Anand S.
    Vakoc, Christopher R.
    [J]. TRENDS IN CANCER, 2015, 1 (01): : 53 - 65
  • [10] Functional and Mechanistic Diversity of Distal Transcription Enhancers
    Bulger, Michael
    Groudine, Mark
    [J]. CELL, 2011, 144 (03) : 327 - 339