Navigating bottlenecks and trade-offs in genomic data analysis

被引:16
|
作者
Berger, Bonnie [1 ,2 ]
Yu, Yun William [3 ,4 ]
机构
[1] MIT, Dept Math, Cambridge, MA 02139 USA
[2] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[3] Univ Toronto Scarborough, Dept Comp & Math Sci, Toronto, ON, Canada
[4] Univ Toronto, Tri Campus Dept Math, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会; 美国国家卫生研究院;
关键词
BLAST-LIKE; ALIGNMENT; SEQUENCE; SEARCH; COMPRESSION; BIOBANK; FORMAT; FRAMEWORK; BIOLOGY; FASTQ;
D O I
10.1038/s41576-022-00551-z
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genome sequencing and analysis allow researchers to decode the functional information hidden in DNA sequences as well as to study cell to cell variation within a cell population. Traditionally, the primary bottleneck in genomic analysis pipelines has been the sequencing itself, which has been much more expensive than the computational analyses that follow. However, an important consequence of the continued drive to expand the throughput of sequencing platforms at lower cost is that often the analytical pipelines are struggling to keep up with the sheer amount of raw data produced. Computational cost and efficiency have thus become of ever increasing importance. Recent methodological advances, such as data sketching, accelerators and domain-specific libraries/languages, promise to address these modern computational challenges. However, despite being more efficient, these innovations come with a new set of trade-offs, both expected, such as accuracy versus memory and expense versus time, and more subtle, including the human expertise needed to use non-standard programming interfaces and set up complex infrastructure. In this Review, we discuss how to navigate these new methodological advances and their trade-offs.
引用
收藏
页码:235 / 250
页数:16
相关论文
共 50 条
  • [21] A biocultural approach to navigating conservation trade-offs through participatory methods
    Wengerd, Nicole
    Gilmore, Michael P.
    ECOLOGY AND SOCIETY, 2022, 27 (03):
  • [22] Testability Trade-Offs for BIST Data Paths
    Nicola Nicolici
    Bashir M. Al-Hashimi
    Journal of Electronic Testing, 2004, 20 : 169 - 179
  • [23] Disaggregated Data Centers: Challenges and Trade-offs
    Lin, Rui
    Cheng, Yuxin
    De Andrade, Marilet
    Wosinska, Lena
    Chen, Jiajia
    IEEE COMMUNICATIONS MAGAZINE, 2020, 58 (02) : 20 - 26
  • [24] Data compression trade-offs in sensor networks
    Chen, M
    Fowler, ML
    MATHEMATICS OF DATA/IMAGE CODING, COMPRESSION, AND ENCRYPTION VII, WITH APPLICATIONS, 2004, 5561 : 96 - 107
  • [25] Testability trade-offs for BIST data paths
    Nicolici, N
    Al-Hashimi, BM
    JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2004, 20 (02): : 169 - 179
  • [26] PRESENTING THE AIMS DATA - SOME TRADE-OFFS
    HOLLAND, R
    RUNCIMAN, WB
    ANAESTHESIA AND INTENSIVE CARE, 1993, 21 (05) : 505 - 505
  • [27] Social and Technical Trade-Offs in Data Science
    Barocas, Solon
    Boyd, Danah
    Friedler, Sorelle
    Wallach, Hanna
    BIG DATA, 2017, 5 (02) : 71 - 72
  • [28] TRADE-OFFS - WHAT TRADE-OFFS - COMPETENCE AND COMPETITIVENESS IN MANUFACTURING STRATEGY
    CORBETT, C
    VANWASSENHOVE, L
    CALIFORNIA MANAGEMENT REVIEW, 1993, 35 (04) : 107 - 122
  • [29] Alternative Trade-Offs in Data Envelopment Analysis: An Application to Hydropower Plants
    Mirzaei, Mohsen
    Kordrostami, Sohrab
    Amirteimoori, Alireza
    Chegini, Mehrdad G.
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2016, 2016
  • [30] Logistics Trade-offs
    Pereira, Teresa
    Ferreira, Fernanda A.
    Martins, Alexandra
    INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM-2018), 2019, 2116