High-throughput deep learning variant effect prediction with Sequence UNET

被引:13
|
作者
Dunham, Alistair S. [1 ,2 ]
Beltrao, Pedro [1 ,3 ]
AlQuraishi, Mohammed [4 ]
机构
[1] European Bioinformat Inst EMBL EBI, European Mol Biol Lab, Wellcome Genome Campus, Hinxton CB10 1SD, Cambs, England
[2] Wellcome Sanger Inst, Wellcome Genome Campus, Hinxton CB10 1RQ, Cambs, England
[3] Swiss Fed Inst Technol, Inst Mol Syst Biol, Dept Biol, CH-8093 Zurich, Switzerland
[4] Columbia Univ, Dept Syst Biol, New York, NY 10027 USA
基金
英国惠康基金;
关键词
Variant effect prediction; Deep learning; Mutation; PSSM; Pathogenicity; Machine learning; SERVER;
D O I
10.1186/s13059-023-02948-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Understanding coding mutations is important for many applications in biology and medicine but the vast mutation space makes comprehensive experimental characterisation impossible. Current predictors are often computationally intensive and difficult to scale, including recent deep learning models. We introduce Sequence UNET, a highly scalable deep learning architecture that classifies and predicts variant frequency from sequence alone using multi-scale representations from a fully convolutional compression/expansion architecture. It achieves comparable pathogenicity prediction to recent methods. We demonstrate scalability by analysing 8.3B variants in 904,134 proteins detected through large-scale proteomics. Sequence UNET runs on modest hardware with a simple Python package.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] High-throughput deep learning variant effect prediction with Sequence UNET
    Alistair S. Dunham
    Pedro Beltrao
    Mohammed AlQuraishi
    Genome Biology, 24
  • [2] Deep learning enables high-quality and high-throughput prediction of enzyme commission numbers
    Ryu, Jae Yong
    Kim, Hyun Uk
    Lee, Sang Yup
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (28) : 13996 - 14001
  • [3] TOWARDS DEEP LEARNING APPROACHES FOR QUANTITATIVE ANALYSIS OF HIGH-THROUGHPUT DLD
    Gioe, Eric A.
    Chen, Xiaolin
    Kim, Jong-Hoon
    PROCEEDINGS OF THE ASME 2020 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2020, VOL 13, 2020,
  • [4] High-Throughput Deep Learning Detection of Mitral Regurgitation
    Vrudhula, Amey
    Duffy, Grant
    Vukadinovic, Milos
    Liang, David
    Cheng, Susan
    Ouyang, David
    CIRCULATION, 2024, 150 (12) : 923 - 933
  • [5] Machine Learning and Deep Learning for Throughput Prediction
    Lee, Dongwon
    Lee, Joohyun
    12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2021), 2021, : 452 - 454
  • [6] Deep learning: as the new frontier in high-throughput plant phenotyping
    Arya, Sunny
    Sandhu, Karansher Singh
    Singh, Jagmohan
    Kumar, Sudhir
    EUPHYTICA, 2022, 218 (04)
  • [7] Deep learning: as the new frontier in high-throughput plant phenotyping
    Sunny Arya
    Karansher Singh Sandhu
    Jagmohan Singh
    Sudhir kumar
    Euphytica, 2022, 218
  • [8] High-throughput characterization methods for Ni-based superalloys and phase prediction via deep learning
    Qin, Zijun
    Li, Weifu
    Wang, Zi
    Pan, Junlong
    Wang, Zexin
    Li, Zihang
    Wang, Guowei
    Pan, Jun
    Liu, Feng
    Huang, Lan
    Tan, Liming
    Zhang, Lina
    Han, Hua
    Chen, Hong
    Jiang, Liang
    JOURNAL OF MATERIALS RESEARCH AND TECHNOLOGY-JMR&T, 2022, 21 : 1984 - 1997
  • [9] Continuous high-throughput characterization of mechanical properties via deep learning
    Zhu, Gengxuan
    Hu, Xueyan
    Bao, Ronghao
    Chen, Weiqiu
    INTERNATIONAL JOURNAL OF MECHANICAL SCIENCES, 2025, 291
  • [10] Deep Learning Image Analysis of High-Throughput Toxicology Assay Images
    Tandon, Arpit
    Howard, Brian
    Ramaiahgari, Sreenivasa
    Maharana, Adyasha
    Ferguson, Stephen
    Shah, Ruchir
    Merrick, B. Alex
    SLAS DISCOVERY, 2022, 27 (01) : 29 - 38