3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints

被引:19
作者
Won, Dhong-Gun [1 ]
Kim, Dong-Wook [1 ]
Woo, Junwoo [1 ]
Lee, Kyoungyeul [1 ]
机构
[1] 3Bill, Res & Dev Ctr, Seoul 06193, South Korea
关键词
CONGENITAL HEART-DISEASE; SEQUENCE VARIANTS; MUTATIONS; DATABASE; IDENTIFICATION; GUIDELINES; IMPACT;
D O I
10.1093/bioinformatics/btab529
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Improvements in next-generation sequencing have enabled genome-based diagnosis for patients with genetic diseases. However, accurate interpretation of human variants requires knowledge from a number of clinical cases. In addition, manual analysis of each variant detected in a patient's genome requires enormous time and effort. To reduce the cost of diagnosis, various computational tools have been developed to predict the pathogenicity of human variants, but the shortage and bias of available clinical data can lead to overfitting of algorithms. Results: We developed a pathogenicity predictor, 3Cnet, that uses recurrent neural networks to analyze the amino acid context of human variants. As 3Cnet is trained on simulated variants reflecting evolutionary conservation and clinical data, it can find disease-causing variants in patient genomes with 2.2 times greater sensitivity than currently available tools, more effectively discovering pathogenic variants and thereby improving diagnosis rates.
引用
收藏
页码:4626 / 4634
页数:9
相关论文
共 33 条
[31]   Prediction of unknown compressive stress/damage in concrete structure using 3D convolutional neural network-based deep learning of raw electromechanical admittance signals [J].
Ai, Demi ;
Mo, Fang .
JOURNAL OF INTELLIGENT MATERIAL SYSTEMS AND STRUCTURES, 2025, 36 (01) :29-52
[32]   Development of a quantitative prediction algorithm for human cord blood-derived CD34+ hematopoietic stem-progenitor cells using parametric and non-parametric machine learning models [J].
Leung, Chi-Kwan ;
Zhu, Pengcheng ;
Loke, Ian ;
Tang, Kin Fai ;
Leung, Ho-Chuen ;
Yeung, Chin-Fung .
SCIENTIFIC REPORTS, 2024, 14 (01)
[33]   Recognition of 3′-end L1, Alu, processed pseudogenes, and mRNA stem-loops in the human genome using sequence-based and structure-based machine-learning models [J].
Shein, Alexander ;
Zaikin, Anton ;
Poptsova, Maria .
SCIENTIFIC REPORTS, 2019, 9 (1)