共 33 条
3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints
被引:19
作者:
Won, Dhong-Gun
[1
]
Kim, Dong-Wook
[1
]
Woo, Junwoo
[1
]
Lee, Kyoungyeul
[1
]
机构:
[1] 3Bill, Res & Dev Ctr, Seoul 06193, South Korea
关键词:
CONGENITAL HEART-DISEASE;
SEQUENCE VARIANTS;
MUTATIONS;
DATABASE;
IDENTIFICATION;
GUIDELINES;
IMPACT;
D O I:
10.1093/bioinformatics/btab529
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Motivation: Improvements in next-generation sequencing have enabled genome-based diagnosis for patients with genetic diseases. However, accurate interpretation of human variants requires knowledge from a number of clinical cases. In addition, manual analysis of each variant detected in a patient's genome requires enormous time and effort. To reduce the cost of diagnosis, various computational tools have been developed to predict the pathogenicity of human variants, but the shortage and bias of available clinical data can lead to overfitting of algorithms. Results: We developed a pathogenicity predictor, 3Cnet, that uses recurrent neural networks to analyze the amino acid context of human variants. As 3Cnet is trained on simulated variants reflecting evolutionary conservation and clinical data, it can find disease-causing variants in patient genomes with 2.2 times greater sensitivity than currently available tools, more effectively discovering pathogenic variants and thereby improving diagnosis rates.
引用
收藏
页码:4626 / 4634
页数:9
相关论文
共 33 条