共 50 条
mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants
被引:4
作者:
Tong, Shi-Yuan
[1
]
Fan, Ke
[1
]
Zhou, Zai-Wei
[2
]
Liu, Lin-Yun
[1
]
Zhang, Shu-Qing
[1
]
Fu, Yinghui
[1
]
Wang, Guang-Zhong
[3
]
Zhu, Ying
[4
]
Yu, Yong-Chun
[1
]
机构:
[1] Fudan Univ, Jingan Dist Cent Hosp Shanghai, Inst Brain Sci, MOE Frontiers Ctr Brain Sci,State Key Lab Med Neur, Shanghai 200032, Peoples R China
[2] Shanghai Xunyin Biotechnol Co Ltd, Shanghai 201802, Peoples R China
[3] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[4] Fudan Univ, MOE Frontiers Ctr Brain Sci, Inst Brain Sci, State Key Lab Med Neurobiol,Huashan Hosp, Shanghai 200032, Peoples R China
基金:
国家重点研发计划;
中国国家自然科学基金;
关键词:
Machine learning;
Missense variant;
Genomics;
Computational biology;
Pathogenicity prediction;
FUNCTIONAL IMPACT;
DATABASE;
MUTATIONS;
DIAGNOSIS;
ELEMENTS;
D O I:
10.1016/j.gpb.2022.07.005
中图分类号:
Q3 [遗传学];
学科分类号:
071007 ;
090102 ;
摘要:
Next-generation sequencing technologies both boost the discovery of variants in the human genome and exacerbate the challenges of pathogenic variant identification. In this study, we developed Pathogenicity Prediction Tool for missense variants (mvPPT), a highly sensitive and accurate missense variant classifier based on gradient boosting. mvPPT adopts high-confidence training sets with a wide spectrum of variant profiles, and extracts three categories of features, including scores from existing prediction tools, frequencies (allele frequencies, amino acid frequencies, and genotype frequencies), and genomic context. Compared with established predictors, mvPPT achieves superior performance in all test sets, regardless of data source. In addition, our study also provides guidance for training set and feature selection strategies, as well as reveals highly relevant features, which may further provide biological insights into variant pathogenicity. mvPPT is freely available at http://www.mvppt.club/.
引用
收藏
页码:414 / 426
页数:13
相关论文