Protein threading using residue co-variation and deep learning

被引:53
作者
Zhu, Jianwei [1 ,2 ,3 ]
Wang, Sheng [1 ]
Bu, Dongbo [2 ,3 ]
Xu, Jinbo [1 ]
机构
[1] Toyota Technol Inst, Chicago, IL 60637 USA
[2] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 626011, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金; 美国国家卫生研究院;
关键词
STRUCTURE PREDICTION; FOLD RECOGNITION; STRUCTURE INFORMATION; ALIGNMENT; CONTACTS;
D O I
10.1093/bioinformatics/bty278
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Template-based modeling, including homology modeling and protein threading, is a popular method for protein 3D structure prediction. However, alignment generation and template selection for protein sequences without close templates remain very challenging. Results: We present a new method called DeepThreader to improve protein threading, including both alignment generation and template selection, by making use of deep learning (DL) and residue co-variation information. Our method first employs DL to predict inter-residue distance distribution from residue co-variation and sequential information (e.g. sequence profile and predicted secondary structure), and then builds sequence-template alignment by integrating predicted distance information and sequential features through an ADMM algorithm. Experimental results suggest that predicted inter-residue distance is helpful to both protein alignment and template selection especially for protein sequences without very close templates, and that our method outperforms currently popular homology modeling method HHpred and threading method CNFpred by a large margin and greatly outperforms the latest contact-assisted protein threading method EigenTHREADER.
引用
收藏
页码:263 / 273
页数:11
相关论文
共 41 条
[11]   MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins [J].
Jones, David T. ;
Singh, Tanya ;
Kosciolek, Tomasz ;
Tetchner, Stuart .
BIOINFORMATICS, 2015, 31 (07) :999-1006
[12]   Progress in protein structure prediction [J].
Jones, DT .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1997, 7 (03) :377-387
[13]   A NEW APPROACH TO PROTEIN FOLD RECOGNITION [J].
JONES, DT ;
TAYLOR, WR ;
THORNTON, JM .
NATURE, 1992, 358 (6381) :86-89
[14]   Template-based protein structure modeling using the RaptorX web server [J].
Kaellberg, Morten ;
Wang, Haipeng ;
Wang, Sheng ;
Peng, Jian ;
Wang, Zhiyong ;
Lu, Hui ;
Xu, Jinbo .
NATURE PROTOCOLS, 2012, 7 (08) :1511-1522
[15]   Evolution of protein structures and functions [J].
Kinch, LN ;
Grishin, NV .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (03) :400-408
[16]  
Ma J, 2014, PLOS COMPUT BIOL, V10
[17]   Protein threading using context-specific alignment potential [J].
Ma, Jianzhu ;
Wang, Sheng ;
Zhao, Feng ;
Xu, Jinbo .
BIOINFORMATICS, 2013, 29 (13) :257-265
[18]   A conditional neural fields model for protein threading [J].
Ma, Jianzhu ;
Peng, Jian ;
Wang, Sheng ;
Xu, Jinbo .
BIOINFORMATICS, 2012, 28 (12) :I59-I66
[19]   Critical assessment of methods of protein structure prediction (CASP)Round XII [J].
Moult, John ;
Fidelis, Krzysztof ;
Kryshtafovych, Andriy ;
Schwede, Torsten ;
Tramontano, Anna .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 :7-15
[20]   Protein structure determination using metagenome sequence data [J].
Ovchinnikov, Sergey ;
Park, Hahnbeom ;
Varghese, Neha ;
Huang, Po-Ssu ;
Pavlopoulos, Georgios A. ;
Kim, David E. ;
Kamisetty, Hetunandan ;
Kyrpides, Nikos C. ;
Baker, David .
SCIENCE, 2017, 355 (6322) :294-297