Integrating co-evolutionary signals and other properties of residue pairs to distinguish biological interfaces from crystal contacts

被引:9
|
作者
Hu, Jian [1 ,2 ]
Liu, Hui-Fang [1 ]
Sun, Jun [1 ]
Wang, Jia [1 ]
Liu, Rong [1 ]
机构
[1] Huazhong Agr Univ, Coll Informat, Hubei Key Lab Agr Bioinformat, Wuhan 430070, Hubei, Peoples R China
[2] South Cent Univ Nationalities, Coll Biomed Engn, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
biological interface; crystal contact; co-evolution; residue pair; machine learning; PROTEIN QUATERNARY STRUCTURE; DIRECT-COUPLING ANALYSIS; PACKING CONTACTS; RECOGNITION; COEVOLUTION; IDENTIFICATION; CLASSIFICATION; CONSERVATION; INFERENCE; VECTORS;
D O I
10.1002/pro.3448
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
It remains challenging to accurately discriminate between biological and crystal interfaces. Most existing analyses and algorithms focused on the features derived from a single side of the interface. However, less attention has been paid to the properties of residue pairs across protein interfaces. To address this problem, we defined a novel co-evolutionary feature for homodimers through integrating direct coupling analysis and image processing techniques. The residue pairs across biological homodimeric interfaces were significantly enriched in co-evolving residues compared to those across crystal contacts, resulting in a promising classification accuracy with area under the curves (AUCs) of >0.85. Considering the availability of co-evolutionary feature, we also designed other residue pair based features that were useful for both homodimers and heterodimers. The most informative residue pairs were identified to reflect the interaction preferences across protein interfaces. Regarding the other extant properties, we designed the new descriptors at the interface residue level as well as at the pairwise contact level. Extensive validation showed that these single properties can be used to identify biological interfaces with AUCs ranging from 0.60 to 0.88. By integrating co-evolutionary feature with other residue pair based properties, our final prediction model output excellent performance with AUCs of >0.91 on different datasets. Compared to existing methods, our algorithm not only yielded better or comparable results but also provided complementary information. An easy-to-use web server is freely accessible at .
引用
收藏
页码:1723 / 1735
页数:13
相关论文
empty
未找到相关数据