共 50 条
A quantitative structure-activity relationship (QSAR) study on glycan array data to determine the specificities of glycan-binding proteins
被引:19
|作者:
Xuan, Pengfei
[1
]
Zhang, Yuehua
[1
]
Tzeng, Tzuen-rong Jeremy
[2
]
Wan, Xiu-Feng
[3
]
Luo, Feng
[1
]
机构:
[1] Clemson Univ, Sch Comp, Clemson, SC 29634 USA
[2] Clemson Univ, Dept Biol Sci, Clemson, SC 29634 USA
[3] Mississippi State Univ, Coll Vet Med, Dept Basic Sci, Mississippi State, MS 39762 USA
基金:
美国国家科学基金会;
关键词:
glycan array;
PLS;
QSAR;
HEPARAN-SULFATE PROTEOGLYCANS;
CANCER;
D O I:
10.1093/glycob/cwr163
中图分类号:
Q5 [生物化学];
Q7 [分子生物学];
学科分类号:
071010 ;
081704 ;
摘要:
Advances in glycan array technology have provided opportunities to automatically and systematically characterize the binding specificities of glycan-binding proteins. However, there is still a lack of robust methods for such analyses. In this study, we developed a novel quantitative structure-activity relationship (QSAR) method to analyze glycan array data. We first decomposed glycan chains into mono-, di-, tri- or tetrasaccharide subtrees. The bond information was incorporated into subtrees to help distinguish glycan chain structures. Then, we performed partial least-squares (PLS) regression on glycan array data using the subtrees as features. The application of QSAR to the glycan array data of different glycan-binding proteins demonstrated that PLS regression using subtree features can obtain higher R-2 values and a higher percentage of variance explained in glycan array intensities. Based on the regression coefficients of PLS, we were able to effectively identify subtrees that indicate the binding specificities of a glycan-binding protein. Our approach will facilitate the glycan-binding specificity analysis using the glycan array. A user-friendly web tool of the QSAR method is available at http://bci.clemson.edu/tools/glycan_array.
引用
收藏
页码:552 / 560
页数:9
相关论文