Preprocessing peptide sequences for multivariate sequence-property analysis

被引:52
作者
Andersson, PM [1 ]
Sjostrom, M
Lundstedt, T
机构
[1] Umea Univ, Dept Organ Chem, Chemometr Res Grp, S-90187 Umea, Sweden
[2] Pharmacia & Upjohn Inc, SPOC, S-75182 Uppsala, Sweden
关键词
peptide sequences; peptide libraries; z-scales; auto cross covariances; QSAR; PLS; OSC;
D O I
10.1016/S0169-7439(98)00062-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The increasing number of peptide sequences with different lengths, available from synthesised peptide libraries and sequenced proteins are potentially valuable for evaluating structure-activity relationships. However, in order to apply multivariate classification or Quantitative Structure-Activity Relationship (QSAR) analyses on such sequences, it is necessary to have a preprocessing method that translates them into a uniform set of variables. By describing each amino acid by principal properties (z-scales) and then calculating auto cross covariances (ACCs) for each sequence, a new uniform matrix is generated, i.e., each sequence is described by a vector with equal length. The ACC approach has been used before for classification of peptides, but hen, a QSAR analysis based on 20 peptide sequences of different lengths is presented. The results show that it is possible to obtain a predictive multivariate QSAR model (R-2 Y-cum = 86.2%, Q(cum)(2) = 60.3%) based on the ACC preprocessing method, together with Orthogonal Signal Correction (OSC) and Partial Least Squares (PLS). The model generated was further validated by permutation tests and found to be valid. The new variables generated by ACCs can also be interpreted, i.e., used to identify important features in the original sequences. (C) 1998 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:41 / 50
页数:10
相关论文
共 16 条
  • [1] [Anonymous], CHEMOMETRIC METHODS
  • [3] ERIKSSON L, 1997, IN PRESS
  • [4] AMIDE BOND REPLACEMENTS INCORPORATED INTO CCK-B SELECTIVE DIPEPTOIDS
    FINCHAM, CI
    HIGGINBOTTOM, M
    HILL, DR
    HORWELL, DC
    OTOOLE, JC
    RATCLIFFE, GS
    REES, DC
    ROBERTS, E
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1992, 35 (08) : 1472 - 1484
  • [5] APPLICATIONS OF COMBINATORIAL TECHNOLOGIES TO DRUG DISCOVERY .1. BACKGROUND AND PEPTIDE COMBINATORIAL LIBRARIES
    GALLOP, MA
    BARRETT, RW
    DOWER, WJ
    FODOR, SPA
    GORDON, EM
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1994, 37 (09) : 1233 - 1251
  • [6] PARTIAL LEAST-SQUARES REGRESSION - A TUTORIAL
    GELADI, P
    KOWALSKI, BR
    [J]. ANALYTICA CHIMICA ACTA, 1986, 185 : 1 - 17
  • [7] PEPTIDE QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS, A MULTIVARIATE APPROACH
    HELLBERG, S
    SJOSTROM, M
    SKAGERBERG, B
    WOLD, S
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1987, 30 (07) : 1126 - 1135
  • [8] HELLBERG S, 1991, INT J PEPT PROT RES, V37, P414
  • [9] SANDBERG M, 1997, IN PRESS J MED CHEM
  • [10] POLYPEPTIDE SEQUENCE PROPERTY RELATIONSHIPS IN ESCHERICHIA-COLI BASED ON AUTO CROSS COVARIANCES
    SJOSTROM, M
    RANNAR, S
    WIESLANDER, A
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1995, 29 (02) : 295 - 305