LEARNING AND ALIGNMENT METHODS APPLIED TO PROTEIN-STRUCTURE PREDICTION

被引:3
作者
GRACY, J [1 ]
CHICHE, L [1 ]
SALLANTIN, J [1 ]
机构
[1] CTR PHARMACOL ENDOCRINOL, CNRS, INSERM, F-34094 MONTPELLIER, FRANCE
关键词
MACHINE LEARNING; SECONDARY STRUCTURE PREDICTION; COMPATIBILITY SEQUENCE-STRUCTURE; SEQUENCE ALIGNMENT;
D O I
10.1016/0300-9084(93)90169-S
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Learning techniques are able to extract structural knowledge specific to a selected set of proteins. We describe two algorithms that optimize scores expressing the propensity of a polypeptide sequence to adopt a local fold. The first algorithm generates secondary structure prediction rules based on a dictionary of geometrical patterns frequently found in the learning database. The second algorithm leads to scores that indicate the fit between an amino acid and a given local structural environment. Dynamic programming is then used to align structural information profiles by modifying the local mutation cost with the above learned functions. The main features of the system are exemplified on the structural prediction of the N-terminal domain of the CD4 antigen. Then the usefulness of additional 3-D information in the alignment is benchmarked on eight pairs of weakly homologous proteins.
引用
收藏
页码:353 / 361
页数:9
相关论文
共 17 条
[1]  
[Anonymous], 1987, LEARNING INTERNAL RE
[2]   PATTERNS OF DIVERGENCE IN HOMOLOGOUS PROTEINS AS INDICATORS OF SECONDARY AND TERTIARY STRUCTURE - A PREDICTION OF THE STRUCTURE OF THE CATALYTIC DOMAIN OF PROTEIN-KINASES [J].
BENNER, SA ;
GERLOFF, D .
ADVANCES IN ENZYME REGULATION, 1991, 31 :121-181
[3]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[4]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[5]  
Dayhoff MO, 1978, ATL PROTEIN SEQ STRU, V5, P345
[6]  
DIDAY E, 1980, OPTIMISATION ELASSIF
[7]   A SEARCH FOR THE MOST STABLE FOLDS OF PROTEIN CHAINS [J].
FINKELSTEIN, AV ;
REVA, BA .
NATURE, 1991, 351 (6326) :497-499
[8]   ANALYSIS OF ACCURACY AND IMPLICATIONS OF SIMPLE METHODS FOR PREDICTING SECONDARY STRUCTURE OF GLOBULAR PROTEINS [J].
GARNIER, J ;
OSGUTHORPE, DJ ;
ROBSON, B .
JOURNAL OF MOLECULAR BIOLOGY, 1978, 120 (01) :97-120
[9]   PROTEIN SECONDARY STRUCTURE PREDICTION WITH A NEURAL NETWORK [J].
HOLLEY, LH ;
KARPLUS, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1989, 86 (01) :152-156
[10]   DICTIONARY OF PROTEIN SECONDARY STRUCTURE - PATTERN-RECOGNITION OF HYDROGEN-BONDED AND GEOMETRICAL FEATURES [J].
KABSCH, W ;
SANDER, C .
BIOPOLYMERS, 1983, 22 (12) :2577-2637