Protein fold recognition using neural networks and support vector machines

Cited by: 0
Authors
Jiang, N [1 ]
Wu, WXY [1 ]
Mitchell, I [1 ]
Affiliation
[1] Middlesex Univ, Sch Comp Sci, London NW4 4BA, England
Source
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2005, PROCEEDINGS | 2005 / Volume 3578
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, a new fold recognition model with mixed environment-specific substitution mapping (called MESSM) is proposed with three key features: 1) a structurally-derived substitution score is generated using neural networks; 2) a mixed environment-specific substitution mapping is developed by combining the structurally-derived substitution score with the sequence profile from well-developed sequence substitution matrices; 3) a support vector machine is employed to measure the significance of the sequence-structure alignment. Tested on two benchmark problems, the MESSM model shows performance comparable to that of more computationally intensive, energy-potential-based fold recognition models. The results also demonstrate that the new fold recognition model with mixed substitution mapping performs better than a model using either the structure profile or the sequence profile alone. The MESSM model presents a new way to develop an efficient tool for protein fold recognition.
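As a rough illustration of the second feature, the sketch below blends an environment-specific, structure-derived score with a sequence substitution score. The abstract does not say how MESSM combines the two sources, so the linear weighting, the function name `mix_substitution_scores`, the environment labels, and all numeric values here are assumptions made purely for illustration.

```python
from typing import Dict, Tuple

# Hypothetical key types (assumptions, not taken from the paper):
#   structure scores are indexed by (environment, template residue, query residue)
#   sequence scores are indexed by (template residue, query residue)
EnvKey = Tuple[str, str, str]
SeqKey = Tuple[str, str]


def mix_substitution_scores(
    structure_scores: Dict[EnvKey, float],
    sequence_scores: Dict[SeqKey, float],
    weight: float = 0.5,
) -> Dict[EnvKey, float]:
    """Blend two score sources with a linear weight (an assumed scheme).

    mixed = weight * structure_score + (1 - weight) * sequence_score
    Entries with no matching sequence score are skipped.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")

    mixed: Dict[EnvKey, float] = {}
    for (env, template_res, query_res), s_struct in structure_scores.items():
        s_seq = sequence_scores.get((template_res, query_res))
        if s_seq is None:
            continue  # no sequence score available for this residue pair
        mixed[(env, template_res, query_res)] = (
            weight * s_struct + (1.0 - weight) * s_seq
        )
    return mixed


# Toy values only; they are not results or parameters from the paper.
toy_structure = {("helix", "A", "A"): 2.0, ("helix", "A", "G"): -1.0}
toy_sequence = {("A", "A"): 4.0, ("A", "G"): 0.0}
toy_mixed = mix_substitution_scores(toy_structure, toy_sequence, weight=0.25)
```

With `weight=1.0` the mapping reduces to the structure-only score and with `weight=0.0` to the sequence-only score, which mirrors the comparison described in the abstract between the mixed mapping and either profile alone.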
Pages: 462-469
Number of pages: 8
Related papers
16 in total
[1]  
[Anonymous], 2005, SUPPORT VECTOR MACHI
[2]  
BALDI P, 2001, BIOINFORMAICS MACHIN
[3]  
Bologna G, 2002, ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING, P2492
[4]   STATISTICS OF SEQUENCE-STRUCTURE THREADING [J].
BRYANT, SH ;
ALTSCHUL, SF .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1995, 5 (02) :236-244
[5]   Multi-class protein fold recognition using support vector machines and neural networks [J].
Ding, CHQ ;
Dubchak, I .
BIOINFORMATICS, 2001, 17 (04) :349-358
[6]  
Fischer D, 1996, PACIFIC SYMPOSIUM ON BIOCOMPUTING '97, P8
[7]   AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) :10915-10919
[8]  
JIANG N, 2004, P 2004 ACM S APPL CO, P209
[9]   GenTHREADER: An efficient and reliable protein fold recognition method for genomic sequences [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 287 (04) :797-815
[10]   Enhanced genome annotation using structural profiles in the program 3D-PSSM [J].
Kelley, LA ;
MacCallum, RM ;
Sternberg, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 299 (02) :499-520