Protease substrate site predictors derived from machine learning on multilevel substrate phage display data

被引:14
作者
Chen, Ching-Tai [1 ,2 ]
Yang, Ei-Wen [1 ]
Hsu, Hung-Ju [3 ,4 ]
Sun, Yi-Kun [3 ]
Hsu, Wen-Lian [1 ]
Yang, An-Suei [3 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei 115, Taiwan
[2] Natl Tsing Hua Univ, Inst Bioinformat, Hsinchu 300, Taiwan
[3] Acad Sinica, Genom Res Ctr, Taipei 115, Taiwan
[4] Natl Def Med Univ, Grad Inst Life Sci, Taipei 114, Taiwan
关键词
D O I
10.1093/bioinformatics/btn538
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Regulatory proteases modulate proteomic dynamics with a spectrum of specificities against substrate proteins. Predictions of the substrate sites in a proteome for the proteases would facilitate understanding the biological functions of the proteases. High-throughput experiments could generate suitable datasets for machine learning to grasp complex relationships between the substrate sequences and the enzymatic specificities. But the capability in predicting protease substrate sites by integrating the machine learning algorithms with the experimental methodology has yet to be demonstrated. Results: Factor Xa, a key regulatory protease in the blood coagulation system, was used as model system, for which effective substrate site predictors were developed and benchmarked. The predictors were derived from bootstrap aggregation (machine learning) algorithms trained with data obtained from multilevel substrate phage display experiments. The experimental sampling and computational learning on substrate specificities can be generalized to proteases for which the active forms are available for the in vitro experiments.
引用
收藏
页码:2691 / 2697
页数:7
相关论文
共 38 条
[11]   CaSPredictor:: a new computer-based tool for caspase substrate prediction [J].
Garay-Malpartida, HM ;
Occhiucci, JM ;
Alves, J ;
Belizário, JE .
BIOINFORMATICS, 2005, 21 :I169-I176
[12]   Profiling serine protease substrate specificity with solution phase fluorogenic peptide microarrays [J].
Gosalia, DN ;
Salisbury, CM ;
Maly, DJ ;
Ellman, JA ;
Diamond, SL .
PROTEOMICS, 2005, 5 (05) :1292-1298
[13]   The discovery of the factor Xa inhibitor otamixaban: From lead identification to clinical development [J].
Guertin, Kevin R. ;
Choi, Yong-Mi .
CURRENT MEDICINAL CHEMISTRY, 2007, 14 (23) :2471-2481
[14]   Rapid and general profiling of protease specificity by using combinatorial fluorogenic substrate libraries [J].
Harris, JL ;
Backes, BJ ;
Leonetti, F ;
Mahrus, S ;
Ellman, JA ;
Craik, CS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (14) :7754-7759
[15]   Serine protease mechanism and specificity [J].
Hedstrom, L .
CHEMICAL REVIEWS, 2002, 102 (12) :4501-4523
[16]   BIOCHEMISTRY OF FACTOR-X [J].
HERTZBERG, M .
BLOOD REVIEWS, 1994, 8 (01) :56-62
[17]   Factor Xa active site substrate specificity with substrate phage display and computational molecular modeling [J].
Hsu, Hung-Ju ;
Tsai, Keng-Chang ;
Sun, Yi-Kun ;
Chang, Hung-Ju ;
Huang, Yi-Jen ;
Yu, Hui-Ming ;
Lin, Chun-Hung ;
Mao, Shi-Shan ;
Yang, An-Suei .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2008, 283 (18) :12343-12353
[18]   A critical review of the methods for cleavage of fusion proteins with thrombin and factor Xa [J].
Jenny, RJ ;
Mann, KG ;
Lundblad, RL .
PROTEIN EXPRESSION AND PURIFICATION, 2003, 31 (01) :1-11
[19]   Overview of cell death signaling pathways [J].
Jin, ZY ;
El-Deiry, WS .
CANCER BIOLOGY & THERAPY, 2005, 4 (02) :139-163
[20]   AAindex: Amino acid index database [J].
Kawashima, S ;
Kanehisa, M .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :374-374