Solubility-Weighted Index: fast and accurate prediction of protein solubility

被引:51
作者
Bhandari, Bikash K. [1 ]
Gardner, Paul P. [1 ,2 ]
Lim, Chun Shen [1 ]
机构
[1] Univ Otago, Sch Biomed Sci, Dept Biochem, Dunedin, New Zealand
[2] Univ Canterbury, Biomol Interact Ctr, Christchurch, New Zealand
关键词
PRODUCTION PLATFORM; FLEXIBILITY; EXPRESSION; !text type='PYTHON']PYTHON[!/text; TOOL; CONSTRAINTS; EVOLUTION; PEPTIDE; SURFACE;
D O I
10.1093/bioinformatics/btaa578
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Recombinant protein production is a widely used technique in the biotechnology and biomedical industries, yet only a quarter of target proteins are soluble and can therefore be purified. Results: We have discovered that global structural flexibility, which can be modeled by normalized B-factors, accurately predicts the solubility of 12 216 recombinant proteins expressed in Escherichia coll. We have optimized these B-factors, and derived a new set of values for solubility scoring that further improves prediction accuracy. We call this new predictor the 'Solubility-Weighted Index' (SWI). Importantly, SWI outperforms many existing protein solubility prediction tools. Furthermore, we have developed 'SoDoPE' (Soluble Domain for Protein Expression), a web interface that allows users to choose a protein region of interest for predicting and maximizing both protein expression and solubility.
引用
收藏
页码:4691 / 4698
页数:8
相关论文
共 76 条
[61]   Amino acid contribution to protein solubility: Asp, Glu, and Ser contribute more favorably than the other hydrophilic amino acids in RNase Sa [J].
Trevino, Saul R. ;
Scholtz, J. Martin ;
Pace, C. Nick .
JOURNAL OF MOLECULAR BIOLOGY, 2007, 366 (02) :449-460
[62]   Practical considerations in refolding proteins from inclusion bodies [J].
Tsumoto, K ;
Ejima, D ;
Kumagai, I ;
Arakawa, T .
PROTEIN EXPRESSION AND PURIFICATION, 2003, 28 (01) :1-8
[63]   The NumPy Array: A Structure for Efficient Numerical Computation [J].
van der Walt, Stefan ;
Colbert, S. Chris ;
Varoquaux, Gael .
COMPUTING IN SCIENCE & ENGINEERING, 2011, 13 (02) :22-30
[64]   ACCURACY OF PROTEIN FLEXIBILITY PREDICTIONS [J].
VIHINEN, M ;
TORKKILA, E ;
RIIKONEN, P .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1994, 19 (02) :141-149
[65]   RELATIONSHIP OF PROTEIN FLEXIBILITY TO THERMOSTABILITY [J].
VIHINEN, M .
PROTEIN ENGINEERING, 1987, 1 (06) :477-480
[66]   Genetic screens and directed evolution for protein solubility [J].
Waldo, GS .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2003, 7 (01) :33-38
[67]   Potential aggregation prone regions in biotherapeutics A survey of commercial monoclonal antibodies [J].
Wang, Xiaoling ;
Das, Tapan K. ;
Singh, Satish K. ;
Kumar, Sandeep .
MABS, 2009, 1 (03) :254-267
[68]   Lysine and Arginine Content of Proteins: Computational Analysis Suggests a New Tool for Solubility Design [J].
Warwicker, Jim ;
Charonis, Spyros ;
Curtis, Robin A. .
MOLECULAR PHARMACEUTICS, 2014, 11 (01) :294-303
[69]  
WILKINSON DL, 1991, BIO-TECHNOL, V9, P443, DOI 10.1038/nbt0591-443
[70]   A new coronavirus associated with human respiratory disease in China [J].
Wu, Fan ;
Zhao, Su ;
Yu, Bin ;
Chen, Yan-Mei ;
Wang, Wen ;
Song, Zhi-Gang ;
Hu, Yi ;
Tao, Zhao-Wu ;
Tian, Jun-Hua ;
Pei, Yuan-Yuan ;
Yuan, Ming-Li ;
Zhang, Yu-Ling ;
Dai, Fa-Hui ;
Liu, Yi ;
Wang, Qi-Min ;
Zheng, Jiao-Jiao ;
Xu, Lin ;
Holmes, Edward C. ;
Zhang, Yong-Zhen .
NATURE, 2020, 579 (7798) :265-+