Prediction of delayed retention of antibodies in hydrophobic interaction chromatography from sequence using machine learning

被引:43
作者
Jain, Tushar [1 ]
Boland, Todd [1 ]
Lilov, Asparouh [2 ]
Burnina, Irina [2 ]
Brown, Michael [2 ]
Xu, Yingda [2 ]
Vasquez, Maximiliano [1 ]
机构
[1] Adimab, Computat Biol, Palo Alto, CA 94303 USA
[2] Adimab, Prot Analyt, Lebanon, NH USA
关键词
RELATIVE SOLVENT ACCESSIBILITY; REVERSIBLE SELF-ASSOCIATION; SECONDARY STRUCTURE; PROTEIN AGGREGATION; VISCOSITY BEHAVIOR; RANDOM FORESTS; PHAGE DISPLAY; SURFACE-AREA; DEVELOPABILITY; LIBRARIES;
D O I
10.1093/bioinformatics/btx519
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The hydrophobicity of a monoclonal antibody is an important biophysical property relevant for its developability into a therapeutic. In addition to characterizing heterogeneity, Hydrophobic Interaction Chromatography (HIC) is an assay that is often used to quantify the hydrophobicity of an antibody to assess downstream risks. Earlier studies have shown that retention times in this assay can be correlated to amino-acid or atomic propensities weighted by the surface areas obtained from protein 3-dimensional structures. The goal of this study is to develop models to enable prediction of delayed HIC retention times directly from sequence. Results: We utilize the randomforest machine learning approach to estimate the surface exposure of amino-acid side-chains in the variable region directly from the antibody sequence. We obtain mean-absolute errors of 4.6% for the prediction of surface exposure. Using experimental HIC data along with the estimated surface areas, we derive an amino-acid propensity scale that enables prediction of antibodies likely to have delayed retention times in the assay. We achieve a cross-validation Area Under Curve of 0.85 for the Receiver Operating Characteristic curve of our model. The low computational expense and high accuracy of this approach enables real-time assessment of hydrophobic character to enable prioritization of antibodies during the discovery process and rational engineering to reduce hydrophobic liabilities.
引用
收藏
页码:3758 / 3766
页数:9
相关论文
共 95 条
[1]   Combining prediction of secondary structure and solvent accessibility in proteins [J].
Adamczak, R ;
Porollo, A ;
Meller, J .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 59 (03) :467-475
[2]  
Agrawal N.J., 2016, MABS, V8, P1
[3]   Aggregation in Protein-Based Biotherapeutics: Computational Studies and Tools to Identify Aggregation-Prone Regions [J].
Agrawal, Neeraj J. ;
Kumar, Sandeep ;
Wang, Xiaoling ;
Helk, Bernhard ;
Singh, Satish K. ;
Trout, Bernhardt L. .
JOURNAL OF PHARMACEUTICAL SCIENCES, 2011, 100 (12) :5081-5095
[4]   NETASA: neural network based prediction of solvent accessibility [J].
Ahmad, S ;
Gromiha, MM .
BIOINFORMATICS, 2002, 18 (06) :819-824
[5]   A Review of Methods Available to Estimate Solvent-Accessible Surface Areas of Soluble Proteins in the Folded and Unfolded States [J].
Ali, Syed Ausaf ;
Hassan, Md Imtaiyaz ;
Islam, Asimul ;
Ahmad, Faizan .
CURRENT PROTEIN & PEPTIDE SCIENCE, 2014, 15 (05) :456-476
[6]   Standard conformations for the canonical structures of immunoglobulins [J].
AlLazikani, B ;
Lesk, AM ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 273 (04) :927-948
[7]   Second Antibody Modeling Assessment (AMA-II) [J].
Almagro, Juan C. ;
Teplyakov, Alexey ;
Luo, Jinquan ;
Sweet, Raymond W. ;
Kodangattil, Sreekumar ;
Hernandez-Guzman, Francisco ;
Gilliland, Gary L. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2014, 82 (08) :1553-1562
[8]   Protein aggregation, particle formation, characterization & rheology [J].
Amin, Samiul ;
Barnett, Gregory V. ;
Pathak, Jai A. ;
Roberts, Christopher J. ;
Sarangapani, Prasad S. .
CURRENT OPINION IN COLLOID & INTERFACE SCIENCE, 2014, 19 (05) :438-449
[9]   Charge-mediated Fab-Fc interactions in an IgG1 antibody induce reversible self-association, cluster formation, and elevated viscosity [J].
Arora, Jayant ;
Hu, Yue ;
Esfandiary, Reza ;
Sathish, Hasige A. ;
Bishop, Steven M. ;
Joshi, Sangeeta B. ;
Middaugh, C. Russell ;
Volkin, David B. ;
Weis, David D. .
MABS, 2016, 8 (08) :1561-1574
[10]   A dissection of specific and non-specific protein - Protein interfaces [J].
Bahadur, RP ;
Chakrabarti, P ;
Rodier, F ;
Janin, J .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 336 (04) :943-955