Prediction of retention in hydrophilic interaction liquid chromatography using solute molecular descriptors based on chemical structures

被引:47
作者
Taraji, Maryam [1 ]
Haddad, Paul R. [1 ]
Amos, Ruth I. J. [1 ]
Talebi, Mohammad [1 ]
Szucs, Roman [2 ]
Dolan, John W. [3 ]
Pohl, Christopher A. [4 ]
机构
[1] Univ Tasmania, Sch Phys Sci Chem, Australian Ctr Res Separat Sci ACROSS, Private Bag 75, Hobart, Tas 7001, Australia
[2] Pfizer Global Res & Dev, Sandwich CT13 9NJ, Kent, England
[3] LC Resources Inc, 1795 NW Wallace Rd, Mcminnville, OR 97128 USA
[4] Thermo Fisher Sci, Sunnyvale, CA USA
基金
澳大利亚研究理事会;
关键词
HILIC method development; Retention prediction; QSRR; Optimized GA-PLS; HILIC mechanism; BETA-ADRENERGIC AGONISTS; ADRENORECEPTOR AGONISTS; FEATURE-SELECTION; FORCE FIELD; MASS-SPECTROMETRY; PLS-REGRESSION; PHASE; VALIDATION; PERFORMANCE; ANTAGONISTS;
D O I
10.1016/j.chroma.2016.12.025
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Quantitative structure-retention relationship (QSRR) models are developed to predict the retention times of analytes on five hydrophilic interaction liquid chromatography (HILIC) stationary phases (bare silica, amine, amide, diol and zwitterionic), with a view to selecting the most suitable stationary phase(s) for the separation of these analytes. The study was conducted using six P-adrenergic agonists as target analytes. Molecular descriptors were calculated based only on chemical structures optimized using density functional theory. A genetic algorithm (GA) was then used to select the most relevant molecular descriptors and these were used to build a retention model for each stationary phase using partial least squares (PLS) regression. This model was then used to predict the retention of the test set of target analytes. This process created an optimized descriptor set which enhanced the reliability of the developed QSRR models. Finally, the QSRR models developed in the work were utilized to provide some insight into the separation mechanisms operating in the HILIC mode. Three performance criteria - mean absolute error (MAE), root mean square error of prediction scaled to retention time (RMSEP), and the number of selected descriptors, were used to evaluate the developed models when applied to an external test set of six beta adrenergic agonists and showed highly predictive abilities. MAE values ranged from 13 to 25 s on four of the stationary phases, with a somewhat higher error (50 s) being observed for the zwitterionic phase. RMSEP values of 4.88-11.12% were recorded. Validation was performed through Y-randomization and chemical domain applicability, from which it was evident that the developed optimized GA-PLS models were robust. The high levels of accuracy, reliability and applicability of the models were to a large extent due to the optimization of the GA descriptor set and the presence of relevant structural and geometric molecular descriptors, together with descriptors based on important physicochemical properties, which establish a strong connection between retention time and meaningful chemical properties. The present strategy, while it is a pilot study, holds great promise for broader screening of HILIC stationary phases for desired separation, as well as for acquisition of information about molecular mechanisms of separation under chromatographic conditions. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:59 / 67
页数:9
相关论文
共 66 条
[51]   Comparative characterization of hydrophilic interaction liquid chromatography columns by linear solvation energy relationships [J].
Schuster, Georg ;
Lindner, Wolfgang .
JOURNAL OF CHROMATOGRAPHY A, 2013, 1273 :73-94
[52]   AB-INITIO CALCULATION OF VIBRATIONAL ABSORPTION AND CIRCULAR-DICHROISM SPECTRA USING DENSITY-FUNCTIONAL FORCE-FIELDS [J].
STEPHENS, PJ ;
DEVLIN, FJ ;
CHABALOWSKI, CF ;
FRISCH, MJ .
JOURNAL OF PHYSICAL CHEMISTRY, 1994, 98 (45) :11623-11627
[53]  
Stewart J. J. P., 2012, MOPAC2012 STEWART CO
[54]   Optimization of parameters for semiempirical methods VI: more modifications to the NDDO approximations and re-optimization of parameters [J].
Stewart, James J. P. .
JOURNAL OF MOLECULAR MODELING, 2013, 19 (01) :1-32
[55]   Performance comparison of partial least squares-related variable selection methods for quantitative structure retention relationships modelling of retention times in reversed-phase liquid chromatography [J].
Talebi, Mohammad ;
Schuster, Georg ;
Shellie, Robert A. ;
Szucs, Roman ;
Haddad, Paul R. .
JOURNAL OF CHROMATOGRAPHY A, 2015, 1424 :69-76
[56]  
Talete srl, 2014, DRAG 6 0 WIND SOFTW
[57]   Predicting liquid chromatographic retention times of peptides from the Drosophila melanogaster proteome by machine learning approaches [J].
Tian, Feifei ;
Yang, Li ;
Lv, Fenglin ;
Zhou, Peng .
ANALYTICA CHIMICA ACTA, 2009, 644 (1-2) :10-16
[58]  
Todeschini R., 2009, Molecular descriptors for chemoinformatics, DOI [10.1002/9783527628766, DOI 10.1002/9783527628766]
[59]   Quantum mechanical continuum solvation models [J].
Tomasi, J ;
Mennucci, B ;
Cammi, R .
CHEMICAL REVIEWS, 2005, 105 (08) :2999-3093
[60]   The importance of being earnest: Validation is the absolute essential for successful application and interpretation of QSPR models [J].
Tropsha, A ;
Gramatica, P ;
Gombar, VK .
QSAR & COMBINATORIAL SCIENCE, 2003, 22 (01) :69-77