Prediction of Retention Indices in LC-HRMS for Enhanced Structural Identification of Organic Micropollutants in Water: Selectivity-Based Filtration

被引:1
作者
Kajtazi, Ardiana [1 ]
Kajtazi, Marin [2 ]
Barbetta, Maike Felipe Santos [3 ]
Bandini, Elena [1 ]
Eghbali, Hamed [4 ]
Lynen, Frederic [1 ]
机构
[1] Univ Ghent, Dept Organ & Macromol Chem, Separat Sci Grp, B-9000 Ghent, Belgium
[2] Univ Zagreb, Fac Mech Engn & Naval Architecture, Zagreb 10000, Croatia
[3] Univ Saao Paulo, Fac Philosophy Sci & Letters Ribeirao Preto, Dept Chem, BR-14040901 Ribeirao Preto, SP, Brazil
[4] Dow Benelux BV, Packaging & Specialty Plast R&D, NL-4530 AA Terneuzen, Netherlands
基金
欧盟地平线“2020”;
关键词
DRINKING-WATER; PERFORMANCE;
D O I
10.1021/acs.analchem.4c01784
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Addressing the global challenge of ensuring access to safe drinking water, especially in developing countries, demands cost-effective, eco-friendly, and readily available technologies. The persistence, toxicity, and bioaccumulation potential of organic pollutants arising from various human activities pose substantial hurdles. While high-performance liquid chromatography coupled with high-resolution mass spectrometry (HPLC-HRMS) is a widely utilized technique for identifying pollutants in water, the multitude of structures for a single elemental composition complicates structural identification. While current HRMS and MS/MS databases often can provide hits for known molecules, these are often erroneous or misleading when authentic standards are unavailable. In this research, a machine-learning algorithm is developed to support the structural elucidation of small organic pollutants in water, with a focus on (carbon, oxygen, and hydrogen-based) molecules weighing less than 500 Da. The approach relies on a comparison of the experimental and predicted retention of the possible structures of unknowns for which an elemental composition was obtained by HRMS. A promising novelty is thereby the improved removal of erroneous structures via the combination of the retention information obtained from two reversed-phase-based stationary phases, depicting different selectivities (octadecylsilica, C18 and pentafluorphenylsilica, F5). The study translates retention times into retention indices for instrument independence and transferability across diverse HPLC-HRMS systems. The predictive algorithm, utilizing retention data and molecular descriptors, accurately predicts retention indices and proves its utility by eliminating incorrect structural formulas through a 2-stationary phase intersection-based filtration. Using a data set of 100 training compounds and 16 external test set compounds, two Multiple Linear Regression (MLR), MLR-C18 and MLR-F5 models were developed, employing the 16 most influential descriptors, out of 5666 screened. MLR-C18 achieves precise RI predictions, R 2 = 0.97, RMSE = 36, MAE = 26, while MLR-F5, though slightly less accurate, maintains a performance with R 2 = 0.96, RMSE = 44, MAE = 34. The intersection-based filtration (within +/- 1.5 sigma) showed the elimination of more than 70% of impossible structures for a given elemental composition. The model was further implemented in the identification of a drinking water sample to prove its potential. This tool holds significant promise for supporting water quality management and sustainable practices, contributing to faster structural identification of unknown organic micropollutants in water.
引用
收藏
页码:65 / 74
页数:10
相关论文
共 54 条
  • [1] Development of Liquid Chromatographic Retention Index Based on Cocamide Diethanolamine Homologous Series (C(n)-DEA)
    Aalizadeh, Reza
    Nikolopoulou, Varvara
    Thomaidis, Nikolaos S.
    [J]. ANALYTICAL CHEMISTRY, 2022, 94 (46) : 15987 - 15996
  • [2] A critical review on the treatment of dye-containing wastewater: Ecotoxicological and health concerns of textile dyes and possible remediation approaches for environmental safety
    Al-Tohamy, Rania
    Ali, Sameh S.
    Li, Fanghua
    Okasha, Kamal M.
    Mahmoud, Yehia A. -G.
    Elsamahy, Tamer
    Jiao, Haixin
    Fu, Yinyi
    Sun, Jianzhong
    [J]. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY, 2022, 231
  • [3] Molecular modeling and prediction accuracy in Quantitative Structure-Retention Relationship calculations for chromatography
    Amos, Ruth I. J.
    Haddad, Paul R.
    Szucs, Roman
    Dolan, John W.
    Pohl, Christopher A.
    [J]. TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2018, 105 : 352 - 359
  • [4] Chemical Contamination of Drinking Water in Resource-Constrained Settings: Global Prevalence and Piloted Mitigation Strategies
    Amrose, Susan E.
    Cherukumilli, Katya
    Wright, Natasha C.
    [J]. ANNUAL REVIEW OF ENVIRONMENT AND RESOURCES, VOL 45, 2020, 45 : 195 - 226
  • [5] [Anonymous], 2020, CURRENT DEV BIOTECHN
  • [6] Joint structural annotation of small molecules using liquid chromatography retention order and tandem mass spectrometry data
    Bach, Eric
    Schymanski, Emma L.
    Rousu, Juho
    [J]. NATURE MACHINE INTELLIGENCE, 2022, 4 (12) : 1224 - +
  • [7] Antibiotics traces in the aquatic environment: persistence and adverse environmental impact
    Bilal, Muhammad
    Mehmood, Shahid
    Rasheed, Tahir
    Iqbal, Hafiz M. N.
    [J]. CURRENT OPINION IN ENVIRONMENTAL SCIENCE & HEALTH, 2020, 13 : 68 - 74
  • [8] Predicting RP-LC retention indices of structurally unknown chemicals from mass spectrometry data
    Boelrijk, Jim
    van Herwerden, Denice
    Ensing, Bernd
    Forre, Patrick
    Samanipour, Saer
    [J]. JOURNAL OF CHEMINFORMATICS, 2023, 15 (01)
  • [9] Structure Elucidation of Unknown Metabolites in Metabolomics by Combined NMR and MS/MS Prediction
    Boiteau, Rene M.
    Hoyt, David W.
    Nicora, Carrie D.
    Kinmonth-Schultz, Hannah A.
    Ward, Joy K.
    Bingol, Kerem
    [J]. METABOLITES, 2018, 8 (01)
  • [10] Prediction of Retention Time and Collision Cross Section (CCSH plus , CCSH-, and CCSNa plus ) of Emerging Contaminants Using Multiple Adaptive Regression Splines
    Celma, Alberto
    Bade, Richard
    Sancho, Juan Vicente
    Hernandez, Felix
    Humphries, Melissa
    Bijlsma, Lubertus
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (22) : 5425 - 5434