Exploring Dimensionality Reduction Techniques for Deep Learning Driven QSAR Models of Mutagenicity

Cited by: 3

Authors:

Kalian, Alexander D. [1]

Benfenati, Emilio [2]

Osborne, Olivia J. [3]

Gott, David [3]

Potter, Claire [3]

Dorne, Jean-Lou C. M. [4]

Guo, Miao [5]

Hogstrand, Christer [6]

Affiliations:

[1] Kings Coll London, Dept Nutr Sci, Franklin Wilkins Bldg,150 Stamford St, London SE1 9NH, England

[2] Ist Ric Farmacolog Mario Negri IRCCS, Via Mario Negri 2, I-20156 Milan, Italy

[3] Food Stand Agcy, 70 Petty France, London SW1H 9EX, England

[4] European Food Safety Author EFSA, Via Carlo Magno 1A, I-43126 Parma, Italy

[5] Kings Coll London, Dept Engn, Strand Campus, London WC2R 2LS, England

[6] Kings Coll London, Dept Analyt Environm & Forens Sci, Franklin Wilkins Bldg,150 Stamford St, London SE1 9NH, England

Source:

TOXICS | 2023 / Volume 11 / Issue 07

Funding:

UK Biotechnology and Biological Sciences Research Council (BBSRC);

Keywords:

QSAR; dimensionality reduction; deep learning; autoencoder; principal component analysis; locally linear embedding; grid search; hyperparameter optimisation; mutagenicity; cheminformatics;

DOI:

10.3390/toxics11070572

Chinese Library Classification (CLC) number:

X [Environmental Science, Safety Science];

Subject classification code:

08 ; 0830 ;

Abstract:

Dimensionality reduction techniques are crucial for enabling deep learning driven quantitative structure-activity relationship (QSAR) models to navigate higher dimensional toxicological spaces; however, the use of specific techniques is often arbitrary and poorly explored. Six dimensionality reduction techniques (both linear and non-linear) were hence applied to a higher dimensionality mutagenicity dataset and compared in their ability to power a simple deep learning driven QSAR model, following grid searches for optimal hyperparameter values. It was found that comparatively simpler linear techniques, such as principal component analysis (PCA), were sufficient for enabling optimal QSAR model performances, which indicated that the original dataset was at least approximately linearly separable (in accordance with Cover's theorem). However, certain non-linear techniques, such as kernel PCA and autoencoders, performed at closely comparable levels, while (especially in the case of autoencoders) being more widely applicable to potentially non-linearly separable datasets. Analysis of the chemical space, in terms of XLogP and molecular weight, uncovered that the vast majority of testing data occurred within the defined applicability domain, and that certain regions were measurably more problematic and antagonised performances. It was, however, indicated that certain dimensionality reduction techniques were able to facilitate uniquely beneficial navigations of the chemical space.
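The workflow summarised in the abstract (reduce dimensionality, train a model on the reduced features, and grid-search a hyperparameter) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the data are synthetic rather than the mutagenicity dataset, a gradient-descent logistic regression stands in for the deep learning QSAR model, only PCA is shown, and the grid of component counts and the 300/100 hold-out split are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for a descriptor matrix (NOT the paper's data):
# 400 samples, 50 features, binary label driven by two latent directions.
n_samples, n_features, n_latent = 400, 50, 5
latent = rng.normal(size=(n_samples, n_latent))
mixing = rng.normal(size=(n_latent, n_features))
X = latent @ mixing + 0.5 * rng.normal(size=(n_samples, n_features))
y = (latent[:, 0] + latent[:, 1] > 0).astype(float)

# Hypothetical hold-out split: first 300 rows for training, the rest for validation.
split = 300
X_train, X_val = X[:split], X[split:]
y_train, y_val = y[:split], y[split:]


def fit_pca(X, n_components):
    """Fit PCA via SVD; return the mean, principal axes, and per-score std."""
    mean = X.mean(axis=0)
    _, s, vt = np.linalg.svd(X - mean, full_matrices=False)
    components = vt[:n_components]
    scale = s[:n_components] / np.sqrt(len(X) - 1)
    return mean, components, scale


def transform(X, mean, components, scale):
    """Project onto the retained axes and whiten using the training std."""
    return (X - mean) @ components.T / scale


def fit_logistic(Z, y, lr=0.5, n_iter=300):
    """Gradient-descent logistic regression (stand-in for the QSAR model)."""
    w = np.zeros(Z.shape[1])
    b = 0.0
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(Z @ w + b)))
        grad = p - y
        w -= lr * Z.T @ grad / len(y)
        b -= lr * grad.mean()
    return w, b


def accuracy(Z, y, w, b):
    """Fraction of samples whose predicted class matches the label."""
    return float(np.mean(((Z @ w + b) > 0) == (y > 0.5)))


# Grid search over the number of retained principal components.
results = {}
for k in (1, 2, 5, 10, 20):
    mean, comps, scale = fit_pca(X_train, k)
    w, b = fit_logistic(transform(X_train, mean, comps, scale), y_train)
    results[k] = accuracy(transform(X_val, mean, comps, scale), y_val, w, b)

best_k = max(results, key=results.get)
print(results, best_k)
```

The PCA is fitted on the training rows only and then applied to the validation rows, so the validation accuracy is not influenced by the held-out data. Swapping `fit_pca` and `transform` for another reduction method would allow the same loop to compare techniques.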


Pages: 24
