Predicting blood-brain barrier permeability of molecules with a large language model and machine learning

被引:17
作者
Huang, Eddie T. C. [1 ]
Yang, Jai-Sing [2 ]
Liao, Ken Y. K. [1 ]
Tseng, Warren C. W. [1 ]
Lee, C. K. [1 ]
Gill, Michelle [1 ]
Compas, Colin [1 ]
See, Simon [1 ]
Tsai, Fuu-Jen [3 ,4 ]
机构
[1] NVIDIA Corp, NVIDIA AI Technol Ctr, Santa Clara, CA USA
[2] China Med Univ, China Med Univ Hosp, Dept Med Res, Taichung, Taiwan
[3] China Med Univ, Childrens Hosp, Coll Chinese Med, Sch Chinese Med, 2 Yude Rd, Taichung 404332, Taiwan
[4] China Med Univ, Childrens Hosp, Taichung, Taiwan
关键词
Blood-brain barrier (BBB) permeability; Machine learning; Artificial intelligence (AI); Natural Products Research Laboratories (NPRL); IN-SILICO PREDICTION; VALIDATION; PLASMA; PENETRATION; TRANSFORMER; INFORMATION; DISCOVERY; TOOLS;
D O I
10.1038/s41598-024-66897-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Predicting the blood-brain barrier (BBB) permeability of small-molecule compounds using a novel artificial intelligence platform is necessary for drug discovery. Machine learning and a large language model on artificial intelligence (AI) tools improve the accuracy and shorten the time for new drug development. The primary goal of this research is to develop artificial intelligence (AI) computing models and novel deep learning architectures capable of predicting whether molecules can permeate the human blood-brain barrier (BBB). The in silico (computational) and in vitro (experimental) results were validated by the Natural Products Research Laboratories (NPRL) at China Medical University Hospital (CMUH). The transformer-based MegaMolBART was used as the simplified molecular input line entry system (SMILES) encoder with an XGBoost classifier as an in silico method to check if a molecule could cross through the BBB. We used Morgan or Circular fingerprints to apply the Morgan algorithm to a set of atomic invariants as a baseline encoder also with an XGBoost classifier to compare the results. BBB permeability was assessed in vitro using three-dimensional (3D) human BBB spheroids (human brain microvascular endothelial cells, brain vascular pericytes, and astrocytes). Using multiple BBB databases, the results of the final in silico transformer and XGBoost model achieved an area under the receiver operating characteristic curve of 0.88 on the held-out test dataset. Temozolomide (TMZ) and 21 randomly selected BBB permeable compounds (Pred scores = 1, indicating BBB-permeable) from the NPRL penetrated human BBB spheroid cells. No evidence suggests that ferulic acid or five BBB-impermeable compounds (Pred scores < 1.29423E-05, which designate compounds that pass through the human BBB) can pass through the spheroid cells of the BBB. Our validation of in vitro experiments indicated that the in silico prediction of small-molecule permeation in the BBB model is accurate. Transformer-based models like MegaMolBART, leveraging the SMILES representations of molecules, show great promise for applications in new drug discovery. These models have the potential to accelerate the development of novel targeted treatments for disorders of the central nervous system.
引用
收藏
页数:9
相关论文
共 85 条
[11]   Targeting the central nervous system in lysosomal storage diseases: Strategies to deliver therapeutics across the blood-brain barrier [J].
Critchley, Bethan J. ;
Gaspar, H. Bobby ;
Benedetti, Sara .
MOLECULAR THERAPY, 2023, 31 (03) :657-675
[12]   Review of Machine Learning Techniques in Soft Tissue Biomechanics and Biomaterials [J].
Donmazov, Samir ;
Saruhan, Eda Nur ;
Pekkan, Kerem ;
Piskin, Senol .
CARDIOVASCULAR ENGINEERING AND TECHNOLOGY, 2024, 15 (05) :522-549
[13]   CGRdb2.0: A Python']Python Database Management System for Molecules, Reactions, and Chemical Data [J].
Gimadiev, Timur ;
Nugmanov, Ramil ;
Khakimova, Aigul ;
Fatykhova, Adeliya ;
Madzhidov, Timur ;
Sidorov, Pavel ;
Varnek, Alexandre .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (09) :2015-2020
[14]   Autoencoders for sample size estimation for fully connected neural network classifiers [J].
Gulamali, Faris F. F. ;
Sawant, Ashwin S. S. ;
Kovatch, Patricia ;
Glicksberg, Benjamin ;
Charney, Alexander ;
Nadkarni, Girish N. N. ;
Oermann, Eric .
NPJ DIGITAL MEDICINE, 2022, 5 (01)
[15]   Large-Scale Evaluation of Collision Cross Sections to Investigate Blood-Brain Barrier Permeation of Drugs [J].
Guntner, Armin Sebastian ;
Boegl, Thomas ;
Mlynek, Franz ;
Buchberger, Wolfgang .
PHARMACEUTICS, 2021, 13 (12)
[16]   Review of machine learning and deep learning models for toxicity prediction [J].
Guo, Wenjing ;
Liu, Jie ;
Dong, Fan ;
Song, Meng ;
Li, Zoe ;
Khan, Md Kamrul Hasan ;
Patterson, Tucker A. ;
Hong, Huixiao .
EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 248 (21) :1952-1973
[17]   Neural Networks with Emotion Associations, Topic Modeling and Supervised Term Weighting for Sentiment Analysis [J].
Hajek, Petr ;
Barushka, Aliaksandr ;
Munk, Michal .
INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2021, 31 (10)
[18]   In vivo methods for imaging blood-brain barrier function and dysfunction [J].
Harris, William James ;
Asselin, Marie-Claude ;
Hinz, Rainer ;
Parkes, Laura Michelle ;
Allan, Stuart ;
Schiessl, Ingo ;
Boutin, Herve ;
Dickie, Ben Robert .
EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2023, 50 (04) :1051-1083
[19]   ECG Heartbeat Classification Using Machine Learning and Metaheuristic Optimization for Smart Healthcare Systems [J].
Hassaballah, Mahmoud ;
Wazery, Yaser M. M. ;
Ibrahim, Ibrahim E. E. ;
Farag, Aly .
BIOENGINEERING-BASEL, 2023, 10 (04)
[20]   In Silico Target Analysis of Treatment for COVID-19 Using Huang-Lian-Shang-Qing-Wan, a Traditional Chinese Medicine Formula [J].
Huang, Ching-Wen ;
Ha, Hai-Anh ;
Tsai, Shih-Chang ;
Lu, Chi-Cheng ;
Lee, Chao-Ying ;
Tsai, Yuh-Feng ;
Tsai, Fuu-Jen ;
Chiu, Yu-Jen ;
Wang, Guo-Kai ;
Hsu, Chung-Hua ;
Yang, Jai-Sing .
NATURAL PRODUCT COMMUNICATIONS, 2021, 16 (10)