Predicting blood-brain barrier permeability of molecules with a large language model and machine learning

被引:12
作者
Huang, Eddie T. C. [1 ]
Yang, Jai-Sing [2 ]
Liao, Ken Y. K. [1 ]
Tseng, Warren C. W. [1 ]
Lee, C. K. [1 ]
Gill, Michelle [1 ]
Compas, Colin [1 ]
See, Simon [1 ]
Tsai, Fuu-Jen [3 ,4 ]
机构
[1] NVIDIA Corp, NVIDIA AI Technol Ctr, Santa Clara, CA USA
[2] China Med Univ, China Med Univ Hosp, Dept Med Res, Taichung, Taiwan
[3] China Med Univ, Childrens Hosp, Coll Chinese Med, Sch Chinese Med, 2 Yude Rd, Taichung 404332, Taiwan
[4] China Med Univ, Childrens Hosp, Taichung, Taiwan
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Blood-brain barrier (BBB) permeability; Machine learning; Artificial intelligence (AI); Natural Products Research Laboratories (NPRL); IN-SILICO PREDICTION; VALIDATION; PLASMA; PENETRATION; TRANSFORMER; INFORMATION; DISCOVERY; TOOLS;
D O I
10.1038/s41598-024-66897-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Predicting the blood-brain barrier (BBB) permeability of small-molecule compounds using a novel artificial intelligence platform is necessary for drug discovery. Machine learning and a large language model on artificial intelligence (AI) tools improve the accuracy and shorten the time for new drug development. The primary goal of this research is to develop artificial intelligence (AI) computing models and novel deep learning architectures capable of predicting whether molecules can permeate the human blood-brain barrier (BBB). The in silico (computational) and in vitro (experimental) results were validated by the Natural Products Research Laboratories (NPRL) at China Medical University Hospital (CMUH). The transformer-based MegaMolBART was used as the simplified molecular input line entry system (SMILES) encoder with an XGBoost classifier as an in silico method to check if a molecule could cross through the BBB. We used Morgan or Circular fingerprints to apply the Morgan algorithm to a set of atomic invariants as a baseline encoder also with an XGBoost classifier to compare the results. BBB permeability was assessed in vitro using three-dimensional (3D) human BBB spheroids (human brain microvascular endothelial cells, brain vascular pericytes, and astrocytes). Using multiple BBB databases, the results of the final in silico transformer and XGBoost model achieved an area under the receiver operating characteristic curve of 0.88 on the held-out test dataset. Temozolomide (TMZ) and 21 randomly selected BBB permeable compounds (Pred scores = 1, indicating BBB-permeable) from the NPRL penetrated human BBB spheroid cells. No evidence suggests that ferulic acid or five BBB-impermeable compounds (Pred scores < 1.29423E-05, which designate compounds that pass through the human BBB) can pass through the spheroid cells of the BBB. Our validation of in vitro experiments indicated that the in silico prediction of small-molecule permeation in the BBB model is accurate. Transformer-based models like MegaMolBART, leveraging the SMILES representations of molecules, show great promise for applications in new drug discovery. These models have the potential to accelerate the development of novel targeted treatments for disorders of the central nervous system.
引用
收藏
页数:9
相关论文
共 85 条
  • [11] Targeting the central nervous system in lysosomal storage diseases: Strategies to deliver therapeutics across the blood-brain barrier
    Critchley, Bethan J.
    Gaspar, H. Bobby
    Benedetti, Sara
    [J]. MOLECULAR THERAPY, 2023, 31 (03) : 657 - 675
  • [12] Review of Machine Learning Techniques in Soft Tissue Biomechanics and Biomaterials
    Donmazov, Samir
    Saruhan, Eda Nur
    Pekkan, Kerem
    Piskin, Senol
    [J]. CARDIOVASCULAR ENGINEERING AND TECHNOLOGY, 2024, 15 (05) : 522 - 549
  • [13] CGRdb2.0: A Python']Python Database Management System for Molecules, Reactions, and Chemical Data
    Gimadiev, Timur
    Nugmanov, Ramil
    Khakimova, Aigul
    Fatykhova, Adeliya
    Madzhidov, Timur
    Sidorov, Pavel
    Varnek, Alexandre
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (09) : 2015 - 2020
  • [14] Autoencoders for sample size estimation for fully connected neural network classifiers
    Gulamali, Faris F. F.
    Sawant, Ashwin S. S.
    Kovatch, Patricia
    Glicksberg, Benjamin
    Charney, Alexander
    Nadkarni, Girish N. N.
    Oermann, Eric
    [J]. NPJ DIGITAL MEDICINE, 2022, 5 (01)
  • [15] Large-Scale Evaluation of Collision Cross Sections to Investigate Blood-Brain Barrier Permeation of Drugs
    Guntner, Armin Sebastian
    Boegl, Thomas
    Mlynek, Franz
    Buchberger, Wolfgang
    [J]. PHARMACEUTICS, 2021, 13 (12)
  • [16] Review of machine learning and deep learning models for toxicity prediction
    Guo, Wenjing
    Liu, Jie
    Dong, Fan
    Song, Meng
    Li, Zoe
    Khan, Md Kamrul Hasan
    Patterson, Tucker A.
    Hong, Huixiao
    [J]. EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 248 (21) : 1952 - 1973
  • [17] Neural Networks with Emotion Associations, Topic Modeling and Supervised Term Weighting for Sentiment Analysis
    Hajek, Petr
    Barushka, Aliaksandr
    Munk, Michal
    [J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2021, 31 (10)
  • [18] In vivo methods for imaging blood-brain barrier function and dysfunction
    Harris, William James
    Asselin, Marie-Claude
    Hinz, Rainer
    Parkes, Laura Michelle
    Allan, Stuart
    Schiessl, Ingo
    Boutin, Herve
    Dickie, Ben Robert
    [J]. EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2023, 50 (04) : 1051 - 1083
  • [19] ECG Heartbeat Classification Using Machine Learning and Metaheuristic Optimization for Smart Healthcare Systems
    Hassaballah, Mahmoud
    Wazery, Yaser M. M.
    Ibrahim, Ibrahim E. E.
    Farag, Aly
    [J]. BIOENGINEERING-BASEL, 2023, 10 (04):
  • [20] In Silico Target Analysis of Treatment for COVID-19 Using Huang-Lian-Shang-Qing-Wan, a Traditional Chinese Medicine Formula
    Huang, Ching-Wen
    Ha, Hai-Anh
    Tsai, Shih-Chang
    Lu, Chi-Cheng
    Lee, Chao-Ying
    Tsai, Yuh-Feng
    Tsai, Fuu-Jen
    Chiu, Yu-Jen
    Wang, Guo-Kai
    Hsu, Chung-Hua
    Yang, Jai-Sing
    [J]. NATURAL PRODUCT COMMUNICATIONS, 2021, 16 (10)