Predicting blood-brain barrier permeability of molecules with a large language model and machine learning

被引:12
作者
Huang, Eddie T. C. [1 ]
Yang, Jai-Sing [2 ]
Liao, Ken Y. K. [1 ]
Tseng, Warren C. W. [1 ]
Lee, C. K. [1 ]
Gill, Michelle [1 ]
Compas, Colin [1 ]
See, Simon [1 ]
Tsai, Fuu-Jen [3 ,4 ]
机构
[1] NVIDIA Corp, NVIDIA AI Technol Ctr, Santa Clara, CA USA
[2] China Med Univ, China Med Univ Hosp, Dept Med Res, Taichung, Taiwan
[3] China Med Univ, Childrens Hosp, Coll Chinese Med, Sch Chinese Med, 2 Yude Rd, Taichung 404332, Taiwan
[4] China Med Univ, Childrens Hosp, Taichung, Taiwan
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Blood-brain barrier (BBB) permeability; Machine learning; Artificial intelligence (AI); Natural Products Research Laboratories (NPRL); IN-SILICO PREDICTION; VALIDATION; PLASMA; PENETRATION; TRANSFORMER; INFORMATION; DISCOVERY; TOOLS;
D O I
10.1038/s41598-024-66897-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Predicting the blood-brain barrier (BBB) permeability of small-molecule compounds using a novel artificial intelligence platform is necessary for drug discovery. Machine learning and a large language model on artificial intelligence (AI) tools improve the accuracy and shorten the time for new drug development. The primary goal of this research is to develop artificial intelligence (AI) computing models and novel deep learning architectures capable of predicting whether molecules can permeate the human blood-brain barrier (BBB). The in silico (computational) and in vitro (experimental) results were validated by the Natural Products Research Laboratories (NPRL) at China Medical University Hospital (CMUH). The transformer-based MegaMolBART was used as the simplified molecular input line entry system (SMILES) encoder with an XGBoost classifier as an in silico method to check if a molecule could cross through the BBB. We used Morgan or Circular fingerprints to apply the Morgan algorithm to a set of atomic invariants as a baseline encoder also with an XGBoost classifier to compare the results. BBB permeability was assessed in vitro using three-dimensional (3D) human BBB spheroids (human brain microvascular endothelial cells, brain vascular pericytes, and astrocytes). Using multiple BBB databases, the results of the final in silico transformer and XGBoost model achieved an area under the receiver operating characteristic curve of 0.88 on the held-out test dataset. Temozolomide (TMZ) and 21 randomly selected BBB permeable compounds (Pred scores = 1, indicating BBB-permeable) from the NPRL penetrated human BBB spheroid cells. No evidence suggests that ferulic acid or five BBB-impermeable compounds (Pred scores < 1.29423E-05, which designate compounds that pass through the human BBB) can pass through the spheroid cells of the BBB. Our validation of in vitro experiments indicated that the in silico prediction of small-molecule permeation in the BBB model is accurate. Transformer-based models like MegaMolBART, leveraging the SMILES representations of molecules, show great promise for applications in new drug discovery. These models have the potential to accelerate the development of novel targeted treatments for disorders of the central nervous system.
引用
收藏
页数:9
相关论文
共 85 条
  • [1] NanoSolveIT Project: Driving nanoinformatics research to develop innovative and integrated tools for in silico nanosafety assessment
    Afantitis, Antreas
    Melagraki, Georgia
    Isigonis, Panagiotis
    Tsoumanis, Andreas
    Varsou, Dimitra Danai
    Valsami-Jones, Eugenia
    Papadiamantis, Anastasios
    Ellis, Laura-Jayne A.
    Sarimveis, Haralambos
    Doganis, Philip
    Karatzas, Pantelis
    Tsiros, Periklis
    Liampa, Irene
    Lobaskin, Vladimir
    Greco, Dario
    Serra, Angela
    Kinaret, Pia Anneli Sofia
    Saarimaki, Laura Aliisa
    Grafstrom, Roland
    Kohonen, Pekka
    Nymark, Penny
    Willighagen, Egon
    Puzyn, Tomasz
    Rybinska-Fryca, Anna
    Lyubartsev, Alexander
    Jensen, Keld Alstrup
    Brandenburg, Jan Gerit
    Lofts, Stephen
    Svendsen, Claus
    Harrison, Samuel
    Maier, Dieter
    Tamm, Kaido
    Janes, Jaak
    Sikk, Lauri
    Dusinska, Maria
    Longhin, Eleonora
    Runden-Pran, Elise
    Mariussen, Espen
    El Yamani, Naouale
    Unger, Wolfgang
    Radnik, Joerg
    Tropsha, Alexander
    Cohen, Yoram
    Leszczynski, Jerzy
    Hendren, Christine Ogilvie
    Wiesner, Mark
    Winkler, David
    Suzuki, Noriyuki
    Yoon, Tae Hyun
    Choi, Jang-Sik
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 583 - 602
  • [2] Clinical Context-Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation
    Afzal, Muhammad
    Alam, Fakhare
    Malik, Khalid Mahmood
    Malik, Ghaus M.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (10)
  • [3] Photobiomodulation in Alzheimer's Disease-A Complementary Method to State-of-the-Art Pharmaceutical Formulations and Nanomedicine?
    Ailioaie, Laura Marinela
    Ailioaie, Constantin
    Litscher, Gerhard
    [J]. PHARMACEUTICS, 2023, 15 (03)
  • [4] Prediction of ICU Patients' Deterioration Using Machine Learning Techniques
    Aldhoayan, Mohammed D.
    Aljubran, Yosra
    [J]. CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (05)
  • [5] Randomized SMILES strings improve the quality of molecular generative models
    Arus-Pous, Josep
    Johansson, Simon Viet
    Prykhodko, Oleksii
    Bjerrum, Esben Jannik
    Tyrchan, Christian
    Reymond, Jean-Louis
    Chen, Hongming
    Engkvist, Ola
    [J]. JOURNAL OF CHEMINFORMATICS, 2019, 11 (01)
  • [6] Multitask Quantum Study of the Curcumin-Based Complex Physicochemical and Biological Properties
    Baira, Kaouther
    Ounissi, Ali
    Merouani, Hafida
    Alam, Manawwer
    Ouddai, Nadia
    Erto, Alessandro
    Yadav, Krishna Kumar
    Islam, Saiful
    Cheon, Ji-Kwang
    Jeon, Byong-Hun
    Benguerba, Yacine
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (05)
  • [7] Bohlmann Aaron, 2021, JMIRx Med, V2, pe26993, DOI 10.2196/26993
  • [8] A review on machine learning approaches and trends in drug discovery
    Carracedo-Reboredo, Paula
    Linares-Blanco, Jose
    Rodriguez-Fernandez, Nereida
    Cedron, Francisco
    Novoa, Francisco J.
    Carballal, Adrian
    Maojo, Victor
    Pazos, Alejandro
    Fernandez-Lozano, Carlos
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 4538 - 4558
  • [9] In silico prediction of unbound brain-to-plasma concentration ratio using machine learning algorithms
    Chen, Hongming
    Winiwarter, Susanne
    Friden, Markus
    Antonsson, Madeleine
    Engkvist, Ola
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2011, 29 (08) : 985 - 995
  • [10] Next-generation sequencing analysis reveals that MTH-3, a novel curcuminoid derivative, suppresses the invasion of MDA-MB-231 triple-negative breast adenocarcinoma cells
    Chiu, Yu-Jen
    Tsai, Fuu-Jen
    Bau, Da-Tian
    Chang, Ling-Chu
    Hsieh, Min-Tsang
    Lu, Chi-Cheng
    Kuo, Sheng-Chu
    Yang, Jai-Sing
    [J]. ONCOLOGY REPORTS, 2021, 46 (01)