From intuition to AI: evolution of small molecule representations in drug discovery

被引:15
作者
Mcgibbon, Miles
Shave, Steven
Dong, Jie
Gao, Yumiao
Houston, Douglas R.
Xie, Jiancong
Yang, Yuedong
Schwaller, Philippe
Blay, Vincent
机构
关键词
drug discovery; autoencoders; transformers; artificial intelligence; SMILES; machine learning; AVAILABLE [!text type='PYTHON']PYTHON[!/text] PACKAGE; ELECTROTOPOLOGICAL-STATE; STRUCTURAL DESCRIPTORS; SIMILARITY; CHEMISTRY; DESIGN; CHEMOINFORMATICS;
D O I
10.1093/bib/bbad422
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Within drug discovery, the goal of AI scientists and cheminformaticians is to help identify molecular starting points that will develop into safe and efficacious drugs while reducing costs, time and failure rates. To achieve this goal, it is crucial to represent molecules in a digital format that makes them machine-readable and facilitates the accurate prediction of properties that drive decision-making. Over the years, molecular representations have evolved from intuitive and human-readable formats to bespoke numerical descriptors and fingerprints, and now to learned representations that capture patterns and salient features across vast chemical spaces. Among these, sequence-based and graph-based representations of small molecules have become highly popular. However, each approach has strengths and weaknesses across dimensions such as generality, computational cost, inversibility for generative applications and interpretability, which can be critical in informing practitioners' decisions. As the drug discovery landscape evolves, opportunities for innovation continue to emerge. These include the creation of molecular representations for high-value, low-data regimes, the distillation of broader biological and chemical knowledge into novel learned representations and the modeling of up-and-coming therapeutic modalities.
引用
收藏
页数:13
相关论文
共 146 条
  • [1] Achiam OJ, 2023, Arxiv, DOI [arXiv:2303.08774, 10.48550/arXiv.2303.08774]
  • [2] [Anonymous], MegaMolBart: Generally Applicable Chemical AI Models with Large-Scale Pretrained Transformers
  • [3] Natural products in drug discovery: advances and opportunities
    Atanasov, Atanas G.
    Zotchev, Sergey B.
    Dirsch, Verena M.
    Supuran, Claudiu T.
    [J]. NATURE REVIEWS DRUG DISCOVERY, 2021, 20 (03) : 200 - 216
  • [4] Δ-Quantum machine-learning for medicinal chemistry
    Atz, Kenneth
    Isert, Clemens
    Boecker, Markus N. A.
    Jimenez-Luna, Jose
    Schneider, Gisbert
    [J]. PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2022, 24 (18) : 10775 - 10783
  • [5] Predicting synthetic accessibility: Application in drug discovery and development
    Baber, JC
    Feher, M
    [J]. MINI-REVIEWS IN MEDICINAL CHEMISTRY, 2004, 4 (06) : 681 - 692
  • [6] Bachas S., 2022, Antibody optimization enabled by artificial intelligence predictions of binding affinity and naturalness, V2022, DOI [10.1101/2022.08.16.504181v1, DOI 10.1101/2022.08.16.504181V1]
  • [7] PROTAC targeted protein degraders: the past is prologue
    Bekes, Miklos
    Langley, David R.
    Crews, Craig M.
    [J]. NATURE REVIEWS DRUG DISCOVERY, 2022, 21 (03) : 181 - 200
  • [8] Molecular similarity: a key technique in molecular informatics
    Bender, A
    Glen, RC
    [J]. ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (22) : 3204 - 3218
  • [9] Artificial intelligence in drug discovery: what is realistic, what are illusions? Part 2: a discussion of chemical and biological data
    Bender, Andreas
    Cortes-Ciriano, Isidro
    [J]. DRUG DISCOVERY TODAY, 2021, 26 (04) : 1040 - 1052
  • [10] Bickerton GR, 2012, NAT CHEM, V4, P90, DOI [10.1038/NCHEM.1243, 10.1038/nchem.1243]