Applications of machine learning in microbial natural product drug discovery

被引:5
作者
Arnold, Autumn [1 ,2 ,3 ]
Alexander, Jeremie [1 ,2 ,3 ]
Liu, Gary [1 ,2 ,3 ]
Stokes, Jonathan M. [1 ,2 ,3 ]
机构
[1] McMaster Univ, Dept Biochem & Biomed Sci, Hamilton, ON, Canada
[2] McMaster Univ, Michael G DeGroote Inst Infect Dis Res, Hamilton, ON, Canada
[3] McMaster Univ, David Braley Ctr Antibiot Discovery, Hamilton, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Artificial Intelligence; Machine Learning; Natural Products; Drug Discovery; Genome Mining; Dereplication; Target Prediction; BIOSYNTHETIC GENE CLUSTERS; COMPLETE GENOME SEQUENCE; TARGET IDENTIFICATION; DATABASE; DEREPLICATION; REPRESENTATION; ANNOTATION; MOLECULES; LIBRARIES; INSIGHTS;
D O I
10.1080/17460441.2023.2251400
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
IntroductionNatural products (NPs) are a desirable source of new therapeutics due to their structural diversity and evolutionarily optimized bioactivities. NPs and their derivatives account for roughly 70% of approved pharmaceuticals. However, the rate at which novel NPs are discovered has decreased. To accelerate the microbial NP discovery process, machine learning (ML) is being applied to numerous areas of NP discovery and development.Areas coveredThis review explores the utility of ML at various phases of the microbial NP drug discovery pipeline, discussing concrete examples throughout each major phase: genome mining, dereplication, and biological target prediction. Moreover, the authors discuss how ML approaches can be applied to semi-synthetic approaches to drug discovery.Expert opinionDespite the important role that microbial NPs play in the development of novel drugs, their discovery has declined due to challenges associated with the conventional discovery process. ML is positioned to overcome these limitations given its ability to model complex datasets and generalize to novel chemical and sequence space. Unsurprisingly, ML comes with its own limitations that must be considered for its successful implementation. The authors stress the importance of continuing to build high quality and open access NP datasets to further increase the utility of ML in NP discovery.
引用
收藏
页码:1259 / 1272
页数:14
相关论文
共 127 条
  • [1] Computational/in silico methods in drug target and lead prediction
    Agamah, Francis E.
    Mazandu, Gaston K.
    Hassan, Radia
    Bope, Christian D.
    Thomford, Nicholas E.
    Ghansah, Anita
    Chimusa, Emile R.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1663 - 1675
  • [2] Natural products in drug discovery: advances and opportunities
    Atanasov, Atanas G.
    Zotchev, Sergey B.
    Dirsch, Verena M.
    Supuran, Claudiu T.
    [J]. NATURE REVIEWS DRUG DISCOVERY, 2021, 20 (03) : 200 - 216
  • [3] Natural product drug discovery in the genomic era: realities, conjectures, misconceptions, and opportunities
    Baltz, Richard H.
    [J]. JOURNAL OF INDUSTRIAL MICROBIOLOGY & BIOTECHNOLOGY, 2019, 46 (3-4) : 281 - 299
  • [4] Bateman A, 2002, NUCLEIC ACIDS RES, V30, P276, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
  • [5] Expanding the range of 'druggable' targets with natural product-based libraries: an academic perspective
    Bauer, Renato A.
    Wurst, Jacqueline M.
    Tan, Derek S.
    [J]. CURRENT OPINION IN CHEMICAL BIOLOGY, 2010, 14 (03) : 308 - 314
  • [6] Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2)
    Bentley, SD
    Chater, KF
    Cerdeño-Tárraga, AM
    Challis, GL
    Thomson, NR
    James, KD
    Harris, DE
    Quail, MA
    Kieser, H
    Harper, D
    Bateman, A
    Brown, S
    Chandra, G
    Chen, CW
    Collins, M
    Cronin, A
    Fraser, A
    Goble, A
    Hidalgo, J
    Hornsby, T
    Howarth, S
    Huang, CH
    Kieser, T
    Larke, L
    Murphy, L
    Oliver, K
    O'Neil, S
    Rabbinowitsch, E
    Rajandream, MA
    Rutherford, K
    Rutter, S
    Seeger, K
    Saunders, D
    Sharp, S
    Squares, R
    Squares, S
    Taylor, K
    Warren, T
    Wietzorrek, A
    Woodward, J
    Barrell, BG
    Parkhill, J
    Hopwood, DA
    [J]. NATURE, 2002, 417 (6885) : 141 - 147
  • [7] Beutler John A, 2019, Curr Protoc Pharmacol, V86, pe67, DOI 10.1002/cpph.67
  • [8] Fully Automated Unconstrained Analysis of High-Resolution Mass Spectrometry Data with Machine Learning
    Boiko, Daniil A.
    Kozlov, Konstantin S.
    V. Burykina, Julia
    Ilyushenkova, Valentina V.
    Ananikov, Valentine P.
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2022, 144 (32) : 14590 - 14606
  • [9] Microbial community drivers of PK/NRP gene diversity in selected global soils
    Borsetto, Chiara
    Amos, Gregory C. A.
    da Rocha, Ulisses Nunes
    Mitchell, Alex L.
    Finn, Robert D.
    Laidi, Rabah Forar
    Vallin, Carlos
    Pearce, David A.
    Newsham, Kevin K.
    Wellington, Elizabeth M. H.
    [J]. MICROBIOME, 2019, 7
  • [10] Target identification of natural medicine with chemical proteomics approach: probe synthesis, target fishing and protein identification
    Chen, Xiao
    Wang, Yutong
    Ma, Nan
    Tian, Jing
    Shao, Yurou
    Zhu, Bo
    Wong, Yin Kwan
    Liang, Zhen
    Zou, Chang
    Wang, Jigang
    [J]. SIGNAL TRANSDUCTION AND TARGETED THERAPY, 2020, 5 (01)