Machine Learning Guides Peptide Nucleic Acid Flow Synthesis and Sequence Design

Cited by: 6
Authors
Li, Chengxi [1, 2, 3]
Zhang, Genwei [1]
Mohapatra, Somesh [4]
Callahan, Alex J. [1]
Loas, Andrei [1]
Gomez-Bombarelli, Rafael [4]
Pentelute, Bradley L. [1, 5, 6, 7]
Affiliations
[1] MIT, Dept Chem, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Zhejiang Univ, Coll Chem & Biol Engn, 866 Yuhangtang Rd, Hangzhou 310030, Zhejiang, Peoples R China
[3] ZJU Hangzhou Global Sci & Technol Innovat Ctr, 733 Jianshe San Rd, Hangzhou 311200, Zhejiang, Peoples R China
[4] MIT, Dept Mat Sci & Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] MIT, Koch Inst Integrat Canc Res, 500 Main St, Cambridge, MA 02142 USA
[6] MIT, Ctr Environm Hlth Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[7] Broad Inst MIT & Harvard, 415 Main St, Cambridge, MA 02142 USA
Keywords
automated synthesis; drug design; machine learning; peptide nucleic acid; yield prediction; DISCOVERY; PREDICTION; STABILITY
DOI
10.1002/advs.202201988
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Peptide nucleic acids (PNAs) are potential antisense therapies for genetic, acquired, and viral diseases. Efficiently selecting candidate PNA sequences for synthesis and evaluation from a genome containing hundreds to thousands of options can be challenging. To facilitate this process, this work leverages machine learning (ML) algorithms and automated synthesis technology to predict PNA synthesis efficiency and guide rational PNA sequence design. The training data is collected from individual fluorenylmethyloxycarbonyl (Fmoc) deprotection reactions performed on a fully automated PNA synthesizer. The optimized ML model allows for 93% prediction accuracy and 0.97 Pearson's r. The predicted synthesis scores are validated to be correlated with the experimental high-performance liquid chromatography (HPLC) crude purities (correlation coefficient R² = 0.95). Furthermore, the general applicability of ML is demonstrated through designing synthetically accessible antisense PNA sequences from 102,315 predicted candidates targeting exon 44 of the human dystrophin gene, SARS-CoV-2, HIV, as well as selected genes associated with cardiovascular diseases, type II diabetes, and various cancers. Collectively, ML provides an accurate prediction of PNA synthesis quality and serves as a useful computational tool for informing PNA sequence design.
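The abstract reports agreement between predicted and measured values as a Pearson's r and an R² value. As a minimal sketch of how such figures are commonly computed, the snippet below implements Pearson's r from its definition and squares it, which equals R² only under the assumption of a simple one-variable linear fit. The function name `pearson_r` and the sample numbers are hypothetical illustrations, not data or code from the paper.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (covariance up to a factor of n)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Root sums of squared deviations (standard deviations up to sqrt(n))
    dev_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    dev_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if dev_x == 0 or dev_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (dev_x * dev_y)


# Hypothetical values for illustration only (not from the paper):
# predicted synthesis scores and measured HPLC crude purities (%).
predicted = [0.2, 0.4, 0.6, 0.8]
measured = [25.0, 41.0, 62.0, 79.0]

r = pearson_r(predicted, measured)
r_squared = r ** 2  # equals R^2 for a simple one-variable linear fit
print(f"r = {r:.3f}, r^2 = {r_squared:.3f}")
```

Whether the paper's R² was obtained this way or from a different regression procedure is not stated in the abstract.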
Pages: 9