Machine Learning Guides Peptide Nucleic Acid Flow Synthesis and Sequence Design

被引:6
|
作者
Li, Chengxi [1 ,2 ,3 ]
Zhang, Genwei [1 ]
Mohapatra, Somesh [4 ]
Callahan, Alex J. [1 ]
Loas, Andrei [1 ]
Gomez-Bombarelli, Rafael [4 ]
Pentelute, Bradley L. [1 ,5 ,6 ,7 ]
机构
[1] MIT, Dept Chem, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Zhejiang Univ, Coll Chem & Biol Engn, 866 Yuhangtang Rd, Hangzhou 310030, Zhejiang, Peoples R China
[3] ZJU Hangzhou Global Sci & Technol Innovat Ctr, 733 Jianshe San Rd, Hangzhou 311200, Zhejiang, Peoples R China
[4] MIT, Dept Mat Sci & Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] MIT, Koch Inst Integrat Canc Res, 500 Main St, Cambridge, MA 02142 USA
[6] MIT, Ctr Environm Hlth Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[7] Broad Inst MIT & Harvard, 415 Main St, Cambridge, MA 02142 USA
关键词
automated synthesis; drug design; machine learning; peptide nucleic acid; yield prediction; DISCOVERY; PREDICTION; STABILITY;
D O I
10.1002/advs.202201988
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Peptide nucleic acids (PNAs) are potential antisense therapies for genetic, acquired, and viral diseases. Efficiently selecting candidate PNA sequences for synthesis and evaluation from a genome containing hundreds to thousands of options can be challenging. To facilitate this process, this work leverages machine learning (ML) algorithms and automated synthesis technology to predict PNA synthesis efficiency and guide rational PNA sequence design. The training data is collected from individual fluorenylmethyloxycarbonyl (Fmoc) deprotection reactions performed on a fully automated PNA synthesizer. The optimized ML model allows for 93% prediction accuracy and 0.97 Pearson's r. The predicted synthesis scores are validated to be correlated with the experimental high-performance liquid chromatography (HPLC) crude purities (correlation coefficient R-2 = 0.95). Furthermore, a general applicability of ML is demonstrated through designing synthetically accessible antisense PNA sequences from 102 315 predicted candidates targeting exon 44 of the human dystrophin gene, SARS-CoV-2, HIV, as well as selected genes associated with cardiovascular diseases, type II diabetes, and various cancers. Collectively, ML provides an accurate prediction of PNA synthesis quality and serves as a useful computational tool for informing PNA sequence design.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Peptide nucleic acid synthesis by novel amide formation
    Lee, Hyunil
    Jeon, Jae Hoon
    Lim, Jong Chan
    Choi, Hoon
    Yoon, Yeohong
    Kim, Sung Kee
    ORGANIC LETTERS, 2007, 9 (17) : 3291 - 3293
  • [22] ROLE OF NUCLEIC ACID AND ENZYMES IN PEPTIDE CHAIN SYNTHESIS
    DOUNCE, AL
    MORRISON, M
    MONTY, KJ
    NATURE, 1955, 176 (4482) : 597 - 598
  • [23] Synthesis and properties of nucleic acid analogues with peptide backbone
    Fujii, M
    Yoshida, K
    Shimada, T
    Yamamoto, K
    PEPTIDE CHEMISTRY 1995, 1996, : 529 - 532
  • [24] Photolabile peptide nucleic acid monomers: Synthesis and photodeprotection
    Bhushan, Kumar R.
    SYNLETT, 2006, (13) : 2130 - 2132
  • [25] Peptide nucleic acid (PNA): Synthesis, modifications and applications
    Chu, Z
    Liu, KL
    CHINESE JOURNAL OF ORGANIC CHEMISTRY, 2005, 25 (03) : 254 - 263
  • [26] Progress towards a submonomer synthesis of peptide nucleic acid
    Hudson, RHE
    Viirre, RD
    NUCLEOSIDES NUCLEOTIDES & NUCLEIC ACIDS, 2003, 22 (5-8): : 1017 - 1022
  • [27] Synthesis and hybridization of ferrocenyl peptide nucleic acid.
    Campbell, NF
    Keen, JN
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 227 : U90 - U90
  • [28] Peptide nucleic acid conjugates: Synthesis, properties and applications
    Zhilina, ZV
    Ziemba, AJ
    Ebbinghaus, SW
    CURRENT TOPICS IN MEDICINAL CHEMISTRY, 2005, 5 (12) : 1119 - 1131
  • [29] Synthesis of an electrochemical peptide nucleic acid probe.
    Robinson, AC
    Jivarajan, S
    Campbell, NF
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2002, 223 : U218 - U218
  • [30] New peptide nucleic acid analogues: synthesis and applications
    Pensato, Soccorsa
    Saviano, Michele
    Romanelli, Alessandra
    EXPERT OPINION ON BIOLOGICAL THERAPY, 2007, 7 (08) : 1219 - 1232