NeuralCodOpt: Codon optimization for the development of DNA vaccines

被引:0
作者
Chowdhury, Tapan [1 ]
Saha, Aishwarya [1 ]
Saha, Ananya [1 ]
Chakraborty, Arnab [1 ]
Das, Nibir [1 ]
机构
[1] Techno Main Salt Lake, Dept Comp Sci & Engn, EM 4-1,Sect 5, Kolkata 700091, West Bengal, India
关键词
Codon optimization; DNA; Codon; Adaptiveness index; Neural network; MAMMALIAN-CELLS; GENE-EXPRESSION; USAGE; PROTEIN;
D O I
10.1016/j.compbiolchem.2025.108377
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Inefficient gene translation, driven by organisms' codon preferences, is an emerging research area since this results in sluggish processes and diminished protein yields. Our research culminates in deriving efficient, optimized codon sequences by considering organism-specific Relative Codon Adaptiveness (RCA) ranges. In this research work, we have developed a novel algorithm, Neural Codon Optimization (NeuralCodOpt), to automate the process of codon optimization tailored to a specific organism and input sequence. Our algorithm has two main parts: the target Codon Adaptation Index generation using K-Means and the automation of sequence optimization using reinforcement learning. This algorithm has been tested across a set of 130 species, yielding highly optimal results that are quite significant compared to the previous works. NeuralCodOpt has shown a high accuracy of 86.7%, which would substantially contribute to Deoxyribonucleic Acid (DNA) vaccines by improving the efficiency of DNA expression vectors. These vectors are crucial in DNA vaccination and gene therapy as they enhance protein expression levels. By further incorporating it into plasmid construction, the translational efficiency of DNA vaccines will be significantly improved.
引用
收藏
页数:13
相关论文
共 37 条
  • [1] AKASHI H, 1994, GENETICS, V136, P927
  • [2] [Anonymous], 2021, Condon Usage Dataset
  • [3] Charter K, 2000, METMBS'00: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, VOLS I AND II, P239
  • [4] GPU Accelerated Drug Application on Signaling Pathways Containing Multiple Faults Using Boolean Networks
    Chowdhury, Tapan
    Chakraborty, Susanta
    Nandan, Argha
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) : 927 - 939
  • [5] An Efficient MapReduce-based Adaptive K-Means Clustering for Large Dataset
    Chowdhury, Tapan
    Mukherjee, Arijit
    Chakraborty, Susanta
    [J]. 2017 3RD IEEE INTERNATIONAL SYMPOSIUM ON NANOELECTRONIC AND INFORMATION SYSTEMS (INIS), 2017, : 157 - 162
  • [6] A synthetic E7 gene of human papillomavirus type 16 that yields enhanced expression of the protein in mammalian cells and is useful for DNA immunization studies
    Cid-Arregui, A
    Juárez, V
    zur Hausen, H
    [J]. JOURNAL OF VIROLOGY, 2003, 77 (08) : 4928 - 4937
  • [7] Dansbecker, 2018, Rectified Linear Units (RELU) in Deep Learning
  • [8] Dear S., 2001, Brief. Bioinform., V2, P405, DOI [10.1093/bib/2.4.405, DOI 10.1093/BIB/2.4.405]
  • [9] Codon optimization with deep learning to enhance protein expression
    Fu, Hongguang
    Liang, Yanbing
    Zhong, Xiuqin
    Pan, ZhiLing
    Huang, Lei
    Zhang, HaiLin
    Xu, Yang
    Zhou, Wei
    Liu, Zhong
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [10] Helper plasmids for production of HIV-1-derived vectors
    Fuller, M
    Anson, DS
    [J]. HUMAN GENE THERAPY, 2001, 12 (17) : 2081 - 2093