Exploring the Promoter Generation and Prediction of Halomonas spp. Based on GAN and Multi-Model Fusion Methods

被引:0
作者
Zhao, Cuihuan [1 ]
Guan, Yuying [1 ]
Yan, Shuan [2 ]
Li, Jiahang [3 ]
机构
[1] Tsinghua Univ, Sch Life Sci, Ctr Synthet & Syst Biol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Inst Publ Safety Res, Dept Engn Phys, Beijing 100084, Peoples R China
[3] Nankai Univ, Sch Math Sci, Tianjin 300071, Peoples R China
关键词
<italic>Halomonas</italic>; promoters; generative adversarial networks (GANs); multi-model fusion; quantile hit rate; SEQUENCE; STRENGTH; SIMILARITY; EXPRESSION;
D O I
10.3390/ijms252313137
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Promoters, as core elements in the regulation of gene expression, play a pivotal role in genetic engineering and synthetic biology. The accurate prediction and optimization of promoter strength are essential for advancing these fields. Here, we present the first promoter strength database tailored to Halomonas, an extremophilic microorganism, and propose a novel promoter design and prediction method based on generative adversarial networks (GANs) and multi-model fusion. The GAN model effectively learns the key features of Halomonas promoter sequences, such as the GC content and Moran's coefficients, to generate biologically plausible promoter sequences. To enhance prediction accuracy, we developed a multi-model fusion framework integrating deep learning and machine learning approaches. Deep learning models, incorporating BiLSTM and CNN architectures, capture k-mer and PSSM features, whereas machine learning models utilize engineered string and non-string features to construct comprehensive feature matrices for the multidimensional analysis and prediction of promoter strength. Using the proposed framework, newly generated promoters via mutation were predicted, and their functional validity was experimentally confirmed. The integration of multiple models significantly reduced the experimental validation space through an intersection-based strategy, achieving a notable improvement in top quantile prediction accuracy, particularly within the top five quantiles. The robustness and applicability of this model were further validated on diverse datasets, including test sets and out-of-sample promoters. This study not only introduces an innovative approach for promoter design and prediction in Halomonas but also lays a foundation for advancing industrial biotechnology. Additionally, the proposed strategy of GAN-based generation coupled with multi-model prediction demonstrates versatility, offering a valuable reference for promoter design and strength prediction in other extremophiles. Our findings highlight the promising synergy between artificial intelligence and synthetic biology, underscoring their profound academic and practical implications.
引用
收藏
页数:27
相关论文
共 50 条
  • [31] Early warning of drillstring faulty conditions based on multi-model fusion in geological drilling processes
    Li, Yupeng
    Cao, Weihua
    Gopaluni, R. Bhushan
    Hu, Wenkai
    Wu, Min
    JOURNAL OF PROCESS CONTROL, 2023, 126 : 26 - 35
  • [32] A hybrid deep learning technique based integrated multi-model data fusion for forensic investigation
    Senthil P.
    Selvakumar S.
    Journal of Intelligent and Fuzzy Systems, 2022, 43 (05) : 6849 - 6862
  • [33] Civil aviation safety risk intelligent early warning model based on text mining and multi-model fusion
    Hou, Zhaoguo
    Xiong, Minglan
    Wang, Huawei
    Lv, Shaolan
    Chen, Lingzi
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2023, 237 (10) : 2402 - 2427
  • [34] Research on multi-model prediction of skeleton curves of prefabricated concrete columns based on Residual fusion Long Short-Term Memory -Transformer
    Zhang, Wangxi
    Yan, Baoqi
    Yi, Weijian
    JOURNAL OF BUILDING ENGINEERING, 2023, 79
  • [35] Research on vacuum glass insulation performance prediction based on unsteady state multivariate data screening and multi-model fusion self-optimization
    Li, Xiaoling
    Wang, Yuanqi
    Zhou, Fuquan
    Wang, Lei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [36] A multi-feature-based multi-model fusion method for state of health estimation of lithium-ion batteries
    Lin, Mingqiang
    Wu, Denggao
    Meng, Jinhao
    Wu, Ji
    Wu, Haitao
    JOURNAL OF POWER SOURCES, 2022, 518
  • [37] TAL-SRX: an intelligent typing evaluation method for KASP primers based on multi-model fusion
    Chen, Xiaojing
    Fan, Jingchao
    Yan, Shen
    Huang, Longyu
    Zhou, Guomin
    Zhang, Jianhua
    FRONTIERS IN PLANT SCIENCE, 2025, 16
  • [38] High-accuracy and robust object tracking based on multi-model fusion and re-detection
    Bai Z.
    Zhu L.
    Li Z.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2019, 40 (09): : 132 - 141
  • [39] RETRACTED: Research on online marketing effects based on multi-model fusion and artificial intelligence algorithms (Retracted Article)
    Zhao, Rong
    Cai, Yangtian
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (Suppl 1) : 105 - 105
  • [40] Contactless blood oxygen estimation from face videos: A multi-model fusion method based on deep learning
    Hu, Min
    Wu, Xia
    Wang, Xiaohua
    Xing, Yan
    An, Ning
    Shi, Piao
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81