Exploring the Promoter Generation and Prediction of Halomonas spp. Based on GAN and Multi-Model Fusion Methods

被引:0
|
作者
Zhao, Cuihuan [1 ]
Guan, Yuying [1 ]
Yan, Shuan [2 ]
Li, Jiahang [3 ]
机构
[1] Tsinghua Univ, Sch Life Sci, Ctr Synthet & Syst Biol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Inst Publ Safety Res, Dept Engn Phys, Beijing 100084, Peoples R China
[3] Nankai Univ, Sch Math Sci, Tianjin 300071, Peoples R China
关键词
<italic>Halomonas</italic>; promoters; generative adversarial networks (GANs); multi-model fusion; quantile hit rate; SEQUENCE; STRENGTH; SIMILARITY; EXPRESSION;
D O I
10.3390/ijms252313137
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Promoters, as core elements in the regulation of gene expression, play a pivotal role in genetic engineering and synthetic biology. The accurate prediction and optimization of promoter strength are essential for advancing these fields. Here, we present the first promoter strength database tailored to Halomonas, an extremophilic microorganism, and propose a novel promoter design and prediction method based on generative adversarial networks (GANs) and multi-model fusion. The GAN model effectively learns the key features of Halomonas promoter sequences, such as the GC content and Moran's coefficients, to generate biologically plausible promoter sequences. To enhance prediction accuracy, we developed a multi-model fusion framework integrating deep learning and machine learning approaches. Deep learning models, incorporating BiLSTM and CNN architectures, capture k-mer and PSSM features, whereas machine learning models utilize engineered string and non-string features to construct comprehensive feature matrices for the multidimensional analysis and prediction of promoter strength. Using the proposed framework, newly generated promoters via mutation were predicted, and their functional validity was experimentally confirmed. The integration of multiple models significantly reduced the experimental validation space through an intersection-based strategy, achieving a notable improvement in top quantile prediction accuracy, particularly within the top five quantiles. The robustness and applicability of this model were further validated on diverse datasets, including test sets and out-of-sample promoters. This study not only introduces an innovative approach for promoter design and prediction in Halomonas but also lays a foundation for advancing industrial biotechnology. Additionally, the proposed strategy of GAN-based generation coupled with multi-model prediction demonstrates versatility, offering a valuable reference for promoter design and strength prediction in other extremophiles. Our findings highlight the promising synergy between artificial intelligence and synthetic biology, underscoring their profound academic and practical implications.
引用
收藏
页数:27
相关论文
共 50 条
  • [11] Prediction of CO2 Solubility in Ionic Liquids Based on Multi-Model Fusion Method
    Xia, Luyue
    Wang, Jiachen
    Liu, Shanshan
    Li, Zhuo
    Pan, Haitian
    PROCESSES, 2019, 7 (05)
  • [12] a CNN-Attention-LightGBM Arrester Defect Prediction Method based on Multi-model Fusion
    Sheng, Jizheng
    Liu, Xinmin
    Li, Bing
    Cui, Yang
    Zhu, Lei
    Zhang, Xiuping
    2023 6TH INTERNATIONAL CONFERENCE ON ELECTRONICS AND ELECTRICAL ENGINEERING TECHNOLOGY, EEET 2023, 2023, : 122 - 127
  • [13] Multi-step short-term wind speed prediction based on integrated multi-model fusion
    Tian, Zhongda
    Chen, Hao
    APPLIED ENERGY, 2021, 298
  • [14] Pathogenic virus detection method based on multi-model fusion
    Zhao, Xiaoyong
    Wang, Jingwei
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (CITS), 2020, : 89 - 92
  • [15] Research on Multi-Model Fusion for Multi-Indicator Collaborative Anomaly Prediction in IoT Devices
    Wang, Donghao
    Wang, Tengjiang
    Qi, Hao
    Liu, Shijun
    Pan, Li
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 2816 - 2821
  • [16] Multi-Model Fusion of Encoding Methods-Based Visual Words for Target Recognition in Infrared Images
    Nebili, Billel
    Khellal, Atmane
    Nemra, Abdelkrim
    Mascarilla, Laurent
    UNMANNED SYSTEMS, 2023, 11 (04) : 351 - 366
  • [17] Multi-Model Fusion Demand Forecasting Framework Based on Attention Mechanism
    Lei, Chunrui
    Zhang, Heng
    Wang, Zhigang
    Miao, Qiang
    PROCESSES, 2024, 12 (11)
  • [18] Research on Multi-model Fusion Algorithm for Image Dehazing Based on Attention Mechanism
    Cui, Tong
    Zhang, Meng
    Ge, Silin
    Chen, Xuhao
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT II, 2022, 13456 : 523 - 535
  • [19] Multi-Model Fusion-Based Hierarchical Extraction for Chinese Epidemic Event
    Zenghua Liao
    Zongqiang Yang
    Peixin Huang
    Ning Pang
    Xiang Zhao
    Data Science and Engineering, 2023, 8 : 73 - 83
  • [20] A fault diagnosis method for complex chemical process based on multi-model fusion
    He, Yadong
    Yang, Zhe
    Wang, Dong
    Gou, Chengdong
    Li, Chuankun
    Guo, Yian
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 2022, 184 : 662 - 677