Exploring the Promoter Generation and Prediction of Halomonas spp. Based on GAN and Multi-Model Fusion Methods

被引:0
|
作者
Zhao, Cuihuan [1 ]
Guan, Yuying [1 ]
Yan, Shuan [2 ]
Li, Jiahang [3 ]
机构
[1] Tsinghua Univ, Sch Life Sci, Ctr Synthet & Syst Biol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Inst Publ Safety Res, Dept Engn Phys, Beijing 100084, Peoples R China
[3] Nankai Univ, Sch Math Sci, Tianjin 300071, Peoples R China
关键词
<italic>Halomonas</italic>; promoters; generative adversarial networks (GANs); multi-model fusion; quantile hit rate; SEQUENCE; STRENGTH; SIMILARITY; EXPRESSION;
D O I
10.3390/ijms252313137
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Promoters, as core elements in the regulation of gene expression, play a pivotal role in genetic engineering and synthetic biology. The accurate prediction and optimization of promoter strength are essential for advancing these fields. Here, we present the first promoter strength database tailored to Halomonas, an extremophilic microorganism, and propose a novel promoter design and prediction method based on generative adversarial networks (GANs) and multi-model fusion. The GAN model effectively learns the key features of Halomonas promoter sequences, such as the GC content and Moran's coefficients, to generate biologically plausible promoter sequences. To enhance prediction accuracy, we developed a multi-model fusion framework integrating deep learning and machine learning approaches. Deep learning models, incorporating BiLSTM and CNN architectures, capture k-mer and PSSM features, whereas machine learning models utilize engineered string and non-string features to construct comprehensive feature matrices for the multidimensional analysis and prediction of promoter strength. Using the proposed framework, newly generated promoters via mutation were predicted, and their functional validity was experimentally confirmed. The integration of multiple models significantly reduced the experimental validation space through an intersection-based strategy, achieving a notable improvement in top quantile prediction accuracy, particularly within the top five quantiles. The robustness and applicability of this model were further validated on diverse datasets, including test sets and out-of-sample promoters. This study not only introduces an innovative approach for promoter design and prediction in Halomonas but also lays a foundation for advancing industrial biotechnology. Additionally, the proposed strategy of GAN-based generation coupled with multi-model prediction demonstrates versatility, offering a valuable reference for promoter design and strength prediction in other extremophiles. Our findings highlight the promising synergy between artificial intelligence and synthetic biology, underscoring their profound academic and practical implications.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] Multi-model fusion modeling method based on improved Kalman filtering algorithm
    Zhu, Pengfei
    Xia, Luyue
    Pan, Haitian
    Huagong Xuebao/CIESC Journal, 2015, 66 (04): : 1388 - 1394
  • [22] Multi-Model Fusion-Based Hierarchical Extraction for Chinese Epidemic Event
    Liao, Zenghua
    Yang, Zongqiang
    Huang, Peixin
    Pang, Ning
    Zhao, Xiang
    DATA SCIENCE AND ENGINEERING, 2023, 8 (01) : 73 - 83
  • [23] Data-driven prediction of building energy consumption using an adaptive multi-model fusion approach
    Lin, Penghui
    Zhang, Limao
    Zuo, Jian
    APPLIED SOFT COMPUTING, 2022, 129
  • [24] Automatic Retinal Vessel Segmentation Based on Multi-Model Fusion and Region Iterative Growth
    Lai X.-B.
    Xu M.-S.
    Xu X.-M.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (12): : 2611 - 2621
  • [25] An effective multi-model fusion method for EEG-based sleep stage classification
    An, Panfeng
    Yuan, Zhiyong
    Zhao, Jianhui
    Jiang, Xue
    Du, Bo
    KNOWLEDGE-BASED SYSTEMS, 2021, 219
  • [26] A digital twin approach for gas turbine performance based on deep multi-model fusion
    Zhang, Jingkai
    Wang, Zhitao
    Li, Shuying
    Wei, Pengfei
    APPLIED THERMAL ENGINEERING, 2024, 246
  • [27] Ultra-short term wind power prediction based on quadratic variational mode decomposition and multi-model fusion of deep learning
    Chen, Changqing
    Li, Shichun
    Wen, Ming
    Yu, Zongchao
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 116
  • [28] Application of a Multi-Model Fusion Forecasting Approach in Runoff Prediction: A Case Study of the Yangtze River Source Region
    Wang, Tingqi
    Guo, Yuting
    Evgenievna, Mazina Svetlana
    Wu, Zhenjiang
    SUSTAINABILITY, 2024, 16 (14)
  • [29] A hybrid deep learning technique based integrated multi-model data fusion for forensic investigation
    Senthil, P.
    Selvakumar, S.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6849 - 6862
  • [30] Research on load frequency control system attack detection method based on multi-model fusion
    Feng Zheng
    Weixun Li
    Huifeng Li
    Libo Yang
    Zengjie Sun
    Energy Informatics, 8 (1)