Exploring the Promoter Generation and Prediction of Halomonas spp. Based on GAN and Multi-Model Fusion Methods

被引:0
作者
Zhao, Cuihuan [1 ]
Guan, Yuying [1 ]
Yan, Shuan [2 ]
Li, Jiahang [3 ]
机构
[1] Tsinghua Univ, Sch Life Sci, Ctr Synthet & Syst Biol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Inst Publ Safety Res, Dept Engn Phys, Beijing 100084, Peoples R China
[3] Nankai Univ, Sch Math Sci, Tianjin 300071, Peoples R China
关键词
<italic>Halomonas</italic>; promoters; generative adversarial networks (GANs); multi-model fusion; quantile hit rate; SEQUENCE; STRENGTH; SIMILARITY; EXPRESSION;
D O I
10.3390/ijms252313137
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Promoters, as core elements in the regulation of gene expression, play a pivotal role in genetic engineering and synthetic biology. The accurate prediction and optimization of promoter strength are essential for advancing these fields. Here, we present the first promoter strength database tailored to Halomonas, an extremophilic microorganism, and propose a novel promoter design and prediction method based on generative adversarial networks (GANs) and multi-model fusion. The GAN model effectively learns the key features of Halomonas promoter sequences, such as the GC content and Moran's coefficients, to generate biologically plausible promoter sequences. To enhance prediction accuracy, we developed a multi-model fusion framework integrating deep learning and machine learning approaches. Deep learning models, incorporating BiLSTM and CNN architectures, capture k-mer and PSSM features, whereas machine learning models utilize engineered string and non-string features to construct comprehensive feature matrices for the multidimensional analysis and prediction of promoter strength. Using the proposed framework, newly generated promoters via mutation were predicted, and their functional validity was experimentally confirmed. The integration of multiple models significantly reduced the experimental validation space through an intersection-based strategy, achieving a notable improvement in top quantile prediction accuracy, particularly within the top five quantiles. The robustness and applicability of this model were further validated on diverse datasets, including test sets and out-of-sample promoters. This study not only introduces an innovative approach for promoter design and prediction in Halomonas but also lays a foundation for advancing industrial biotechnology. Additionally, the proposed strategy of GAN-based generation coupled with multi-model prediction demonstrates versatility, offering a valuable reference for promoter design and strength prediction in other extremophiles. Our findings highlight the promising synergy between artificial intelligence and synthetic biology, underscoring their profound academic and practical implications.
引用
收藏
页数:27
相关论文
共 50 条
  • [41] Short-term load forecasting based on different characteristics of sub-sequences and multi-model fusion
    Chen, Changqing
    Yang, Xian
    Dai, Xueying
    Chen, Lisi
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120
  • [42] A feature reuse based multi-model fusion method for state of health estimation of lithium-ion batteries
    Bai, Junqi
    Huang, Jiayin
    Luo, Kai
    Yang, Fan
    Xian, Yanhua
    JOURNAL OF ENERGY STORAGE, 2023, 70
  • [43] Fault diagnosis of HVAC system sensors: A method based on Box-Cox transformation and multi-model fusion
    Tang, Junhao
    You, Yuwen
    Zhao, Yuan
    Guo, Chunmei
    Li, Zhe
    Yang, Bin
    ENERGY REPORTS, 2025, 13 : 3489 - 3503
  • [44] Intelligent machine learning-based multi-model fusion monitoring: application to industrial physio-chemical systems
    Ali, Husnain
    Safdar, Rizwan
    Ding, Weilong
    Zhou, Yuanqiang
    Yao, Yuan
    Yao, Le
    Gao, Furong
    CONTROL ENGINEERING PRACTICE, 2025, 162
  • [45] A Multi-model Fusion Method Based on 0-1 Programming and Its Application in Early Warning of Coal Mill Failure
    Yang, Liu
    Zhai, Qiaozhu
    Wu, Yuxiang
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 4860 - 4865
  • [46] Multi-Model Fusion Short-Term Load Forecasting Based on Random Forest Feature Selection and Hybrid Neural Network
    Xuan, Yi
    Si, Weiguo
    Zhu, Jiong
    Sun, Zhiqing
    Zhao, Jian
    Xu, Mingjie
    Xu, Shouliang
    IEEE ACCESS, 2021, 9 : 69002 - 69009
  • [47] The left-behind human detection and tracking system based on vision with multi-model fusion and microwave radar inside the bus
    Liao, Jiacai
    Xiang, Guoliang
    Cao, Libo
    Xia, JiaHao
    Yue, Luyao
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2020, 234 (09) : 2342 - 2354
  • [48] Similarity-based multi-model ensemble approach for 1–15-day advance prediction of monsoon rainfall over India
    Neeru Jaiswal
    C. M. Kishtawal
    Swati Bhomia
    Theoretical and Applied Climatology, 2018, 132 : 639 - 645
  • [49] Similarity-based multi-model ensemble approach for 1-15-day advance prediction of monsoon rainfall over India
    Jaiswal, Neeru
    Kishtawal, C. M.
    Bhomia, Swati
    THEORETICAL AND APPLIED CLIMATOLOGY, 2018, 132 (1-2) : 639 - 645
  • [50] Silicon Carbide Surface Quality Prediction Based on Artificial Intelligence Methods on Multi-sensor Fusion Detection Test Platform
    Zhang, Yawei
    Li, Beizhi
    Yang, Jianguo
    Liu, Xiao
    Zhou, Jinqiang
    MACHINING SCIENCE AND TECHNOLOGY, 2019, 23 (01) : 131 - 152