Precise strength prediction of endogenous promoters from Escherichia coli and J-series promoters by artificial intelligence

被引:6
作者
Huang, Yu-Kuan [1 ]
Yu, Chi -Hua [2 ]
Ng, I-Son [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Chem Engn, Tainan 701, Taiwan
[2] Natl Cheng Kung Univ, Dept Engn Sci, Tainan 701, Taiwan
关键词
Promoter strength; Deep learning; Conventional neural network; Sigma factor 70; TRANSCRIPTION INITIATION; ELEMENTS; DATABASE; PROTEIN; LENGTH;
D O I
10.1016/j.jtice.2023.105211
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Background: Promoter strength plays a critical role in modulating protein expression in genetic engineering. However, there are only a few studies on the strength of promoters from the comprehensive genomic database of sigma factors. To circumvent the time and resource-intensive experimental approach, artificial intelligence (AI) is considered to construct a complete database of proposed promoters from Escherichia coli, and further utilizing prediction algorithms to evaluate the promoter strength and confirmed using intensity of green fluorescent protein (GFP). Methods: The promoter database was constructed using partial information from Ecocyc, and predictive strength of the promoters was calculated via the phiSITE hunter tool. Among the 1744 promoter entries in the database were derived from E. coli MG1655, while total of 935 sigma factor 70 (sigma 70) promoters were identified. Then, the training database was applied to develop a precise tool for predicting promoter strength using machine learning and six deep learning models. The accuracy of predictions was confirmed through wet experiments conducted on endogenous and J-series promoters. Significant findings: By employing a deep learning model, particularly the Convolutional Neural Network (CNN), the promoter prediction fitness of phiSITE, which relied on traditional alignment metrics, was approved. On the other hand, phiSITE demonstrated satisfied result in the fluorescence experiments using 7 endogenous promoters, achieving an R-squared (R2) at 0.93. When applied the same model to predict the strength of J-series promoters, the best R2 achieved 0.99. Thus, CNN model represents as an effective evaluation of AI-based promoter strength.
引用
收藏
页数:8
相关论文
共 37 条
[1]   Tuning genetic control through promoter engineering [J].
Alper, H ;
Fischer, C ;
Nevoigt, E ;
Stephanopoulos, G .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (36) :12678-12683
[2]   PPred-PCKSM: A multi-layer predictor for identifying promoter and its variants using position based features [J].
Bhukya, Raju ;
Kumari, Archana ;
Amilpur, Santhosh ;
Dasari, Chandra Mohan .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2022, 97
[3]   Structures of an RNA polymerase promoter melting intermediate elucidate DNA unwinding [J].
Boyaci, Hande ;
Chen, James ;
Jansen, Rolf ;
Darst, Seth A. ;
Campbell, Elizabeth A. .
NATURE, 2019, 565 (7739) :382-+
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Tuning Promoter Strength through RNA Polymerase Binding Site Design in Escherichia coli [J].
Brewster, Robert C. ;
Jones, Daniel L. ;
Phillips, Rob .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (12)
[6]   Metabolic engineering of Escherichia coli to enhance protein production by coupling ShCAST-based optimized transposon system and CRISPR interference [J].
Chang, Chin -Wei ;
Huang, Jing-Wen ;
Lu, You-Hsuan ;
Pham, Nam Ngoc ;
Tu, Jui ;
Tung, Yen-Tzu ;
Yen, Chia-Yi ;
Tu, Yi ;
Shen, Chih-Che ;
Chien, Ming -Chen ;
Lin, Ya-Hui ;
Yang, Shu-Wei ;
Nguyen, Mai Thanh Thi ;
Pham, Dang Huu ;
Hu, Yu-Chen .
JOURNAL OF THE TAIWAN INSTITUTE OF CHEMICAL ENGINEERS, 2023, 144
[7]   Structural basis for initiation of transcription from an RNA polymerase-promoter complex [J].
Cheetham, GMT ;
Jeruzalmi, D ;
Steitz, TA .
NATURE, 1999, 399 (6731) :80-83
[8]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[9]   Programmed cell-lysis system based on hybrid sigma factor-dependent promoters [J].
Chiang, Chung-Jen ;
Chang, Chih-Hsiang ;
Chao, Yun-Peng .
JOURNAL OF THE TAIWAN INSTITUTE OF CHEMICAL ENGINEERS, 2022, 141
[10]  
Cortes C, 2012, Arxiv, DOI [arXiv:1205.2653, DOI 10.48550/ARXIV.1205.2653]