rSeqTU-A Machine-Learning Based R Package for Prediction of Bacteria Transcription Units

被引：5

作者：

Niu, Sheng-Yong ^{[1
]}

Liu, Binqiang ^{[2
]}

Ma, Qin ^{[3
]}

Chou, Wen-Chi ^{[4
]}

机构：

[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA

[2] Shandong Univ, Sch Math, Jinan, Shandong, Peoples R China

[3] Ohio State Univ, Coll Med, Biomed Informat, Columbus, OH 43210 USA

[4] Broad Inst MIT & Harvard, Infect Dis & Microbiome Program, Cambridge, MA 02142 USA

来源：

FRONTIERS IN GENETICS | 2019年 / 10卷

基金：

美国国家科学基金会;

关键词：

machine learning; bacteria; transcription unit; R package; transcriptome;

D O I：

10.3389/fgene.2019.00374

中图分类号：

Q3 [遗传学];

学科分类号：

071007 ; 090102 ;

摘要：

A transcription unit (TU) is composed of one or multiple adjacent genes on the same strand that are co-transcribed in mostly prokaryotes. Accurate identification of TUs is a crucial first step to delineate the transcriptional regulatory networks and elucidate the dynamic regulatory mechanisms encoded in various prokaryotic genomes. Many genomic features, for example, gene intergenic distance, and transcriptomic features including continuous and stable RNA-seq reads count signals, have been collected from a large amount of experimental data and integrated into classification techniques to computationally predict genome-wide TUs. Although some tools and web servers are able to predict TUs based on bacterial RNA-seq data and genome sequences, there is a need to have an improved machine learning prediction approach and a better comprehensive pipeline handling QC, TU prediction, and TU visualization. To enable users to efficiently perform TU identification on their local computers or high-performance clusters and provide a more accurate prediction, we develop an R package, named rSeqTU. rSeqTU uses a random forest algorithm to select essential features describing TUs and then uses support vector machine (SVM) to build TU prediction models. rSeqTU (available at https://s18692001.githubio/rSeqTU/) has six computational functionalities including read quality control, read mapping, training set generation, random forest-based feature selection, TU prediction, and TU visualization.

引用

页数：6

共 50 条

[1] Machine-Learning Based TCP Security Action Prediction
Zhao, Quanling
Sun, Jiawei
Ren, Hongjia
Sun, Guodong
2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1325 - 1329
[2] Prediction of Nucleophilicity and Electrophilicity Based on a Machine-Learning Approach
Liu, Yidi
Yang, Qi
Cheng, Junjie
Zhang, Long
Luo, Sanzhong
Cheng, Jin-Pei
CHEMPHYSCHEM, 2023, 24 (14)
[3] Machine-Learning Aided Peer Prediction
Liu, Yang
Chen, Yiling
EC'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2017, : 63 - 80
[4] Prediction of cholinergic compounds by machine-learning
Wijeyesakere S.J.
Wilson D.M.
Sue Marty M.
Wilson, Daniel M. (MWilson3@dow.com), 1600, Elsevier B.V. (13):
[5] Development and Validation of a Machine-Learning Model for Prediction of Extubation Failure in Intensive Care Units
Zhao, Qin-Yu
Wang, Huan
Luo, Jing-Chao
Luo, Ming-Hao
Liu, Le-Ping
Yu, Shen-Ji
Liu, Kai
Zhang, Yi-Jie
Sun, Peng
Tu, Guo-Wei
Luo, Zhe
FRONTIERS IN MEDICINE, 2021, 8
[6] Machine-learning based prediction of crash response of tubular structures
Sakaridis, Emmanouil
Karathanasopoulos, Nikolaos
Mohr, Dirk
INTERNATIONAL JOURNAL OF IMPACT ENGINEERING, 2022, 166
[7] Machine-learning based VMAF prediction for HDR video content
Mueller, Christoph
Steglich, Stephan
Gross, Sandra
Kremer, Paul
PROCEEDINGS OF THE 2023 PROCEEDINGS OF THE 14TH ACM MULTIMEDIA SYSTEMS CONFERENCE, MMSYS 2023, 2023, : 328 - 332
[8] Development and validation of a machine-learning model for prediction of hypoxemia after extubation in intensive care units
Xia, Ming
Jin, Chenyu
Cao, Shuang
Pei, Bei
Wang, Jie
Xu, Tianyi
Jiang, Hong
ANNALS OF TRANSLATIONAL MEDICINE, 2022, 10 (10)
[9] Atom-centered machine-learning force field package
Li, Lei
Ciufo, Ryan A.
Lee, Jiyoung
Zhou, Chuan
Lin, Bo
Cho, Jaeyoung
Katyal, Naman
Henkelman, Graeme
COMPUTER PHYSICS COMMUNICATIONS, 2023, 292
[10] Machine-Learning Based Prediction Model for Prognosis of IgA Nephropathy Patients
Park, Sehoon
Koh, Eun Sil
Baek, Chung Hee
Kim, Yong Chul
Lee, Jung Pyo
Kim, Dong Ki
Han, Seung Hyeok
Chin, Ho Jun
Joo, Kwon Wook
Kim, Yon Su
Lee, Hajeong
JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2022, 33 (11): : 800 - 801

← 1 2 3 4 5 →