rSeqTU-A Machine-Learning Based R Package for Prediction of Bacteria Transcription Units

被引:5
|
作者
Niu, Sheng-Yong [1 ]
Liu, Binqiang [2 ]
Ma, Qin [3 ]
Chou, Wen-Chi [4 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Shandong Univ, Sch Math, Jinan, Shandong, Peoples R China
[3] Ohio State Univ, Coll Med, Biomed Informat, Columbus, OH 43210 USA
[4] Broad Inst MIT & Harvard, Infect Dis & Microbiome Program, Cambridge, MA 02142 USA
基金
美国国家科学基金会;
关键词
machine learning; bacteria; transcription unit; R package; transcriptome;
D O I
10.3389/fgene.2019.00374
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
A transcription unit (TU) is composed of one or multiple adjacent genes on the same strand that are co-transcribed in mostly prokaryotes. Accurate identification of TUs is a crucial first step to delineate the transcriptional regulatory networks and elucidate the dynamic regulatory mechanisms encoded in various prokaryotic genomes. Many genomic features, for example, gene intergenic distance, and transcriptomic features including continuous and stable RNA-seq reads count signals, have been collected from a large amount of experimental data and integrated into classification techniques to computationally predict genome-wide TUs. Although some tools and web servers are able to predict TUs based on bacterial RNA-seq data and genome sequences, there is a need to have an improved machine learning prediction approach and a better comprehensive pipeline handling QC, TU prediction, and TU visualization. To enable users to efficiently perform TU identification on their local computers or high-performance clusters and provide a more accurate prediction, we develop an R package, named rSeqTU. rSeqTU uses a random forest algorithm to select essential features describing TUs and then uses support vector machine (SVM) to build TU prediction models. rSeqTU (available at https://s18692001.githubio/rSeqTU/) has six computational functionalities including read quality control, read mapping, training set generation, random forest-based feature selection, TU prediction, and TU visualization.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Machine-Learning Based TCP Security Action Prediction
    Zhao, Quanling
    Sun, Jiawei
    Ren, Hongjia
    Sun, Guodong
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1325 - 1329
  • [2] Prediction of Nucleophilicity and Electrophilicity Based on a Machine-Learning Approach
    Liu, Yidi
    Yang, Qi
    Cheng, Junjie
    Zhang, Long
    Luo, Sanzhong
    Cheng, Jin-Pei
    CHEMPHYSCHEM, 2023, 24 (14)
  • [3] Machine-Learning Aided Peer Prediction
    Liu, Yang
    Chen, Yiling
    EC'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2017, : 63 - 80
  • [4] Prediction of cholinergic compounds by machine-learning
    Wijeyesakere S.J.
    Wilson D.M.
    Sue Marty M.
    Wilson, Daniel M. (MWilson3@dow.com), 1600, Elsevier B.V. (13):
  • [5] Development and Validation of a Machine-Learning Model for Prediction of Extubation Failure in Intensive Care Units
    Zhao, Qin-Yu
    Wang, Huan
    Luo, Jing-Chao
    Luo, Ming-Hao
    Liu, Le-Ping
    Yu, Shen-Ji
    Liu, Kai
    Zhang, Yi-Jie
    Sun, Peng
    Tu, Guo-Wei
    Luo, Zhe
    FRONTIERS IN MEDICINE, 2021, 8
  • [6] Machine-learning based prediction of crash response of tubular structures
    Sakaridis, Emmanouil
    Karathanasopoulos, Nikolaos
    Mohr, Dirk
    INTERNATIONAL JOURNAL OF IMPACT ENGINEERING, 2022, 166
  • [7] Machine-learning based VMAF prediction for HDR video content
    Mueller, Christoph
    Steglich, Stephan
    Gross, Sandra
    Kremer, Paul
    PROCEEDINGS OF THE 2023 PROCEEDINGS OF THE 14TH ACM MULTIMEDIA SYSTEMS CONFERENCE, MMSYS 2023, 2023, : 328 - 332
  • [8] Development and validation of a machine-learning model for prediction of hypoxemia after extubation in intensive care units
    Xia, Ming
    Jin, Chenyu
    Cao, Shuang
    Pei, Bei
    Wang, Jie
    Xu, Tianyi
    Jiang, Hong
    ANNALS OF TRANSLATIONAL MEDICINE, 2022, 10 (10)
  • [9] Atom-centered machine-learning force field package
    Li, Lei
    Ciufo, Ryan A.
    Lee, Jiyoung
    Zhou, Chuan
    Lin, Bo
    Cho, Jaeyoung
    Katyal, Naman
    Henkelman, Graeme
    COMPUTER PHYSICS COMMUNICATIONS, 2023, 292
  • [10] Machine-Learning Based Prediction Model for Prognosis of IgA Nephropathy Patients
    Park, Sehoon
    Koh, Eun Sil
    Baek, Chung Hee
    Kim, Yong Chul
    Lee, Jung Pyo
    Kim, Dong Ki
    Han, Seung Hyeok
    Chin, Ho Jun
    Joo, Kwon Wook
    Kim, Yon Su
    Lee, Hajeong
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2022, 33 (11): : 800 - 801