Velo-Predictor: an ensemble learning pipeline for RNA velocity prediction

被引:3
|
作者
Wang, Xin [1 ]
Zheng, Jie [1 ]
机构
[1] ShanghaiTech Univ, Sch Informat Sci & Technol, 393 Middle Huaxia Rd, Shanghai 201210, Peoples R China
关键词
RNA velocity; Single cell; Ensemble learning; Landscape; SINGLE; SMOTE;
D O I
10.1186/s12859-021-04330-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background RNA velocity is a novel and powerful concept which enables the inference of dynamical cell state changes from seemingly static single-cell RNA sequencing (scRNA-seq) data. However, accurate estimation of RNA velocity is still a challenging problem, and the underlying kinetic mechanisms of transcriptional and splicing regulations are not fully clear. Moreover, scRNA-seq data tend to be sparse compared with possible cell states, and a given dataset of estimated RNA velocities needs imputation for some cell states not yet covered. Results We formulate RNA velocity prediction as a supervised learning problem of classification for the first time, where a cell state space is divided into equal-sized segments by directions as classes, and the estimated RNA velocity vectors are considered as ground truth. We propose Velo-Predictor, an ensemble learning pipeline for predicting RNA velocities from scRNA-seq data. We test different models on two real datasets, Velo-Predictor exhibits good performance, especially when XGBoost was used as the base predictor. Parameter analysis and visualization also show that the method is robust and able to make biologically meaningful predictions. Conclusion The accurate result shows that Velo-Predictor can effectively simplify the procedure by learning a predictive model from gene expression data, which could help to construct a continous landscape and give biologists an intuitive picture about the trend of cellular dynamics.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Velo-Predictor: an ensemble learning pipeline for RNA velocity prediction
    Xin Wang
    Jie Zheng
    BMC Bioinformatics, 22
  • [2] Air Quality Prediction using Recurrent Air Quality Predictor with Ensemble Learning
    Padilla, Dionis A.
    Magwili, Glenn, V
    Mercado, Luis Benjamin Z.
    Reyes, Jean Tristan L.
    2020 IEEE 12TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT, AND MANAGEMENT (HNICEM), 2020,
  • [3] EMDLP: Ensemble multiscale deep learning model for RNA methylation site prediction
    Wang, Honglei
    Liu, Hui
    Huang, Tao
    Li, Gangshen
    Zhang, Lin
    Sun, Yanjing
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [4] Ensemble Learning for Rainfall Prediction
    Sani N.S.
    Rahman A.H.A.
    Adam A.
    Shlash I.
    Aliff M.
    International Journal of Advanced Computer Science and Applications, 2020, 11 (11): : 153 - 162
  • [5] Ensemble Learning for Rainfall Prediction
    Sani, Nor Samsiah
    Abd Rahman, Abdul Hadi
    Adam, Afzan
    Shlash, Israa
    Aliff, Mohd
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (11) : 153 - 162
  • [6] Physics informed ensemble learning used for interval prediction of fracture toughness of pipeline steels in hydrogen environments
    Zhang, Yingjie
    Ai, Yibo
    Zhang, Weidong
    THEORETICAL AND APPLIED FRACTURE MECHANICS, 2024, 130
  • [7] Cascade Forest-Based Model for Prediction of RNA Velocity
    Zeng, Zhiliang
    Zhao, Shouwei
    Peng, Yu
    Hu, Xiang
    Yin, Zhixiang
    MOLECULES, 2022, 27 (22):
  • [8] A Stacking-Based Ensemble Learning Predictor Combined with Particle Swarm Optimizer for Identifying RNA Pseudouridine Sites
    Wang, Xiao
    Li, Pengfei
    Han, Lijun
    Wang, Rong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT III, 2023, 14088 : 521 - 531
  • [9] Repurchase Prediction Based on Ensemble Learning
    Xu, Danqi
    Yang, Wenyin
    Ma, Li
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 1317 - 1322
  • [10] Prediction of HDD Failures by Ensemble Learning
    Li, Qiang
    Li, Hui
    Zhang, Kai
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 237 - 240