LightGBM-LncLoc: A LightGBM-Based Computational Predictor for Recognizing Long Non-Coding RNA Subcellular Localization

被引:16
作者
Lyu, Jianyi [1 ]
Zheng, Peijie [1 ]
Qi, Yue [1 ]
Huang, Guohua [1 ]
机构
[1] Shaoyang Univ, Sch Informat Engn, Shaoyang 422000, Peoples R China
基金
中国国家自然科学基金;
关键词
lncRNA; subcellular localization; lightGBM; reverse complement k-mer; machine learning; CD-HIT; PROTEIN; GENOME;
D O I
10.3390/math11030602
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Long non-coding RNAs (lncRNA) are a class of RNA transcripts with more than 200 nucleotide residues. LncRNAs play versatile roles in cellular processes and are thus becoming a hot topic in the field of biomedicine. The function of lncRNAs was discovered to be closely associated with subcellular localization. Although many methods have been developed to identify the subcellular localization of lncRNAs, there still is much room for improvement. Herein, we present a lightGBM-based computational predictor for recognizing lncRNA subcellular localization, which is called LightGBM-LncLoc. LightGBM-LncLoc uses reverse complement k-mer and position-specific trinucleotide propensity based on the single strand for multi-class sequences to encode LncRNAs and employs LightGBM as the learning algorithm. LightGBM-LncLoc reaches state-of-the-art performance by five-fold cross-validation and independent test over the datasets of five categories of lncRNA subcellular localization. We also implemented LightGBM-LncLoc as a user-friendly web server.
引用
收藏
页数:13
相关论文
共 54 条
[51]   Role of lncRNA LUCAT1 in cancer [J].
Xing, Ce ;
Sun, Shou-gang ;
Yue, Zhi-Quan ;
Bai, Feng .
BIOMEDICINE & PHARMACOTHERAPY, 2021, 134
[52]   Deep4mC: systematic assessment and computational prediction for DNA N4-methylcytosine sites by deep learning [J].
Xu, Haodong ;
Jia, Peilin ;
Zhao, Zhongming .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
[53]   DeepLncLoc: a deep learning framework for long non-coding RNA subcellular localization prediction based on subsequence embedding [J].
Zeng, Min ;
Wu, Yifan ;
Lu, Chengqian ;
Zhang, Fuhao ;
Wu, Fang-Xiang ;
Li, Min .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
[54]   RNALocate: a resource for RNA subcellular localizations [J].
Zhang, Ting ;
Tan, Puwen ;
Wang, Liqiang ;
Jin, Nana ;
Li, Yana ;
Zhang, Lin ;
Yang, Huan ;
Hu, Zhenyu ;
Zhang, Lining ;
Hu, Chunyu ;
Li, Chunhua ;
Qian, Kun ;
Zhang, Changjian ;
Huang, Yan ;
Li, Kongning ;
Lin, Hao ;
Wang, Dong .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D135-D138