LncRNA–protein interaction prediction with reweighted feature selection

被引:0
作者
Guohao Lv
Yingchun Xia
Zhao Qi
Zihao Zhao
Lianggui Tang
Cheng Chen
Shuai Yang
Qingyong Wang
Lichuan Gu
机构
[1] Anhui Agricultural University,School of Information and Computer
来源
BMC Bioinformatics | / 24卷
关键词
LncRNA–protein prediction; Protein sequence; Feature selection; Boosting; Reweighting;
D O I
暂无
中图分类号
学科分类号
摘要
LncRNA–protein interactions are ubiquitous in organisms and play a crucial role in a variety of biological processes and complex diseases. Many computational methods have been reported for lncRNA–protein interaction prediction. However, the experimental techniques to detect lncRNA–protein interactions are laborious and time-consuming. Therefore, to address this challenge, this paper proposes a reweighting boosting feature selection (RBFS) method model to select key features. Specially, a reweighted apporach can adjust the contribution of each observational samples to learning model fitting; let higher weights are given more influence samples than those with lower weights. Feature selection with boosting can efficiently rank to iterate over important features to obtain the optimal feature subset. Besides, in the experiments, the RBFS method is applied to the prediction of lncRNA–protein interactions. The experimental results demonstrate that our method achieves higher accuracy and less redundancy with fewer features.
引用
收藏
相关论文
共 148 条
[1]  
Guttman M(2012)Modular regulatory principles of large non-coding RNAs Nature 482 339-346
[2]  
Rinn JL(2014)A survey of computational intelligence techniques in protein function prediction Int. J. Proteomics 2014 1298-1307
[3]  
Tiwari A(2013)Long noncoding RNAs: cellular address codes in development and disease Cell 152 20497-20502
[4]  
Srivastava R(2011)The genomic binding sites of a noncoding RNA Proc Natl Acad Sci 108 2241-2251
[5]  
Batista PJ(2015)Computational approaches towards understanding human long non-coding RNA biology Bioinformatics 31 1-10
[6]  
Chang HY(2013)Computational prediction of associations between long non-coding RNAs and proteins BMC Genomics 14 13486-13496
[7]  
Simon MD(2018)SFPEL-LPI: sequence-based feature projection ensemble learning for predicting LncRNA–protein interactions PLoS Comput Biol 14 1-22
[8]  
Wang CI(2019)LPI-KTASLP: prediction of LncRNA–protein interaction by semi-supervised link learning with multivariate information IEEE Access 7 276-284
[9]  
Kharchenko PV(2021)DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms BMC Bioinform 22 1136672-2116
[10]  
West JA(2022)Feature selection methods on gene expression microarray data for cancer classification: a systematic review Comput Biol Med 140 423-D165