DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms

被引:19
作者
Shaw, Dipan [1 ]
Chen, Hao [1 ]
Xie, Minzhu [2 ]
Jiang, Tao [1 ,3 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
[2] Hunan Normal Univ, Coll Informat Sci & Engn, Changsha, Peoples R China
[3] Tsinghua Univ, Bioinformat Div, BNRIST Dept Comp Sci & Technol, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
LONG NONCODING RNAS; ACCURATE PREDICTION; SECONDARY STRUCTURE; IDENTIFICATION; SEQUENCE; HETESIM; ASSOCIATIONS; FRAMEWORK; TOOL;
D O I
10.1186/s12859-020-03914-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Long non-coding RNAs (lncRNAs) regulate diverse biological processes via interactions with proteins. Since the experimental methods to identify these interactions are expensive and time-consuming, many computational methods have been proposed. Although these computational methods have achieved promising prediction performance, they neglect the fact that a gene may encode multiple protein isoforms and different isoforms of the same gene may interact differently with the same lncRNA. Results: In this study, we propose a novel method, DeepLPI, for predicting the interactions between lncRNAs and protein isoforms. Our method uses sequence and structure data to extract intrinsic features and expression data to extract topological features. To combine these different data, we adopt a hybrid framework by integrating a multimodal deep learning neural network and a conditional random field. To overcome the lack of known interactions between lncRNAs and protein isoforms, we apply a multiple instance learning (MIL) approach. In our experiment concerning the human lncRNA-protein interactions in the NPInter v3.0 database, DeepLPI improved the prediction performance by 4.7% in term of AUC and 5.9% in term of AUPRC over the state-of-the-art methods. Our further correlation analyses between interactive lncRNAs and protein isoforms also illustrated that their co-expression information helped predict the interactions. Finally, we give some examples where DeepLPI was able to outperform the other methods in predicting mouse lncRNA-protein interactions and novel human lncRNA-protein interactions. Conclusion: Our results demonstrated that the use of isoforms and MIL contributed significantly to the improvement of performance in predicting lncRNA and protein interactions. We believe that such an approach would find more applications in predicting other functional roles of RNAs and proteins.
引用
收藏
页数:22
相关论文
共 74 条
[1]   rpiCOOL: A tool for In Silico RNA-protein interaction detection using random forest [J].
Akbaripour-Elahabad, Mohammad ;
Zahiri, Javad ;
Rafeh, Reza ;
Eslami, Morteza ;
Azari, Mahboobeh .
JOURNAL OF THEORETICAL BIOLOGY, 2016, 402 :1-8
[2]  
ANDREWS S, 2002, MULTIPLE INSTANCE LE, P943
[3]   Predicting protein associations with long noncoding RNAs [J].
Bellucci, Matteo ;
Agostini, Federico ;
Masin, Marianela ;
Tartaglia, Gian Gaetano .
NATURE METHODS, 2011, 8 (06) :444-445
[4]  
Bengio Y, 2001, ADV NEUR IN, V13, P932
[5]   A systematic study of the class imbalance problem in convolutional neural networks [J].
Buda, Mateusz ;
Maki, Atsuto ;
Mazurowski, Maciej A. .
NEURAL NETWORKS, 2018, 106 :249-259
[6]   GOssTo: a stand-alone application and a web tool for calculating semantic similarities on the Gene Ontology [J].
Caniza, Horacio ;
Romero, Alfonso E. ;
Heron, Samuel ;
Yang, Haixuan ;
Devoto, Alessandra ;
Frasca, Marco ;
Mesiti, Marco ;
Valentini, Giorgio ;
Paccanaro, Alberto .
BIOINFORMATICS, 2014, 30 (15) :2235-2236
[7]   DIFFUSE: predicting isoform functions from sequences and expression profiles via deep learning [J].
Chen, Hao ;
Shaw, Dipan ;
Zeng, Jianyang ;
Bu, Dongbo ;
Jiang, Tao .
BIOINFORMATICS, 2019, 35 (14) :I284-I294
[8]   DM-RPIs: Predicting ncRNA-protein interactions using stacked ensembling strategy [J].
Cheng, Shuping ;
Zhang, Lu ;
Tan, Jianjun ;
Gong, Weikang ;
Li, Chunhua ;
Zhang, Xiaoyi .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 83
[9]  
Chollet F., 2015, Keras
[10]   Accurate prediction of protein-lncRNA interactions by diffusion and HeteSim features across heterogeneous network [J].
Deng, Lei ;
Wang, Junqiang ;
Xiao, Yun ;
Wang, Zixiang ;
Liu, Hui .
BMC BIOINFORMATICS, 2018, 19