DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms

Background: Long non-coding RNAs (lncRNAs) regulate diverse biological processes via interactions with proteins. Since the experimental methods to identify these interactions are expensive and time-consuming, many computational methods have been proposed. Although these computational methods have ac...

Full description

Bibliographic Details
Main Authors: Chen, H. (Author), Jiang, T. (Author), Shaw, D. (Author), Xie, M. (Author)
Format: Article
Language:English
Published: BioMed Central Ltd 2021
Subjects:
Online Access:View Fulltext in Publisher
LEADER 03437nam a2200505Ia 4500
001 10.1186-s12859-020-03914-7
008 220427s2021 CNT 000 0 und d
020 |a 14712105 (ISSN) 
245 1 0 |a DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms 
260 0 |b BioMed Central Ltd  |c 2021 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1186/s12859-020-03914-7 
520 3 |a Background: Long non-coding RNAs (lncRNAs) regulate diverse biological processes via interactions with proteins. Since the experimental methods to identify these interactions are expensive and time-consuming, many computational methods have been proposed. Although these computational methods have achieved promising prediction performance, they neglect the fact that a gene may encode multiple protein isoforms and different isoforms of the same gene may interact differently with the same lncRNA. Results: In this study, we propose a novel method, DeepLPI, for predicting the interactions between lncRNAs and protein isoforms. Our method uses sequence and structure data to extract intrinsic features and expression data to extract topological features. To combine these different data, we adopt a hybrid framework by integrating a multimodal deep learning neural network and a conditional random field. To overcome the lack of known interactions between lncRNAs and protein isoforms, we apply a multiple instance learning (MIL) approach. In our experiment concerning the human lncRNA-protein interactions in the NPInter v3.0 database, DeepLPI improved the prediction performance by 4.7% in term of AUC and 5.9% in term of AUPRC over the state-of-the-art methods. Our further correlation analyses between interactive lncRNAs and protein isoforms also illustrated that their co-expression information helped predict the interactions. Finally, we give some examples where DeepLPI was able to outperform the other methods in predicting mouse lncRNA-protein interactions and novel human lncRNA-protein interactions. Conclusion: Our results demonstrated that the use of isoforms and MIL contributed significantly to the improvement of performance in predicting lncRNA and protein interactions. We believe that such an approach would find more applications in predicting other functional roles of RNAs and proteins. © 2021, The Author(s). 
650 0 4 |a animal 
650 0 4 |a Animals 
650 0 4 |a biology 
650 0 4 |a Computational Biology 
650 0 4 |a Computational methods 
650 0 4 |a Conditional random field 
650 0 4 |a Correlation analysis 
650 0 4 |a Deep learning 
650 0 4 |a Deep Learning 
650 0 4 |a Deep neural networks 
650 0 4 |a Experimental methods 
650 0 4 |a Forecasting 
650 0 4 |a Genes 
650 0 4 |a genetics 
650 0 4 |a isoprotein 
650 0 4 |a Learning neural networks 
650 0 4 |a Learning systems 
650 0 4 |a long untranslated RNA 
650 0 4 |a Mice 
650 0 4 |a mouse 
650 0 4 |a Multiple-instance learning 
650 0 4 |a Neural Networks, Computer 
650 0 4 |a Prediction performance 
650 0 4 |a Protein Isoforms 
650 0 4 |a Proteins 
650 0 4 |a RNA, Long Noncoding 
650 0 4 |a State-of-the-art methods 
650 0 4 |a Topological features 
700 1 |a Chen, H.  |e author 
700 1 |a Jiang, T.  |e author 
700 1 |a Shaw, D.  |e author 
700 1 |a Xie, M.  |e author 
773 |t BMC Bioinformatics