Discovery of protein-lncRNA interactions by integrating large-scale CLIP-Seq and RNA-Seq datasets

Bibliographic Details
Main Authors: Jun-Hao Li, Shun Liu, Ling-Ling Zheng, Jie Wu, Wen-Ju Sun, Ze-Lin Wang, Hui Zhou, Liang-Hu Qu, Jian-Hua Yang
Format: Article
Language: English
Published: Frontiers Media S.A. 2015-01-01
Series: Frontiers in Bioengineering and Biotechnology
Online Access: http://journal.frontiersin.org/Journal/10.3389/fbioe.2014.00088/full
Description
Summary: Long non-coding RNAs (lncRNAs) are emerging as important regulatory molecules in developmental, physiological and pathological processes. However, the precise mechanisms and functions of most lncRNAs remain largely unknown. Recent advances in high-throughput sequencing of immunoprecipitated RNAs after cross-linking (CLIP-Seq) provide powerful ways to identify biologically relevant protein–lncRNA interactions. In this study, by analyzing millions of RNA-binding protein (RBP) binding sites from 117 CLIP-Seq datasets generated by 50 independent studies, we identified 22,735 RBP-lncRNA regulatory relationships. We found that a single lncRNA is generally bound and regulated by one or multiple RBPs, the combination of which may coordinately regulate gene expression. We also revealed the expression correlation of these interaction networks by mining expression profiles of over 6000 normal and tumor samples from 14 cancer types. Our combined analysis of CLIP-Seq data and genome-wide association studies (GWAS) data identified hundreds of disease-related SNPs residing in the RBP binding sites of lncRNAs. Finally, we developed interactive web implementations to provide visualization, analysis and downloading of the aforementioned large-scale datasets. Our study represents an important step in the identification and analysis of RBP-lncRNA interactions and shows that these interactions may play crucial roles in cancer and genetic diseases.
ISSN:2296-4185