Sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding RNAs (lincRNAs)--application in human skeletal muscle cells.

Ab initio assembly of transcriptome sequencing data has been widely used to identify large intergenic non-coding RNAs (lincRNAs), a novel class of gene regulators involved in many biological processes. To differentiate real lincRNA transcripts from thousands of assembly artifacts, a series of filter...

Full description

Bibliographic Details
Main Authors: Kun Sun, Yu Zhao, Huating Wang, Hao Sun
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3882232?pdf=render
id doaj-acfcdeac069b4ee3aef2d2bf15adaa35
record_format Article
spelling doaj-acfcdeac069b4ee3aef2d2bf15adaa352020-11-24T21:55:23ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0191e8450010.1371/journal.pone.0084500Sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding RNAs (lincRNAs)--application in human skeletal muscle cells.Kun SunYu ZhaoHuating WangHao SunAb initio assembly of transcriptome sequencing data has been widely used to identify large intergenic non-coding RNAs (lincRNAs), a novel class of gene regulators involved in many biological processes. To differentiate real lincRNA transcripts from thousands of assembly artifacts, a series of filtering steps such as filters of transcript length, expression level and coding potential, need to be applied. However, an easy-to-use and publicly available bioinformatics pipeline that integrates these filters is not yet available. Hence, we implemented sebnif, an integrative bioinformatics pipeline to facilitate the discovery of bona fide novel lincRNAs that are suitable for further functional characterization. Specifically, sebnif is the only pipeline that implements an algorithm for identifying high-quality single-exonic lincRNAs that were often omitted in many studies. To demonstrate the usage of sebnif, we applied it on a real biological RNA-seq dataset from Human Skeletal Muscle Cells (HSkMC) and built a novel lincRNA catalog containing 917 highly reliable lincRNAs. Sebnif is available at http://sunlab.lihs.cuhk.edu.hk/sebnif/.http://europepmc.org/articles/PMC3882232?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Kun Sun
Yu Zhao
Huating Wang
Hao Sun
spellingShingle Kun Sun
Yu Zhao
Huating Wang
Hao Sun
Sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding RNAs (lincRNAs)--application in human skeletal muscle cells.
PLoS ONE
author_facet Kun Sun
Yu Zhao
Huating Wang
Hao Sun
author_sort Kun Sun
title Sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding RNAs (lincRNAs)--application in human skeletal muscle cells.
title_short Sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding RNAs (lincRNAs)--application in human skeletal muscle cells.
title_full Sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding RNAs (lincRNAs)--application in human skeletal muscle cells.
title_fullStr Sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding RNAs (lincRNAs)--application in human skeletal muscle cells.
title_full_unstemmed Sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding RNAs (lincRNAs)--application in human skeletal muscle cells.
title_sort sebnif: an integrated bioinformatics pipeline for the identification of novel large intergenic noncoding rnas (lincrnas)--application in human skeletal muscle cells.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description Ab initio assembly of transcriptome sequencing data has been widely used to identify large intergenic non-coding RNAs (lincRNAs), a novel class of gene regulators involved in many biological processes. To differentiate real lincRNA transcripts from thousands of assembly artifacts, a series of filtering steps such as filters of transcript length, expression level and coding potential, need to be applied. However, an easy-to-use and publicly available bioinformatics pipeline that integrates these filters is not yet available. Hence, we implemented sebnif, an integrative bioinformatics pipeline to facilitate the discovery of bona fide novel lincRNAs that are suitable for further functional characterization. Specifically, sebnif is the only pipeline that implements an algorithm for identifying high-quality single-exonic lincRNAs that were often omitted in many studies. To demonstrate the usage of sebnif, we applied it on a real biological RNA-seq dataset from Human Skeletal Muscle Cells (HSkMC) and built a novel lincRNA catalog containing 917 highly reliable lincRNAs. Sebnif is available at http://sunlab.lihs.cuhk.edu.hk/sebnif/.
url http://europepmc.org/articles/PMC3882232?pdf=render
work_keys_str_mv AT kunsun sebnifanintegratedbioinformaticspipelinefortheidentificationofnovellargeintergenicnoncodingrnaslincrnasapplicationinhumanskeletalmusclecells
AT yuzhao sebnifanintegratedbioinformaticspipelinefortheidentificationofnovellargeintergenicnoncodingrnaslincrnasapplicationinhumanskeletalmusclecells
AT huatingwang sebnifanintegratedbioinformaticspipelinefortheidentificationofnovellargeintergenicnoncodingrnaslincrnasapplicationinhumanskeletalmusclecells
AT haosun sebnifanintegratedbioinformaticspipelinefortheidentificationofnovellargeintergenicnoncodingrnaslincrnasapplicationinhumanskeletalmusclecells
_version_ 1725863019491098624