SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes

Bibliographic Details
Main Authors: Nadia M. Davidson, Anthony D. K. Hawkins, Alicia Oshlack
Format: Article
Language: English
Published: BMC 2017-08-01
Series: Genome Biology
Online Access: http://link.springer.com/article/10.1186/s13059-017-1284-1
Description
Summary: Numerous methods have been developed to analyse RNA sequencing (RNA-seq) data, but most rely on the availability of a reference genome, making them unsuitable for non-model organisms. Here we present superTranscripts, a substitute for a reference genome, where each gene with multiple transcripts is represented by a single sequence. The Lace software is provided to construct superTranscripts from any set of transcripts, including de novo assemblies. We demonstrate how superTranscripts enable visualisation, variant detection and differential isoform detection in non-model organisms. We further use Lace to combine reference and assembled transcriptomes for chicken and recover hundreds of gaps in the reference genome.
ISSN: 1474-760X