NaviSE: superenhancer navigator integrating epigenomics signal algebra

Abstract Background Superenhancers are crucial structural genomic elements determining cell fate, and they are also involved in the determination of several diseases, such as cancer or neurodegeneration. Although there are pipelines which use independent pieces of software to predict the presence of...

Full description

Bibliographic Details
Main Authors: Alex M. Ascensión, Mikel Arrospide-Elgarresta, Ander Izeta, Marcos J. Araúzo-Bravo
Format: Article
Language:English
Published: BMC 2017-06-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-017-1698-5
id doaj-d59ca3be9fa54b21a2b182796752f05e
record_format Article
spelling doaj-d59ca3be9fa54b21a2b182796752f05e2020-11-24T21:03:02ZengBMCBMC Bioinformatics1471-21052017-06-0118111810.1186/s12859-017-1698-5NaviSE: superenhancer navigator integrating epigenomics signal algebraAlex M. Ascensión0Mikel Arrospide-Elgarresta1Ander Izeta2Marcos J. Araúzo-Bravo3Computational Biology and Systems Biomedicine, Biodonostia Health Research InstituteComputational Biology and Systems Biomedicine, Biodonostia Health Research InstituteTissue Engineering Laboratory, Bioengineering Area, Biodonostia Health Research InstituteComputational Biology and Systems Biomedicine, Biodonostia Health Research InstituteAbstract Background Superenhancers are crucial structural genomic elements determining cell fate, and they are also involved in the determination of several diseases, such as cancer or neurodegeneration. Although there are pipelines which use independent pieces of software to predict the presence of superenhancers from genome-wide chromatin marks or DNA-interaction protein binding sites, there is not yet an integrated software tool that processes automatically algebra combinations of raw data sequencing into a comprehensive final annotated report of predicted superenhancers. Results We have developed NaviSE, a user-friendly streamlined tool which performs a fully-automated parallel processing of genome-wide epigenomics data from sequencing files into a final report, built with a comprehensive set of annotated files that are navigated through a graphic user interface dynamically generated by NaviSE. NaviSE also implements an ‘epigenomics signal algebra’ that allows the combination of multiple activation and repression epigenomics signals. NaviSE provides an interactive chromosomal landscaping of the locations of superenhancers, which can be navigated to obtain annotated information about superenhancer signal profile, associated genes, gene ontology enrichment analysis, motifs of transcription factor binding sites enriched in superenhancers, graphs of the metrics evaluating the superenhancers quality, protein-protein interaction networks and enriched metabolic pathways among other features. We have parallelised the most time-consuming tasks achieving a reduction up to 30% for a 15 CPUs machine. We have optimized the default parameters of NaviSE to facilitate its use. NaviSE allows different entry levels of data processing, from sra-fastq files to bed files; and unifies the processing of multiple replicates. NaviSE outperforms the more time-consuming processes required in a non-integrated pipeline. Alongside its high performance, NaviSE is able to provide biological insights, predicting cell type specific markers, such as SOX2 and ZIC3 in embryonic stem cells, CDK5R1 and REST in neurons and CD86 and TLR2 in monocytes. Conclusions NaviSE is a user-friendly streamlined solution for superenhancer analysis, annotation and navigation, requiring only basic computer and next generation sequencing knowledge. NaviSE binaries and documentation are available at: https://sourceforge.net/projects/navise-superenhancer/ .http://link.springer.com/article/10.1186/s12859-017-1698-5SuperenhancersNext-generation sequencingParallel computingEpigenomicsComputational biologyGraphics user interface
collection DOAJ
language English
format Article
sources DOAJ
author Alex M. Ascensión
Mikel Arrospide-Elgarresta
Ander Izeta
Marcos J. Araúzo-Bravo
spellingShingle Alex M. Ascensión
Mikel Arrospide-Elgarresta
Ander Izeta
Marcos J. Araúzo-Bravo
NaviSE: superenhancer navigator integrating epigenomics signal algebra
BMC Bioinformatics
Superenhancers
Next-generation sequencing
Parallel computing
Epigenomics
Computational biology
Graphics user interface
author_facet Alex M. Ascensión
Mikel Arrospide-Elgarresta
Ander Izeta
Marcos J. Araúzo-Bravo
author_sort Alex M. Ascensión
title NaviSE: superenhancer navigator integrating epigenomics signal algebra
title_short NaviSE: superenhancer navigator integrating epigenomics signal algebra
title_full NaviSE: superenhancer navigator integrating epigenomics signal algebra
title_fullStr NaviSE: superenhancer navigator integrating epigenomics signal algebra
title_full_unstemmed NaviSE: superenhancer navigator integrating epigenomics signal algebra
title_sort navise: superenhancer navigator integrating epigenomics signal algebra
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2017-06-01
description Abstract Background Superenhancers are crucial structural genomic elements determining cell fate, and they are also involved in the determination of several diseases, such as cancer or neurodegeneration. Although there are pipelines which use independent pieces of software to predict the presence of superenhancers from genome-wide chromatin marks or DNA-interaction protein binding sites, there is not yet an integrated software tool that processes automatically algebra combinations of raw data sequencing into a comprehensive final annotated report of predicted superenhancers. Results We have developed NaviSE, a user-friendly streamlined tool which performs a fully-automated parallel processing of genome-wide epigenomics data from sequencing files into a final report, built with a comprehensive set of annotated files that are navigated through a graphic user interface dynamically generated by NaviSE. NaviSE also implements an ‘epigenomics signal algebra’ that allows the combination of multiple activation and repression epigenomics signals. NaviSE provides an interactive chromosomal landscaping of the locations of superenhancers, which can be navigated to obtain annotated information about superenhancer signal profile, associated genes, gene ontology enrichment analysis, motifs of transcription factor binding sites enriched in superenhancers, graphs of the metrics evaluating the superenhancers quality, protein-protein interaction networks and enriched metabolic pathways among other features. We have parallelised the most time-consuming tasks achieving a reduction up to 30% for a 15 CPUs machine. We have optimized the default parameters of NaviSE to facilitate its use. NaviSE allows different entry levels of data processing, from sra-fastq files to bed files; and unifies the processing of multiple replicates. NaviSE outperforms the more time-consuming processes required in a non-integrated pipeline. Alongside its high performance, NaviSE is able to provide biological insights, predicting cell type specific markers, such as SOX2 and ZIC3 in embryonic stem cells, CDK5R1 and REST in neurons and CD86 and TLR2 in monocytes. Conclusions NaviSE is a user-friendly streamlined solution for superenhancer analysis, annotation and navigation, requiring only basic computer and next generation sequencing knowledge. NaviSE binaries and documentation are available at: https://sourceforge.net/projects/navise-superenhancer/ .
topic Superenhancers
Next-generation sequencing
Parallel computing
Epigenomics
Computational biology
Graphics user interface
url http://link.springer.com/article/10.1186/s12859-017-1698-5
work_keys_str_mv AT alexmascension navisesuperenhancernavigatorintegratingepigenomicssignalalgebra
AT mikelarrospideelgarresta navisesuperenhancernavigatorintegratingepigenomicssignalalgebra
AT anderizeta navisesuperenhancernavigatorintegratingepigenomicssignalalgebra
AT marcosjarauzobravo navisesuperenhancernavigatorintegratingepigenomicssignalalgebra
_version_ 1716774453062402048